Abstract
Cluster ensembles or consensus clusterings have been shown to be better than any standard clustering algorithm at improving accuracy and robustness across various sets of data. This meta-learning formalism also helps users to overcome the dilemma of selecting an appropriate technique and the parameters for that technique. Since founded, different research areas have emerged with the common purpose of enhancing the effectiveness and applicability of cluster ensembles. These include the selection of ensemble members, the imputation of missing values, and the summarization of ensemble members. In particular, this paper is set to provide the review of different matrix refinement approaches that have been recently proposed in the literature for summarizing information of multiple clusterings. With various benchmark datasets and quality measures, the comparative study of these novel techniques is carried out to provide empirical findings from which a practical guideline can be drawn.
Original language | English |
---|---|
Pages (from-to) | 269-300 |
Number of pages | 32 |
Journal | Machine Learning |
Volume | 98 |
Issue number | 1-2 |
Early online date | 10 Apr 2013 |
DOIs | |
Publication status | Published - 01 Jan 2015 |
Externally published | Yes |
Keywords
- Cluster ensemble
- Information matrix
- Multiple clusterings
- Summarization