TY - JOUR
T1 - Cluster ensembles
T2 - A survey of approaches with recent extensions and applications
AU - Boongoen, Tossapon
AU - Iam-On, Natthakan
N1 - Funding Information:
This work is funded by ST/P005594/1 - Newton STFC-NARIT : Using astronomy surveys to train Thai researchers in Big Data analysis.
Publisher Copyright:
© 2018 Elsevier Inc.
PY - 2018/5/1
Y1 - 2018/5/1
N2 - Cluster ensembles have been shown to be better than any standard clustering algorithm at improving accuracy and robustness across different data collections. This meta-learning formalism also helps users to overcome the dilemma of selecting an appropriate technique and the corresponding parameters, given a set of data to be investigated. Almost two decades after the first publication of a kind, the method has proven effective for many problem domains, especially microarray data analysis and its down-streaming applications. Recently, it has been greatly extended both in terms of theoretical modelling and deployment to problem solving. The survey attempts to match this emerging attention with the provision of fundamental basis and theoretical details of state-of-the-art methods found in the present literature. It yields the ranges of ensemble generation strategies, summarization and representation of ensemble members, as well as the topic of consensus clustering. This review also includes different applications and extensions of cluster ensemble, with several research issues and challenges being highlighted.
AB - Cluster ensembles have been shown to be better than any standard clustering algorithm at improving accuracy and robustness across different data collections. This meta-learning formalism also helps users to overcome the dilemma of selecting an appropriate technique and the corresponding parameters, given a set of data to be investigated. Almost two decades after the first publication of a kind, the method has proven effective for many problem domains, especially microarray data analysis and its down-streaming applications. Recently, it has been greatly extended both in terms of theoretical modelling and deployment to problem solving. The survey attempts to match this emerging attention with the provision of fundamental basis and theoretical details of state-of-the-art methods found in the present literature. It yields the ranges of ensemble generation strategies, summarization and representation of ensemble members, as well as the topic of consensus clustering. This review also includes different applications and extensions of cluster ensemble, with several research issues and challenges being highlighted.
KW - Cluster ensemble
KW - Data clustering
KW - Domain specific application
KW - Theoretical extension
UR - http://www.scopus.com/inward/record.url?scp=85047607959&partnerID=8YFLogxK
U2 - 10.1016/j.cosrev.2018.01.003
DO - 10.1016/j.cosrev.2018.01.003
M3 - Review Article
AN - SCOPUS:85047607959
SN - 1574-0137
VL - 28
SP - 1
EP - 25
JO - Computer Science Review
JF - Computer Science Review
ER -