TY - GEN
T1 - Revisiting link-based cluster ensembles for microarray data classification
AU - Iam-On, Natthakan
AU - Boongoen, Tossapon
PY - 2013
Y1 - 2013
N2 - Cancer has been identified as the leading cause of death. It is predicted that around 20-26 million people will be diagnosed with cancer by 2020. With this alarming rate, there is an urgent need for a more effective methodology to understand, prevent and cure cancer. Microarray technology provides a useful basis of achieving this ultimate goal. In particular to cancer research, it has become almost routine to create gene expression profiles, which can discriminate patients into good and poor prognosis groups, and identify possible tumor subtypes. This classification or predictive model offers a useful tool for individualized treatment of disease. However, the accuracy of existing classifiers have been constrained by the curse of dimensionality typically observed in microarray data. In addition to gene selection, one may transform the original data to another variation, where only key gene components are included. Unlike conventional transformation-based techniques found in the literature, this paper presents a novel method that makes use of cluster ensembles, specifically the summarizing information matrix, as the transformed data for the following classification step. Among different state-of-the-art methods, the link-based cluster ensemble approach (LCE) provides a highly accurate clustering, and thus particularly employed here. The performance of this transformation model is evaluated on published microarray datasets and C4.5, in comparison with benchmark techniques. The findings suggest that the new model can improve the classification accuracy of original data and performs better than the other transformation methods investigated in the empirical study.
AB - Cancer has been identified as the leading cause of death. It is predicted that around 20-26 million people will be diagnosed with cancer by 2020. With this alarming rate, there is an urgent need for a more effective methodology to understand, prevent and cure cancer. Microarray technology provides a useful basis of achieving this ultimate goal. In particular to cancer research, it has become almost routine to create gene expression profiles, which can discriminate patients into good and poor prognosis groups, and identify possible tumor subtypes. This classification or predictive model offers a useful tool for individualized treatment of disease. However, the accuracy of existing classifiers have been constrained by the curse of dimensionality typically observed in microarray data. In addition to gene selection, one may transform the original data to another variation, where only key gene components are included. Unlike conventional transformation-based techniques found in the literature, this paper presents a novel method that makes use of cluster ensembles, specifically the summarizing information matrix, as the transformed data for the following classification step. Among different state-of-the-art methods, the link-based cluster ensemble approach (LCE) provides a highly accurate clustering, and thus particularly employed here. The performance of this transformation model is evaluated on published microarray datasets and C4.5, in comparison with benchmark techniques. The findings suggest that the new model can improve the classification accuracy of original data and performs better than the other transformation methods investigated in the empirical study.
KW - Classification
KW - Cluster ensembles
KW - Link-based similarity
KW - Microarray data
UR - http://www.scopus.com/inward/record.url?scp=84893625451&partnerID=8YFLogxK
U2 - 10.1109/SMC.2013.773
DO - 10.1109/SMC.2013.773
M3 - Conference Proceeding (Non-Journal item)
AN - SCOPUS:84893625451
SN - 9780769551548
T3 - Proceedings - 2013 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2013
SP - 4543
EP - 4548
BT - Proceedings - 2013 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2013
PB - IEEE Press
T2 - 2013 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2013
Y2 - 13 October 2013 through 16 October 2013
ER -