Abstract
Traditional cancer prognosis has relied mainly on clinical data. However, this classic approach may be ineffective for analysing morphologically indistinguishable tumour subtypes, and microarray technology has emerged as a promising alternative. Despite a large number of microarray studies, the clinical application of gene expression data analysis remains limited owing to the complexity and noise level of the generated data. Recently, integrative cluster analysis of both clinical and gene expression data has been shown to be an effective way to overcome these problems. This paper presents a novel cluster ensemble method that is accurate for analysing heterogeneous biological data. Evaluation on real biological and benchmark data sets suggests that the quality of the proposed model is higher than that of many state-of-the-art cluster ensemble techniques and standard clustering algorithms.
Original language | English |
---|---|
Pages (from-to) | 150-168 |
Number of pages | 19 |
Journal | International Journal of Data Mining and Bioinformatics |
Volume | 8 |
Issue number | 2 |
Publication status | Published - 01 Aug 2013 |
Keywords
- Cluster ensembles
- Clustering
- Heterogeneous biological data
- Link analysis
- Gene Expression Profiling/methods
- Gene Expression
- Pattern Recognition, Automated/methods
- Algorithms
- Oligonucleotide Array Sequence Analysis
- Neoplasms/genetics
- Cluster Analysis