Bagging linear sparse Bayesian learning models for variable selection in cancer diagnosis

Andy Devos, Sabine Van Huffel, Chuan Lu, Johan A. K. Suykens, Carles Arus

Research output: Contribution to journalArticlepeer-review

35 Citations (Scopus)
236 Downloads (Pure)

Abstract

This work investigates variable selection and classification for biomedical datasets with a small sample size and a very high input dimension. The sequential sparse Bayesian learning methods with linear bases are used as the basic variable selection algorithm. Selected variables are fed to the kernel based probabilistic classifiers: Bayesian least squares support vector machines (LS-SVMs) and relevance vector machines (RVMs). We employ the bagging techniques for both variable selection and model building in order to improve the reliability of the selected variables and the predictive performance. This modelling strategy is applied to real-life medical classification problems, including two binary cancer diagnosis problems based on microarray data and a brain tumor multiclass classification problem using spectra acquired via magnetic resonance spectroscopy. The work is experimentally compared to other variable selection methods. It is shown that the use of bagging can improve the reliability and stability of both variable selection and model prediction.
Original languageEnglish
Pages (from-to)338-347
Number of pages10
JournalIEEE Transactions on Information Technology in Biomedicine
Volume11
Issue number3
DOIs
Publication statusPublished - May 2007

Fingerprint

Dive into the research topics of 'Bagging linear sparse Bayesian learning models for variable selection in cancer diagnosis'. Together they form a unique fingerprint.

Cite this