Systems-based biological concordance and predictive reproducibility of gene set discovery methods in cardiovascular disease

Francisco Azuaje*, Huiru Zheng, Haiying Wang, Anyela Velentine Camargo-Rodriguez

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

8 Citations (Scopus)

Abstract

The discovery of novel disease biomarkers is a crucial challenge for translational bioinformatics. Demonstration of both their classification power and reproducibility across independent datasets are essential requirements to assess their potential clinical relevance. Small datasets and multiplicity of putative biomarker sets may explain lack of predictive reproducibility. Studies based on pathway-driven discovery approaches have suggested that, despite such discrepancies, the resulting putative biomarkers tend to be implicated in common biological processes. Investigations of this problem have been mainly focused on datasets derived from cancer research. We investigated the predictive and functional concordance of five methods for discovering putative biomarkers in four independently-generated datasets from the cardiovascular disease domain. A diversity of biosignatures was identified by the different methods. However, we found strong biological process concordance between them, especially in the case of methods based on gene set analysis. With a few exceptions, we observed lack of classification reproducibility using independent datasets. Partial overlaps between our putative sets of biomarkers and the primary studies exist. Despite the observed limitations, pathway-driven or gene set analysis can predict potentially novel biomarkers and can jointly point to biomedically-relevant underlying molecular mechanisms. (C) 2011 Elsevier Inc. All rights reserved.

Original languageEnglish
Pages (from-to)637-647
Number of pages11
JournalJournal of Biomedical Informatics
Volume44
Issue number4
Early online date09 Feb 2011
DOIs
Publication statusPublished - Aug 2011

Keywords

  • ATHEROSCLEROSIS
  • GENOME-WIDE EXPRESSION
  • BIOINFORMATICS
  • CANCER
  • HEART-FAILURE
  • Pathway analysis
  • SIGNATURES
  • MICROARRAY DATA
  • NETWORK
  • Biomarker discovery
  • Human heart failure
  • Disease networks
  • Gene set analysis
  • PATHWAY ANALYSIS
  • Translational bioinformatics
  • Cardiovascular diseases
  • ENRICHMENT ANALYSIS

Fingerprint

Dive into the research topics of 'Systems-based biological concordance and predictive reproducibility of gene set discovery methods in cardiovascular disease'. Together they form a unique fingerprint.

Cite this