Using varying negative examples to improve computational predictions of transcription factor binding sites

Faisal I Rezwan, Yi Sun, Neil Davey, Rod Adams, Alistair G. Rust, Mark Robinson

Research output: Chapter in Book/Report/Conference proceedingConference Proceeding (Non-Journal item)

1 Citation (Scopus)

Abstract

The identification of transcription factor binding sites (TFBSs ) is a non-trivial problem as the existing computational predictors produce a lot of false predictions. Though it is proven that combining these predictions with a meta-classifier, like Support Vector Machines (SVMs), can improve the overall results, this improvement is not as significant as expected. The reason for this is that the predictors are not reliable for the negative examples from non-binding sites in the promoter region. Therefore, using negative examples from different sources during training an SVM can be one of the solutions to this problem. In this study, we used different types of negative examples during training the classifier. These negative examples can be far away from the promoter regions or produced by randomisation or from the intronic region of genes. By using these negative examples during training, we observed their effect in improving predictions of TFBSs in the yeast. We also used a modified cross-validation method for this type of problem. Thus we observed substantial improvement in the classifier performance that could constitute a model for predicting TFBSs. Therefore, the major contribution of the analysis is that for the yeast genome, the position of binding sites could be predicted with high confidence using our technique and the predictions are of much higher quality than the predictions of the original prediction algorithms.
Original languageEnglish
Title of host publicationEngineering Applications of Neural Networks - 13th International Conference, EANN 2012, Proceedings
Subtitle of host publication13th International Conference, EANN 2012, London, UK, September 20-23, 2012.
EditorsChristina Jayne, Shigang Yue, Lazaros Iliadis
PublisherSpringer Nature
Pages234-243
Number of pages10
ISBN (Electronic)978-3-642-32909-8
ISBN (Print)978-3-642-32908-1
DOIs
Publication statusPublished - 16 Aug 2012
Externally publishedYes
Event13th International Conference of Engineering Applications of Neural Networks: EANN 2012 - London, United Kingdom of Great Britain and Northern Ireland
Duration: 20 Sept 201223 Sept 2012

Publication series

NameCommunications in Computer and Information Science
PublisherSpringer Nature
Volume311
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference13th International Conference of Engineering Applications of Neural Networks
Country/TerritoryUnited Kingdom of Great Britain and Northern Ireland
CityLondon
Period20 Sept 201223 Sept 2012

Fingerprint

Dive into the research topics of 'Using varying negative examples to improve computational predictions of transcription factor binding sites'. Together they form a unique fingerprint.

Cite this