Abstract
We discuss application of a machine learning method, Random Forest (RF), for the extraction of relevant biological knowledge from metabolomics fingerprinting experiments. The importance of RF margins and variable significance as well as prediction accuracy is discussed to provide insight into model generalisability and explanatory power. A method is described for detection of relevant features while conserving the redundant structure of the fingerprint data. The methodology is illustrated using two datasets from electrospray ionisation mass spectrometry from 27 Arabidopsis genotypes and a set of transgenic potato lines.
| Original language | English |
|---|---|
| Title of host publication | Second International Symposium, CompLife 2006, Cambridge, UK, September 27-29, 2006. Proceedings |
| Publisher | Springer Nature |
| Pages | 226-235 |
| Number of pages | 10 |
| ISBN (Electronic) | 978-3-540-45768-8 |
| ISBN (Print) | 978-3-540-45767-1 |
| DOIs | |
| Publication status | Published - 2006 |
| Event | International Symposium, CompLife - Cambridge, United Kingdom of Great Britain and Northern Ireland Duration: 27 Sept 2006 → 29 Sept 2006 |
Conference
| Conference | International Symposium, CompLife |
|---|---|
| Country/Territory | United Kingdom of Great Britain and Northern Ireland |
| City | Cambridge |
| Period | 27 Sept 2006 → 29 Sept 2006 |
Fingerprint
Dive into the research topics of 'On the interpretation of high throughput MS based metabolomics fingerprints with Random Forest'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver