Fast Comparison of Microbial Genomes Using the Chaos Games Representation for Metagenomic Applications

Allbwn ymchwil: Cyfraniad at gyfnodolynErthygladolygiad gan gymheiriaid

10 Dyfyniadau (Scopus)
151 Wedi eu Llwytho i Lawr (Pure)

Crynodeb

Genome sequencing technology is generating large databases of sequence at such a rate that advances in computer hardware alone are not adequate to handle them: more efficient algorithms are needed. Here an alignment-free method of sequence comparison and visualisation based on the Chaos Games Representation (CGR) and multifractal analysis is explored as an approach to search and filter through a data set of over 1500 microbial genomes. Whereas BLAST takes 25 hours to search this data set with large sequence fragments (e.g. 100 Kb), the method introduced here can reduce this data set by 95% (from 1550 target species to just 50) in about 15 minutes, and it is able to predict the exact species correctly in 67% of cases. The results presented here demonstrate that CGR is worth further investigation as a fast method to perform genome sequence comparison on large data sets, and various ways to further develop the method are discussed.
Iaith wreiddiolSaesneg
Tudalennau (o-i)1372-1381
Nifer y tudalennau9
CyfnodolynProcedia Computer Science
Cyfrol18
Dynodwyr Gwrthrych Digidol (DOIs)
StatwsCyhoeddwyd - 01 Meh 2013
Digwyddiad2013 International Conference on Computational Science - Barcelona, Sbaen
Hyd: 05 Meh 201307 Meh 2013

Ôl bys

Gweld gwybodaeth am bynciau ymchwil 'Fast Comparison of Microbial Genomes Using the Chaos Games Representation for Metagenomic Applications'. Gyda’i gilydd, maen nhw’n ffurfio ôl bys unigryw.

Dyfynnu hyn