TY - GEN
T1 - Generating Reliable Genome Assemblies of Intestinal Protozoans from Clinical Samples for the Purpose of Biomarker Discovery
AU - Morris, Arthur
AU - Pachebat, Justin
AU - Tyson, Graeme
AU - Robinson, Guy
AU - Chalmers, Rachel
AU - Swain, Martin
N1 - Publisher Copyright:
© 2020, Springer Nature Switzerland AG.
PY - 2020
Y1 - 2020
N2 - Protozoan parasites that cause diarrhoeal diseases in humans take a massive toll on global public health annually, with over 200,000 deaths in children of less than two years old in Asia and Sub-Saharan Africa being attributed to Cryptosporidium alone. They can, in particular, be a serious health risk for immuno-incompetent individuals. Genomics can be a valuable asset in helping combat these parasites, but there are still problems associated with performing whole genome sequencing from human stool samples. In particular there are issues associated with highly uneven sequence coverage of these parasite genomes, which may result in critical errors in the genome assemblies produced using a number of popular assemblers. We have developed an approach using the Gini statistic to better characterise depth of sequencing coverage. Furthermore, we have explored the sequencing biases resulting from Whole Genome Amplification approaches, and have attempted to relate those to the Gini statistic. We discuss these issues in two parasite genera: Cryptosporidium and Cyclospora, and perform an analysis of the sequencing coverage depth over these genomes. Finally we present our strategy to generate reliable genome assemblies of sufficient quality to facilitate discovery of new Variable Number Tandem Repeat (VNTR) biomarkers.
AB - Protozoan parasites that cause diarrhoeal diseases in humans take a massive toll on global public health annually, with over 200,000 deaths in children of less than two years old in Asia and Sub-Saharan Africa being attributed to Cryptosporidium alone. They can, in particular, be a serious health risk for immuno-incompetent individuals. Genomics can be a valuable asset in helping combat these parasites, but there are still problems associated with performing whole genome sequencing from human stool samples. In particular there are issues associated with highly uneven sequence coverage of these parasite genomes, which may result in critical errors in the genome assemblies produced using a number of popular assemblers. We have developed an approach using the Gini statistic to better characterise depth of sequencing coverage. Furthermore, we have explored the sequencing biases resulting from Whole Genome Amplification approaches, and have attempted to relate those to the Gini statistic. We discuss these issues in two parasite genera: Cryptosporidium and Cyclospora, and perform an analysis of the sequencing coverage depth over these genomes. Finally we present our strategy to generate reliable genome assemblies of sufficient quality to facilitate discovery of new Variable Number Tandem Repeat (VNTR) biomarkers.
KW - Biomarker discovery
KW - Cryptosporidium
KW - Genome assembly
UR - http://www.scopus.com/inward/record.url?scp=85085022668&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-46970-2_11
DO - 10.1007/978-3-030-46970-2_11
M3 - Conference Proceeding (Non-Journal item)
AN - SCOPUS:85085022668
SN - 9783030469696
T3 - Communications in Computer and Information Science
SP - 216
EP - 241
BT - Biomedical Engineering Systems and Technologies - 12th International Joint Conference, BIOSTEC 2019, Revised Selected Papers
A2 - Roque, Ana
A2 - Gamboa, Hugo
A2 - Tomczyk, Arkadiusz
A2 - De Maria, Elisabetta
A2 - Putze, Felix
A2 - Moucek, Roman
A2 - Fred, Ana
PB - Springer Nature
T2 - 12th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2019
Y2 - 22 February 2019 through 24 February 2019
ER -