Cargando…

MetaGeniE: Characterizing Human Clinical Samples Using Deep Metagenomic Sequencing

With the decreasing cost of next-generation sequencing, deep sequencing of clinical samples provides unique opportunities to understand host-associated microbial communities. Among the primary challenges of clinical metagenomic sequencing is the rapid filtering of human reads to survey for pathogens...

Descripción completa

Detalles Bibliográficos
Autores principales: Rawat, Arun, Engelthaler, David M., Driebe, Elizabeth M., Keim, Paul, Foster, Jeffrey T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4218713/
https://www.ncbi.nlm.nih.gov/pubmed/25365329
http://dx.doi.org/10.1371/journal.pone.0110915
_version_ 1782342462589632512
author Rawat, Arun
Engelthaler, David M.
Driebe, Elizabeth M.
Keim, Paul
Foster, Jeffrey T.
author_facet Rawat, Arun
Engelthaler, David M.
Driebe, Elizabeth M.
Keim, Paul
Foster, Jeffrey T.
author_sort Rawat, Arun
collection PubMed
description With the decreasing cost of next-generation sequencing, deep sequencing of clinical samples provides unique opportunities to understand host-associated microbial communities. Among the primary challenges of clinical metagenomic sequencing is the rapid filtering of human reads to survey for pathogens with high specificity and sensitivity. Metagenomes are inherently variable due to different microbes in the samples and their relative abundance, the size and architecture of genomes, and factors such as target DNA amounts in tissue samples (i.e. human DNA versus pathogen DNA concentration). This variation in metagenomes typically manifests in sequencing datasets as low pathogen abundance, a high number of host reads, and the presence of close relatives and complex microbial communities. In addition to these challenges posed by the composition of metagenomes, high numbers of reads generated from high-throughput deep sequencing pose immense computational challenges. Accurate identification of pathogens is confounded by individual reads mapping to multiple different reference genomes due to gene similarity in different taxa present in the community or close relatives in the reference database. Available global and local sequence aligners also vary in sensitivity, specificity, and speed of detection. The efficiency of detection of pathogens in clinical samples is largely dependent on the desired taxonomic resolution of the organisms. We have developed an efficient strategy that identifies “all against all” relationships between sequencing reads and reference genomes. Our approach allows for scaling to large reference databases and then genome reconstruction by aggregating global and local alignments, thus allowing genetic characterization of pathogens at higher taxonomic resolution. These results were consistent with strain level SNP genotyping and bacterial identification from laboratory culture.
format Online
Article
Text
id pubmed-4218713
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-42187132014-11-05 MetaGeniE: Characterizing Human Clinical Samples Using Deep Metagenomic Sequencing Rawat, Arun Engelthaler, David M. Driebe, Elizabeth M. Keim, Paul Foster, Jeffrey T. PLoS One Research Article With the decreasing cost of next-generation sequencing, deep sequencing of clinical samples provides unique opportunities to understand host-associated microbial communities. Among the primary challenges of clinical metagenomic sequencing is the rapid filtering of human reads to survey for pathogens with high specificity and sensitivity. Metagenomes are inherently variable due to different microbes in the samples and their relative abundance, the size and architecture of genomes, and factors such as target DNA amounts in tissue samples (i.e. human DNA versus pathogen DNA concentration). This variation in metagenomes typically manifests in sequencing datasets as low pathogen abundance, a high number of host reads, and the presence of close relatives and complex microbial communities. In addition to these challenges posed by the composition of metagenomes, high numbers of reads generated from high-throughput deep sequencing pose immense computational challenges. Accurate identification of pathogens is confounded by individual reads mapping to multiple different reference genomes due to gene similarity in different taxa present in the community or close relatives in the reference database. Available global and local sequence aligners also vary in sensitivity, specificity, and speed of detection. The efficiency of detection of pathogens in clinical samples is largely dependent on the desired taxonomic resolution of the organisms. We have developed an efficient strategy that identifies “all against all” relationships between sequencing reads and reference genomes. Our approach allows for scaling to large reference databases and then genome reconstruction by aggregating global and local alignments, thus allowing genetic characterization of pathogens at higher taxonomic resolution. These results were consistent with strain level SNP genotyping and bacterial identification from laboratory culture. Public Library of Science 2014-11-03 /pmc/articles/PMC4218713/ /pubmed/25365329 http://dx.doi.org/10.1371/journal.pone.0110915 Text en © 2014 Rawat et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Rawat, Arun
Engelthaler, David M.
Driebe, Elizabeth M.
Keim, Paul
Foster, Jeffrey T.
MetaGeniE: Characterizing Human Clinical Samples Using Deep Metagenomic Sequencing
title MetaGeniE: Characterizing Human Clinical Samples Using Deep Metagenomic Sequencing
title_full MetaGeniE: Characterizing Human Clinical Samples Using Deep Metagenomic Sequencing
title_fullStr MetaGeniE: Characterizing Human Clinical Samples Using Deep Metagenomic Sequencing
title_full_unstemmed MetaGeniE: Characterizing Human Clinical Samples Using Deep Metagenomic Sequencing
title_short MetaGeniE: Characterizing Human Clinical Samples Using Deep Metagenomic Sequencing
title_sort metagenie: characterizing human clinical samples using deep metagenomic sequencing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4218713/
https://www.ncbi.nlm.nih.gov/pubmed/25365329
http://dx.doi.org/10.1371/journal.pone.0110915
work_keys_str_mv AT rawatarun metageniecharacterizinghumanclinicalsamplesusingdeepmetagenomicsequencing
AT engelthalerdavidm metageniecharacterizinghumanclinicalsamplesusingdeepmetagenomicsequencing
AT driebeelizabethm metageniecharacterizinghumanclinicalsamplesusingdeepmetagenomicsequencing
AT keimpaul metageniecharacterizinghumanclinicalsamplesusingdeepmetagenomicsequencing
AT fosterjeffreyt metageniecharacterizinghumanclinicalsamplesusingdeepmetagenomicsequencing