Cargando…

Comparative Analysis of Functional Metagenomic Annotation and the Mappability of Short Reads

To assess the functional capacities of microbial communities, including those inhabiting the human body, shotgun metagenomic reads are often aligned to a database of known genes. Such homology-based annotation practices critically rely on the assumption that short reads can map to orthologous genes...

Descripción completa

Detalles Bibliográficos
Autores principales: Carr, Rogan, Borenstein, Elhanan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4141809/
https://www.ncbi.nlm.nih.gov/pubmed/25148512
http://dx.doi.org/10.1371/journal.pone.0105776
_version_ 1782331696442507264
author Carr, Rogan
Borenstein, Elhanan
author_facet Carr, Rogan
Borenstein, Elhanan
author_sort Carr, Rogan
collection PubMed
description To assess the functional capacities of microbial communities, including those inhabiting the human body, shotgun metagenomic reads are often aligned to a database of known genes. Such homology-based annotation practices critically rely on the assumption that short reads can map to orthologous genes of similar function. This assumption, however, and the various factors that impact short read annotation, have not been systematically evaluated. To address this challenge, we generated an extremely large database of simulated reads (totaling 15.9 Gb), spanning over 500,000 microbial genes and 170 curated genomes and including, for many genomes, every possible read of a given length. We annotated each read using common metagenomic protocols, fully characterizing the effect of read length, sequencing error, phylogeny, database coverage, and mapping parameters. We additionally rigorously quantified gene-, genome-, and protocol-specific annotation biases. Overall, our findings provide a first comprehensive evaluation of the capabilities and limitations of functional metagenomic annotation, providing crucial goal-specific best-practice guidelines to inform future metagenomic research.
format Online
Article
Text
id pubmed-4141809
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-41418092014-08-25 Comparative Analysis of Functional Metagenomic Annotation and the Mappability of Short Reads Carr, Rogan Borenstein, Elhanan PLoS One Research Article To assess the functional capacities of microbial communities, including those inhabiting the human body, shotgun metagenomic reads are often aligned to a database of known genes. Such homology-based annotation practices critically rely on the assumption that short reads can map to orthologous genes of similar function. This assumption, however, and the various factors that impact short read annotation, have not been systematically evaluated. To address this challenge, we generated an extremely large database of simulated reads (totaling 15.9 Gb), spanning over 500,000 microbial genes and 170 curated genomes and including, for many genomes, every possible read of a given length. We annotated each read using common metagenomic protocols, fully characterizing the effect of read length, sequencing error, phylogeny, database coverage, and mapping parameters. We additionally rigorously quantified gene-, genome-, and protocol-specific annotation biases. Overall, our findings provide a first comprehensive evaluation of the capabilities and limitations of functional metagenomic annotation, providing crucial goal-specific best-practice guidelines to inform future metagenomic research. Public Library of Science 2014-08-22 /pmc/articles/PMC4141809/ /pubmed/25148512 http://dx.doi.org/10.1371/journal.pone.0105776 Text en © 2014 Carr, Borenstein http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Carr, Rogan
Borenstein, Elhanan
Comparative Analysis of Functional Metagenomic Annotation and the Mappability of Short Reads
title Comparative Analysis of Functional Metagenomic Annotation and the Mappability of Short Reads
title_full Comparative Analysis of Functional Metagenomic Annotation and the Mappability of Short Reads
title_fullStr Comparative Analysis of Functional Metagenomic Annotation and the Mappability of Short Reads
title_full_unstemmed Comparative Analysis of Functional Metagenomic Annotation and the Mappability of Short Reads
title_short Comparative Analysis of Functional Metagenomic Annotation and the Mappability of Short Reads
title_sort comparative analysis of functional metagenomic annotation and the mappability of short reads
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4141809/
https://www.ncbi.nlm.nih.gov/pubmed/25148512
http://dx.doi.org/10.1371/journal.pone.0105776
work_keys_str_mv AT carrrogan comparativeanalysisoffunctionalmetagenomicannotationandthemappabilityofshortreads
AT borensteinelhanan comparativeanalysisoffunctionalmetagenomicannotationandthemappabilityofshortreads