Cargando…

High-Specificity Targeted Functional Profiling in Microbial Communities with ShortBRED

Profiling microbial community function from metagenomic sequencing data remains a computationally challenging problem. Mapping millions of DNA reads from such samples to reference protein databases requires long run-times, and short read lengths can result in spurious hits to unrelated proteins (los...

Descripción completa

Detalles Bibliográficos
Autores principales: Kaminski, James, Gibson, Molly K., Franzosa, Eric A., Segata, Nicola, Dantas, Gautam, Huttenhower, Curtis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4684307/
https://www.ncbi.nlm.nih.gov/pubmed/26682918
http://dx.doi.org/10.1371/journal.pcbi.1004557
_version_ 1782406166137012224
author Kaminski, James
Gibson, Molly K.
Franzosa, Eric A.
Segata, Nicola
Dantas, Gautam
Huttenhower, Curtis
author_facet Kaminski, James
Gibson, Molly K.
Franzosa, Eric A.
Segata, Nicola
Dantas, Gautam
Huttenhower, Curtis
author_sort Kaminski, James
collection PubMed
description Profiling microbial community function from metagenomic sequencing data remains a computationally challenging problem. Mapping millions of DNA reads from such samples to reference protein databases requires long run-times, and short read lengths can result in spurious hits to unrelated proteins (loss of specificity). We developed ShortBRED (Short, Better Representative Extract Dataset) to address these challenges, facilitating fast, accurate functional profiling of metagenomic samples. ShortBRED consists of two components: (i) a method that reduces reference proteins of interest to short, highly representative amino acid sequences (“markers”) and (ii) a search step that maps reads to these markers to quantify the relative abundance of their associated proteins. After evaluating ShortBRED on synthetic data, we applied it to profile antibiotic resistance protein families in the gut microbiomes of individuals from the United States, China, Malawi, and Venezuela. Our results support antibiotic resistance as a core function in the human gut microbiome, with tetracycline-resistant ribosomal protection proteins and Class A beta-lactamases being the most widely distributed resistance mechanisms worldwide. ShortBRED markers are applicable to other homology-based search tasks, which we demonstrate here by identifying phylogenetic signatures of antibiotic resistance across more than 3,000 microbial isolate genomes. ShortBRED can be applied to profile a wide variety of protein families of interest; the software, source code, and documentation are available for download at http://huttenhower.sph.harvard.edu/shortbred
format Online
Article
Text
id pubmed-4684307
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-46843072015-12-31 High-Specificity Targeted Functional Profiling in Microbial Communities with ShortBRED Kaminski, James Gibson, Molly K. Franzosa, Eric A. Segata, Nicola Dantas, Gautam Huttenhower, Curtis PLoS Comput Biol Research Article Profiling microbial community function from metagenomic sequencing data remains a computationally challenging problem. Mapping millions of DNA reads from such samples to reference protein databases requires long run-times, and short read lengths can result in spurious hits to unrelated proteins (loss of specificity). We developed ShortBRED (Short, Better Representative Extract Dataset) to address these challenges, facilitating fast, accurate functional profiling of metagenomic samples. ShortBRED consists of two components: (i) a method that reduces reference proteins of interest to short, highly representative amino acid sequences (“markers”) and (ii) a search step that maps reads to these markers to quantify the relative abundance of their associated proteins. After evaluating ShortBRED on synthetic data, we applied it to profile antibiotic resistance protein families in the gut microbiomes of individuals from the United States, China, Malawi, and Venezuela. Our results support antibiotic resistance as a core function in the human gut microbiome, with tetracycline-resistant ribosomal protection proteins and Class A beta-lactamases being the most widely distributed resistance mechanisms worldwide. ShortBRED markers are applicable to other homology-based search tasks, which we demonstrate here by identifying phylogenetic signatures of antibiotic resistance across more than 3,000 microbial isolate genomes. ShortBRED can be applied to profile a wide variety of protein families of interest; the software, source code, and documentation are available for download at http://huttenhower.sph.harvard.edu/shortbred Public Library of Science 2015-12-18 /pmc/articles/PMC4684307/ /pubmed/26682918 http://dx.doi.org/10.1371/journal.pcbi.1004557 Text en © 2015 Kaminski et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Kaminski, James
Gibson, Molly K.
Franzosa, Eric A.
Segata, Nicola
Dantas, Gautam
Huttenhower, Curtis
High-Specificity Targeted Functional Profiling in Microbial Communities with ShortBRED
title High-Specificity Targeted Functional Profiling in Microbial Communities with ShortBRED
title_full High-Specificity Targeted Functional Profiling in Microbial Communities with ShortBRED
title_fullStr High-Specificity Targeted Functional Profiling in Microbial Communities with ShortBRED
title_full_unstemmed High-Specificity Targeted Functional Profiling in Microbial Communities with ShortBRED
title_short High-Specificity Targeted Functional Profiling in Microbial Communities with ShortBRED
title_sort high-specificity targeted functional profiling in microbial communities with shortbred
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4684307/
https://www.ncbi.nlm.nih.gov/pubmed/26682918
http://dx.doi.org/10.1371/journal.pcbi.1004557
work_keys_str_mv AT kaminskijames highspecificitytargetedfunctionalprofilinginmicrobialcommunitieswithshortbred
AT gibsonmollyk highspecificitytargetedfunctionalprofilinginmicrobialcommunitieswithshortbred
AT franzosaerica highspecificitytargetedfunctionalprofilinginmicrobialcommunitieswithshortbred
AT segatanicola highspecificitytargetedfunctionalprofilinginmicrobialcommunitieswithshortbred
AT dantasgautam highspecificitytargetedfunctionalprofilinginmicrobialcommunitieswithshortbred
AT huttenhowercurtis highspecificitytargetedfunctionalprofilinginmicrobialcommunitieswithshortbred