Cargando…

Accurate, multi-kb reads resolve complex populations and detect rare microorganisms

Accurate evaluation of microbial communities is essential for understanding global biogeochemical processes and can guide bioremediation and medical treatments. Metagenomics is most commonly used to analyze microbial diversity and metabolic potential, but assemblies of the short reads generated by c...

Descripción completa

Detalles Bibliográficos
Autores principales: Sharon, Itai, Kertesz, Michael, Hug, Laura A., Pushkarev, Dmitry, Blauwkamp, Timothy A., Castelle, Cindy J., Amirebrahimi, Mojgan, Thomas, Brian C., Burstein, David, Tringe, Susannah G., Williams, Kenneth H., Banfield, Jillian F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4381525/
https://www.ncbi.nlm.nih.gov/pubmed/25665577
http://dx.doi.org/10.1101/gr.183012.114
_version_ 1782364469279588352
author Sharon, Itai
Kertesz, Michael
Hug, Laura A.
Pushkarev, Dmitry
Blauwkamp, Timothy A.
Castelle, Cindy J.
Amirebrahimi, Mojgan
Thomas, Brian C.
Burstein, David
Tringe, Susannah G.
Williams, Kenneth H.
Banfield, Jillian F.
author_facet Sharon, Itai
Kertesz, Michael
Hug, Laura A.
Pushkarev, Dmitry
Blauwkamp, Timothy A.
Castelle, Cindy J.
Amirebrahimi, Mojgan
Thomas, Brian C.
Burstein, David
Tringe, Susannah G.
Williams, Kenneth H.
Banfield, Jillian F.
author_sort Sharon, Itai
collection PubMed
description Accurate evaluation of microbial communities is essential for understanding global biogeochemical processes and can guide bioremediation and medical treatments. Metagenomics is most commonly used to analyze microbial diversity and metabolic potential, but assemblies of the short reads generated by current sequencing platforms may fail to recover heterogeneous strain populations and rare organisms. Here we used short (150-bp) and long (multi-kb) synthetic reads to evaluate strain heterogeneity and study microorganisms at low abundance in complex microbial communities from terrestrial sediments. The long-read data revealed multiple (probably dozens of) closely related species and strains from previously undescribed Deltaproteobacteria and Aminicenantes (candidate phylum OP8). Notably, these are the most abundant organisms in the communities, yet short-read assemblies achieved only partial genome coverage, mostly in the form of short scaffolds (N50 = ∼2200 bp). Genome architecture and metabolic potential for these lineages were reconstructed using a new synteny-based method. Analysis of long-read data also revealed thousands of species whose abundances were <0.1% in all samples. Most of the organisms in this “long tail” of rare organisms belong to phyla that are also represented by abundant organisms. Genes encoding glycosyl hydrolases are significantly more abundant than expected in rare genomes, suggesting that rare species may augment the capability for carbon turnover and confer resilience to changing environmental conditions. Overall, the study showed that a diversity of closely related strains and rare organisms account for a major portion of the communities. These are probably common features of many microbial communities and can be effectively studied using a combination of long and short reads.
format Online
Article
Text
id pubmed-4381525
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-43815252015-10-01 Accurate, multi-kb reads resolve complex populations and detect rare microorganisms Sharon, Itai Kertesz, Michael Hug, Laura A. Pushkarev, Dmitry Blauwkamp, Timothy A. Castelle, Cindy J. Amirebrahimi, Mojgan Thomas, Brian C. Burstein, David Tringe, Susannah G. Williams, Kenneth H. Banfield, Jillian F. Genome Res Research Accurate evaluation of microbial communities is essential for understanding global biogeochemical processes and can guide bioremediation and medical treatments. Metagenomics is most commonly used to analyze microbial diversity and metabolic potential, but assemblies of the short reads generated by current sequencing platforms may fail to recover heterogeneous strain populations and rare organisms. Here we used short (150-bp) and long (multi-kb) synthetic reads to evaluate strain heterogeneity and study microorganisms at low abundance in complex microbial communities from terrestrial sediments. The long-read data revealed multiple (probably dozens of) closely related species and strains from previously undescribed Deltaproteobacteria and Aminicenantes (candidate phylum OP8). Notably, these are the most abundant organisms in the communities, yet short-read assemblies achieved only partial genome coverage, mostly in the form of short scaffolds (N50 = ∼2200 bp). Genome architecture and metabolic potential for these lineages were reconstructed using a new synteny-based method. Analysis of long-read data also revealed thousands of species whose abundances were <0.1% in all samples. Most of the organisms in this “long tail” of rare organisms belong to phyla that are also represented by abundant organisms. Genes encoding glycosyl hydrolases are significantly more abundant than expected in rare genomes, suggesting that rare species may augment the capability for carbon turnover and confer resilience to changing environmental conditions. Overall, the study showed that a diversity of closely related strains and rare organisms account for a major portion of the communities. These are probably common features of many microbial communities and can be effectively studied using a combination of long and short reads. Cold Spring Harbor Laboratory Press 2015-04 /pmc/articles/PMC4381525/ /pubmed/25665577 http://dx.doi.org/10.1101/gr.183012.114 Text en © 2015 Sharon et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Research
Sharon, Itai
Kertesz, Michael
Hug, Laura A.
Pushkarev, Dmitry
Blauwkamp, Timothy A.
Castelle, Cindy J.
Amirebrahimi, Mojgan
Thomas, Brian C.
Burstein, David
Tringe, Susannah G.
Williams, Kenneth H.
Banfield, Jillian F.
Accurate, multi-kb reads resolve complex populations and detect rare microorganisms
title Accurate, multi-kb reads resolve complex populations and detect rare microorganisms
title_full Accurate, multi-kb reads resolve complex populations and detect rare microorganisms
title_fullStr Accurate, multi-kb reads resolve complex populations and detect rare microorganisms
title_full_unstemmed Accurate, multi-kb reads resolve complex populations and detect rare microorganisms
title_short Accurate, multi-kb reads resolve complex populations and detect rare microorganisms
title_sort accurate, multi-kb reads resolve complex populations and detect rare microorganisms
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4381525/
https://www.ncbi.nlm.nih.gov/pubmed/25665577
http://dx.doi.org/10.1101/gr.183012.114
work_keys_str_mv AT sharonitai accuratemultikbreadsresolvecomplexpopulationsanddetectraremicroorganisms
AT kerteszmichael accuratemultikbreadsresolvecomplexpopulationsanddetectraremicroorganisms
AT huglauraa accuratemultikbreadsresolvecomplexpopulationsanddetectraremicroorganisms
AT pushkarevdmitry accuratemultikbreadsresolvecomplexpopulationsanddetectraremicroorganisms
AT blauwkamptimothya accuratemultikbreadsresolvecomplexpopulationsanddetectraremicroorganisms
AT castellecindyj accuratemultikbreadsresolvecomplexpopulationsanddetectraremicroorganisms
AT amirebrahimimojgan accuratemultikbreadsresolvecomplexpopulationsanddetectraremicroorganisms
AT thomasbrianc accuratemultikbreadsresolvecomplexpopulationsanddetectraremicroorganisms
AT bursteindavid accuratemultikbreadsresolvecomplexpopulationsanddetectraremicroorganisms
AT tringesusannahg accuratemultikbreadsresolvecomplexpopulationsanddetectraremicroorganisms
AT williamskennethh accuratemultikbreadsresolvecomplexpopulationsanddetectraremicroorganisms
AT banfieldjillianf accuratemultikbreadsresolvecomplexpopulationsanddetectraremicroorganisms