Cargando…

Moleculo Long-Read Sequencing Facilitates Assembly and Genomic Binning from Complex Soil Metagenomes

Soil metagenomics has been touted as the “grand challenge” for metagenomics, as the high microbial diversity and spatial heterogeneity of soils make them unamenable to current assembly platforms. Here, we aimed to improve soil metagenomic sequence assembly by applying the Moleculo synthetic long-rea...

Descripción completa

Detalles Bibliográficos
Autores principales: White, Richard Allen, Bottos, Eric M., Roy Chowdhury, Taniya, Zucker, Jeremy D., Brislawn, Colin J., Nicora, Carrie D., Fansler, Sarah J., Glaesemann, Kurt R., Glass, Kevin, Jansson, Janet K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5069762/
https://www.ncbi.nlm.nih.gov/pubmed/27822530
http://dx.doi.org/10.1128/mSystems.00045-16
_version_ 1782460997377720320
author White, Richard Allen
Bottos, Eric M.
Roy Chowdhury, Taniya
Zucker, Jeremy D.
Brislawn, Colin J.
Nicora, Carrie D.
Fansler, Sarah J.
Glaesemann, Kurt R.
Glass, Kevin
Jansson, Janet K.
author_facet White, Richard Allen
Bottos, Eric M.
Roy Chowdhury, Taniya
Zucker, Jeremy D.
Brislawn, Colin J.
Nicora, Carrie D.
Fansler, Sarah J.
Glaesemann, Kurt R.
Glass, Kevin
Jansson, Janet K.
author_sort White, Richard Allen
collection PubMed
description Soil metagenomics has been touted as the “grand challenge” for metagenomics, as the high microbial diversity and spatial heterogeneity of soils make them unamenable to current assembly platforms. Here, we aimed to improve soil metagenomic sequence assembly by applying the Moleculo synthetic long-read sequencing technology. In total, we obtained 267 Gbp of raw sequence data from a native prairie soil; these data included 109.7 Gbp of short-read data (~100 bp) from the Joint Genome Institute (JGI), an additional 87.7 Gbp of rapid-mode read data (~250 bp), plus 69.6 Gbp (>1.5 kbp) from Moleculo sequencing. The Moleculo data alone yielded over 5,600 reads of >10 kbp in length, and over 95% of the unassembled reads mapped to contigs of >1.5 kbp. Hybrid assembly of all data resulted in more than 10,000 contigs over 10 kbp in length. We mapped three replicate metatranscriptomes derived from the same parent soil to the Moleculo subassembly and found that 95% of the predicted genes, based on their assignments to Enzyme Commission (EC) numbers, were expressed. The Moleculo subassembly also enabled binning of >100 microbial genome bins. We obtained via direct binning the first complete genome, that of “Candidatus Pseudomonas sp. strain JKJ-1” from a native soil metagenome. By mapping metatranscriptome sequence reads back to the bins, we found that several bins corresponding to low-relative-abundance Acidobacteria were highly transcriptionally active, whereas bins corresponding to high-relative-abundance Verrucomicrobia were not. These results demonstrate that Moleculo sequencing provides a significant advance for resolving complex soil microbial communities. IMPORTANCE Soil microorganisms carry out key processes for life on our planet, including cycling of carbon and other nutrients and supporting growth of plants. However, there is poor molecular-level understanding of their functional roles in ecosystem stability and responses to environmental perturbations. This knowledge gap is largely due to the difficulty in culturing the majority of soil microbes. Thus, use of culture-independent approaches, such as metagenomics, promises the direct assessment of the functional potential of soil microbiomes. Soil is, however, a challenge for metagenomic assembly due to its high microbial diversity and variable evenness, resulting in low coverage and uneven sampling of microbial genomes. Despite increasingly large soil metagenome data volumes (>200 Gbp), the majority of the data do not assemble. Here, we used the cutting-edge approach of synthetic long-read sequencing technology (Moleculo) to assemble soil metagenome sequence data into long contigs and used the assemblies for binning of genomes. Author Video: An author video summary of this article is available.
format Online
Article
Text
id pubmed-5069762
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-50697622016-11-07 Moleculo Long-Read Sequencing Facilitates Assembly and Genomic Binning from Complex Soil Metagenomes White, Richard Allen Bottos, Eric M. Roy Chowdhury, Taniya Zucker, Jeremy D. Brislawn, Colin J. Nicora, Carrie D. Fansler, Sarah J. Glaesemann, Kurt R. Glass, Kevin Jansson, Janet K. mSystems Research Article Soil metagenomics has been touted as the “grand challenge” for metagenomics, as the high microbial diversity and spatial heterogeneity of soils make them unamenable to current assembly platforms. Here, we aimed to improve soil metagenomic sequence assembly by applying the Moleculo synthetic long-read sequencing technology. In total, we obtained 267 Gbp of raw sequence data from a native prairie soil; these data included 109.7 Gbp of short-read data (~100 bp) from the Joint Genome Institute (JGI), an additional 87.7 Gbp of rapid-mode read data (~250 bp), plus 69.6 Gbp (>1.5 kbp) from Moleculo sequencing. The Moleculo data alone yielded over 5,600 reads of >10 kbp in length, and over 95% of the unassembled reads mapped to contigs of >1.5 kbp. Hybrid assembly of all data resulted in more than 10,000 contigs over 10 kbp in length. We mapped three replicate metatranscriptomes derived from the same parent soil to the Moleculo subassembly and found that 95% of the predicted genes, based on their assignments to Enzyme Commission (EC) numbers, were expressed. The Moleculo subassembly also enabled binning of >100 microbial genome bins. We obtained via direct binning the first complete genome, that of “Candidatus Pseudomonas sp. strain JKJ-1” from a native soil metagenome. By mapping metatranscriptome sequence reads back to the bins, we found that several bins corresponding to low-relative-abundance Acidobacteria were highly transcriptionally active, whereas bins corresponding to high-relative-abundance Verrucomicrobia were not. These results demonstrate that Moleculo sequencing provides a significant advance for resolving complex soil microbial communities. IMPORTANCE Soil microorganisms carry out key processes for life on our planet, including cycling of carbon and other nutrients and supporting growth of plants. However, there is poor molecular-level understanding of their functional roles in ecosystem stability and responses to environmental perturbations. This knowledge gap is largely due to the difficulty in culturing the majority of soil microbes. Thus, use of culture-independent approaches, such as metagenomics, promises the direct assessment of the functional potential of soil microbiomes. Soil is, however, a challenge for metagenomic assembly due to its high microbial diversity and variable evenness, resulting in low coverage and uneven sampling of microbial genomes. Despite increasingly large soil metagenome data volumes (>200 Gbp), the majority of the data do not assemble. Here, we used the cutting-edge approach of synthetic long-read sequencing technology (Moleculo) to assemble soil metagenome sequence data into long contigs and used the assemblies for binning of genomes. Author Video: An author video summary of this article is available. American Society for Microbiology 2016-06-28 /pmc/articles/PMC5069762/ /pubmed/27822530 http://dx.doi.org/10.1128/mSystems.00045-16 Text en Copyright © 2016 White et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (http://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research Article
White, Richard Allen
Bottos, Eric M.
Roy Chowdhury, Taniya
Zucker, Jeremy D.
Brislawn, Colin J.
Nicora, Carrie D.
Fansler, Sarah J.
Glaesemann, Kurt R.
Glass, Kevin
Jansson, Janet K.
Moleculo Long-Read Sequencing Facilitates Assembly and Genomic Binning from Complex Soil Metagenomes
title Moleculo Long-Read Sequencing Facilitates Assembly and Genomic Binning from Complex Soil Metagenomes
title_full Moleculo Long-Read Sequencing Facilitates Assembly and Genomic Binning from Complex Soil Metagenomes
title_fullStr Moleculo Long-Read Sequencing Facilitates Assembly and Genomic Binning from Complex Soil Metagenomes
title_full_unstemmed Moleculo Long-Read Sequencing Facilitates Assembly and Genomic Binning from Complex Soil Metagenomes
title_short Moleculo Long-Read Sequencing Facilitates Assembly and Genomic Binning from Complex Soil Metagenomes
title_sort moleculo long-read sequencing facilitates assembly and genomic binning from complex soil metagenomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5069762/
https://www.ncbi.nlm.nih.gov/pubmed/27822530
http://dx.doi.org/10.1128/mSystems.00045-16
work_keys_str_mv AT whiterichardallen moleculolongreadsequencingfacilitatesassemblyandgenomicbinningfromcomplexsoilmetagenomes
AT bottosericm moleculolongreadsequencingfacilitatesassemblyandgenomicbinningfromcomplexsoilmetagenomes
AT roychowdhurytaniya moleculolongreadsequencingfacilitatesassemblyandgenomicbinningfromcomplexsoilmetagenomes
AT zuckerjeremyd moleculolongreadsequencingfacilitatesassemblyandgenomicbinningfromcomplexsoilmetagenomes
AT brislawncolinj moleculolongreadsequencingfacilitatesassemblyandgenomicbinningfromcomplexsoilmetagenomes
AT nicoracarried moleculolongreadsequencingfacilitatesassemblyandgenomicbinningfromcomplexsoilmetagenomes
AT fanslersarahj moleculolongreadsequencingfacilitatesassemblyandgenomicbinningfromcomplexsoilmetagenomes
AT glaesemannkurtr moleculolongreadsequencingfacilitatesassemblyandgenomicbinningfromcomplexsoilmetagenomes
AT glasskevin moleculolongreadsequencingfacilitatesassemblyandgenomicbinningfromcomplexsoilmetagenomes
AT janssonjanetk moleculolongreadsequencingfacilitatesassemblyandgenomicbinningfromcomplexsoilmetagenomes