Cargando…
Moleculo Long-Read Sequencing Facilitates Assembly and Genomic Binning from Complex Soil Metagenomes
Soil metagenomics has been touted as the “grand challenge” for metagenomics, as the high microbial diversity and spatial heterogeneity of soils make them unamenable to current assembly platforms. Here, we aimed to improve soil metagenomic sequence assembly by applying the Moleculo synthetic long-rea...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Society for Microbiology
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5069762/ https://www.ncbi.nlm.nih.gov/pubmed/27822530 http://dx.doi.org/10.1128/mSystems.00045-16 |
_version_ | 1782460997377720320 |
---|---|
author | White, Richard Allen Bottos, Eric M. Roy Chowdhury, Taniya Zucker, Jeremy D. Brislawn, Colin J. Nicora, Carrie D. Fansler, Sarah J. Glaesemann, Kurt R. Glass, Kevin Jansson, Janet K. |
author_facet | White, Richard Allen Bottos, Eric M. Roy Chowdhury, Taniya Zucker, Jeremy D. Brislawn, Colin J. Nicora, Carrie D. Fansler, Sarah J. Glaesemann, Kurt R. Glass, Kevin Jansson, Janet K. |
author_sort | White, Richard Allen |
collection | PubMed |
description | Soil metagenomics has been touted as the “grand challenge” for metagenomics, as the high microbial diversity and spatial heterogeneity of soils make them unamenable to current assembly platforms. Here, we aimed to improve soil metagenomic sequence assembly by applying the Moleculo synthetic long-read sequencing technology. In total, we obtained 267 Gbp of raw sequence data from a native prairie soil; these data included 109.7 Gbp of short-read data (~100 bp) from the Joint Genome Institute (JGI), an additional 87.7 Gbp of rapid-mode read data (~250 bp), plus 69.6 Gbp (>1.5 kbp) from Moleculo sequencing. The Moleculo data alone yielded over 5,600 reads of >10 kbp in length, and over 95% of the unassembled reads mapped to contigs of >1.5 kbp. Hybrid assembly of all data resulted in more than 10,000 contigs over 10 kbp in length. We mapped three replicate metatranscriptomes derived from the same parent soil to the Moleculo subassembly and found that 95% of the predicted genes, based on their assignments to Enzyme Commission (EC) numbers, were expressed. The Moleculo subassembly also enabled binning of >100 microbial genome bins. We obtained via direct binning the first complete genome, that of “Candidatus Pseudomonas sp. strain JKJ-1” from a native soil metagenome. By mapping metatranscriptome sequence reads back to the bins, we found that several bins corresponding to low-relative-abundance Acidobacteria were highly transcriptionally active, whereas bins corresponding to high-relative-abundance Verrucomicrobia were not. These results demonstrate that Moleculo sequencing provides a significant advance for resolving complex soil microbial communities. IMPORTANCE Soil microorganisms carry out key processes for life on our planet, including cycling of carbon and other nutrients and supporting growth of plants. However, there is poor molecular-level understanding of their functional roles in ecosystem stability and responses to environmental perturbations. This knowledge gap is largely due to the difficulty in culturing the majority of soil microbes. Thus, use of culture-independent approaches, such as metagenomics, promises the direct assessment of the functional potential of soil microbiomes. Soil is, however, a challenge for metagenomic assembly due to its high microbial diversity and variable evenness, resulting in low coverage and uneven sampling of microbial genomes. Despite increasingly large soil metagenome data volumes (>200 Gbp), the majority of the data do not assemble. Here, we used the cutting-edge approach of synthetic long-read sequencing technology (Moleculo) to assemble soil metagenome sequence data into long contigs and used the assemblies for binning of genomes. Author Video: An author video summary of this article is available. |
format | Online Article Text |
id | pubmed-5069762 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | American Society for Microbiology |
record_format | MEDLINE/PubMed |
spelling | pubmed-50697622016-11-07 Moleculo Long-Read Sequencing Facilitates Assembly and Genomic Binning from Complex Soil Metagenomes White, Richard Allen Bottos, Eric M. Roy Chowdhury, Taniya Zucker, Jeremy D. Brislawn, Colin J. Nicora, Carrie D. Fansler, Sarah J. Glaesemann, Kurt R. Glass, Kevin Jansson, Janet K. mSystems Research Article Soil metagenomics has been touted as the “grand challenge” for metagenomics, as the high microbial diversity and spatial heterogeneity of soils make them unamenable to current assembly platforms. Here, we aimed to improve soil metagenomic sequence assembly by applying the Moleculo synthetic long-read sequencing technology. In total, we obtained 267 Gbp of raw sequence data from a native prairie soil; these data included 109.7 Gbp of short-read data (~100 bp) from the Joint Genome Institute (JGI), an additional 87.7 Gbp of rapid-mode read data (~250 bp), plus 69.6 Gbp (>1.5 kbp) from Moleculo sequencing. The Moleculo data alone yielded over 5,600 reads of >10 kbp in length, and over 95% of the unassembled reads mapped to contigs of >1.5 kbp. Hybrid assembly of all data resulted in more than 10,000 contigs over 10 kbp in length. We mapped three replicate metatranscriptomes derived from the same parent soil to the Moleculo subassembly and found that 95% of the predicted genes, based on their assignments to Enzyme Commission (EC) numbers, were expressed. The Moleculo subassembly also enabled binning of >100 microbial genome bins. We obtained via direct binning the first complete genome, that of “Candidatus Pseudomonas sp. strain JKJ-1” from a native soil metagenome. By mapping metatranscriptome sequence reads back to the bins, we found that several bins corresponding to low-relative-abundance Acidobacteria were highly transcriptionally active, whereas bins corresponding to high-relative-abundance Verrucomicrobia were not. These results demonstrate that Moleculo sequencing provides a significant advance for resolving complex soil microbial communities. IMPORTANCE Soil microorganisms carry out key processes for life on our planet, including cycling of carbon and other nutrients and supporting growth of plants. However, there is poor molecular-level understanding of their functional roles in ecosystem stability and responses to environmental perturbations. This knowledge gap is largely due to the difficulty in culturing the majority of soil microbes. Thus, use of culture-independent approaches, such as metagenomics, promises the direct assessment of the functional potential of soil microbiomes. Soil is, however, a challenge for metagenomic assembly due to its high microbial diversity and variable evenness, resulting in low coverage and uneven sampling of microbial genomes. Despite increasingly large soil metagenome data volumes (>200 Gbp), the majority of the data do not assemble. Here, we used the cutting-edge approach of synthetic long-read sequencing technology (Moleculo) to assemble soil metagenome sequence data into long contigs and used the assemblies for binning of genomes. Author Video: An author video summary of this article is available. American Society for Microbiology 2016-06-28 /pmc/articles/PMC5069762/ /pubmed/27822530 http://dx.doi.org/10.1128/mSystems.00045-16 Text en Copyright © 2016 White et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (http://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Research Article White, Richard Allen Bottos, Eric M. Roy Chowdhury, Taniya Zucker, Jeremy D. Brislawn, Colin J. Nicora, Carrie D. Fansler, Sarah J. Glaesemann, Kurt R. Glass, Kevin Jansson, Janet K. Moleculo Long-Read Sequencing Facilitates Assembly and Genomic Binning from Complex Soil Metagenomes |
title | Moleculo Long-Read Sequencing Facilitates Assembly and Genomic Binning from Complex Soil Metagenomes |
title_full | Moleculo Long-Read Sequencing Facilitates Assembly and Genomic Binning from Complex Soil Metagenomes |
title_fullStr | Moleculo Long-Read Sequencing Facilitates Assembly and Genomic Binning from Complex Soil Metagenomes |
title_full_unstemmed | Moleculo Long-Read Sequencing Facilitates Assembly and Genomic Binning from Complex Soil Metagenomes |
title_short | Moleculo Long-Read Sequencing Facilitates Assembly and Genomic Binning from Complex Soil Metagenomes |
title_sort | moleculo long-read sequencing facilitates assembly and genomic binning from complex soil metagenomes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5069762/ https://www.ncbi.nlm.nih.gov/pubmed/27822530 http://dx.doi.org/10.1128/mSystems.00045-16 |
work_keys_str_mv | AT whiterichardallen moleculolongreadsequencingfacilitatesassemblyandgenomicbinningfromcomplexsoilmetagenomes AT bottosericm moleculolongreadsequencingfacilitatesassemblyandgenomicbinningfromcomplexsoilmetagenomes AT roychowdhurytaniya moleculolongreadsequencingfacilitatesassemblyandgenomicbinningfromcomplexsoilmetagenomes AT zuckerjeremyd moleculolongreadsequencingfacilitatesassemblyandgenomicbinningfromcomplexsoilmetagenomes AT brislawncolinj moleculolongreadsequencingfacilitatesassemblyandgenomicbinningfromcomplexsoilmetagenomes AT nicoracarried moleculolongreadsequencingfacilitatesassemblyandgenomicbinningfromcomplexsoilmetagenomes AT fanslersarahj moleculolongreadsequencingfacilitatesassemblyandgenomicbinningfromcomplexsoilmetagenomes AT glaesemannkurtr moleculolongreadsequencingfacilitatesassemblyandgenomicbinningfromcomplexsoilmetagenomes AT glasskevin moleculolongreadsequencingfacilitatesassemblyandgenomicbinningfromcomplexsoilmetagenomes AT janssonjanetk moleculolongreadsequencingfacilitatesassemblyandgenomicbinningfromcomplexsoilmetagenomes |