Cargando…
Reduced metagenome sequencing for strain-resolution taxonomic profiles
BACKGROUND: Studies of shifts in microbial community composition has many applications. For studies at species or subspecies levels, the 16S amplicon sequencing lacks resolution and is often replaced by full shotgun sequencing. Due to higher costs, this restricts the number of samples sequenced. As...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8008692/ https://www.ncbi.nlm.nih.gov/pubmed/33781324 http://dx.doi.org/10.1186/s40168-021-01019-8 |
_version_ | 1783672737672200192 |
---|---|
author | Snipen, Lars Angell, Inga-Leena Rognes, Torbjørn Rudi, Knut |
author_facet | Snipen, Lars Angell, Inga-Leena Rognes, Torbjørn Rudi, Knut |
author_sort | Snipen, Lars |
collection | PubMed |
description | BACKGROUND: Studies of shifts in microbial community composition has many applications. For studies at species or subspecies levels, the 16S amplicon sequencing lacks resolution and is often replaced by full shotgun sequencing. Due to higher costs, this restricts the number of samples sequenced. As an alternative to a full shotgun sequencing we have investigated the use of Reduced Metagenome Sequencing (RMS) to estimate the composition of a microbial community. This involves the use of double-digested restriction-associated DNA sequencing, which means only a smaller fraction of the genomes are sequenced. The read sets obtained by this approach have properties different from both amplicon and shotgun data, and analysis pipelines for both can either not be used at all or not explore the full potential of RMS data. RESULTS: We suggest a procedure for analyzing such data, based on fragment clustering and the use of a constrained ordinary least square de-convolution for estimating the relative abundance of all community members. Mock community datasets show the potential to clearly separate strains even when the 16S is 100% identical, and genome-wide differences is < 0.02, indicating RMS has a very high resolution. From a simulation study, we compare RMS to shotgun sequencing and show that we get improved abundance estimates when the community has many very closely related genomes. From a real dataset of infant guts, we show that RMS is capable of detecting a strain diversity gradient for Escherichia coli across time. CONCLUSION: We find that RMS is a good alternative to either metabarcoding or shotgun sequencing when it comes to resolving microbial communities at the strain level. Like shotgun metagenomics, it requires a good database of reference genomes and is well suited for studies of the human gut or other communities where many reference genomes exist. A data analysis pipeline is offered, as an R package at https://github.com/larssnip/microRMS. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40168-021-01019-8. |
format | Online Article Text |
id | pubmed-8008692 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-80086922021-03-31 Reduced metagenome sequencing for strain-resolution taxonomic profiles Snipen, Lars Angell, Inga-Leena Rognes, Torbjørn Rudi, Knut Microbiome Methodology BACKGROUND: Studies of shifts in microbial community composition has many applications. For studies at species or subspecies levels, the 16S amplicon sequencing lacks resolution and is often replaced by full shotgun sequencing. Due to higher costs, this restricts the number of samples sequenced. As an alternative to a full shotgun sequencing we have investigated the use of Reduced Metagenome Sequencing (RMS) to estimate the composition of a microbial community. This involves the use of double-digested restriction-associated DNA sequencing, which means only a smaller fraction of the genomes are sequenced. The read sets obtained by this approach have properties different from both amplicon and shotgun data, and analysis pipelines for both can either not be used at all or not explore the full potential of RMS data. RESULTS: We suggest a procedure for analyzing such data, based on fragment clustering and the use of a constrained ordinary least square de-convolution for estimating the relative abundance of all community members. Mock community datasets show the potential to clearly separate strains even when the 16S is 100% identical, and genome-wide differences is < 0.02, indicating RMS has a very high resolution. From a simulation study, we compare RMS to shotgun sequencing and show that we get improved abundance estimates when the community has many very closely related genomes. From a real dataset of infant guts, we show that RMS is capable of detecting a strain diversity gradient for Escherichia coli across time. CONCLUSION: We find that RMS is a good alternative to either metabarcoding or shotgun sequencing when it comes to resolving microbial communities at the strain level. Like shotgun metagenomics, it requires a good database of reference genomes and is well suited for studies of the human gut or other communities where many reference genomes exist. A data analysis pipeline is offered, as an R package at https://github.com/larssnip/microRMS. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40168-021-01019-8. BioMed Central 2021-03-29 /pmc/articles/PMC8008692/ /pubmed/33781324 http://dx.doi.org/10.1186/s40168-021-01019-8 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Snipen, Lars Angell, Inga-Leena Rognes, Torbjørn Rudi, Knut Reduced metagenome sequencing for strain-resolution taxonomic profiles |
title | Reduced metagenome sequencing for strain-resolution taxonomic profiles |
title_full | Reduced metagenome sequencing for strain-resolution taxonomic profiles |
title_fullStr | Reduced metagenome sequencing for strain-resolution taxonomic profiles |
title_full_unstemmed | Reduced metagenome sequencing for strain-resolution taxonomic profiles |
title_short | Reduced metagenome sequencing for strain-resolution taxonomic profiles |
title_sort | reduced metagenome sequencing for strain-resolution taxonomic profiles |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8008692/ https://www.ncbi.nlm.nih.gov/pubmed/33781324 http://dx.doi.org/10.1186/s40168-021-01019-8 |
work_keys_str_mv | AT snipenlars reducedmetagenomesequencingforstrainresolutiontaxonomicprofiles AT angellingaleena reducedmetagenomesequencingforstrainresolutiontaxonomicprofiles AT rognestorbjørn reducedmetagenomesequencingforstrainresolutiontaxonomicprofiles AT rudiknut reducedmetagenomesequencingforstrainresolutiontaxonomicprofiles |