Cargando…

Do you cov me? Effect of coverage reduction on metagenome shotgun sequencing studies

Shotgun metagenomics sequencing is a powerful tool for the characterization of complex biological matrices, enabling analysis of prokaryotic and eukaryotic organisms and viruses in a single experiment, with the possibility of reconstructing de novo the whole metagenome or a set of genes of interest....

Descripción completa

Detalles Bibliográficos
Autores principales: Cattonaro, Federica, Spadotto, Alessandro, Radovic, Slobodanka, Marroni, Fabio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7059852/
https://www.ncbi.nlm.nih.gov/pubmed/32185014
http://dx.doi.org/10.12688/f1000research.16804.4
_version_ 1783504130944270336
author Cattonaro, Federica
Spadotto, Alessandro
Radovic, Slobodanka
Marroni, Fabio
author_facet Cattonaro, Federica
Spadotto, Alessandro
Radovic, Slobodanka
Marroni, Fabio
author_sort Cattonaro, Federica
collection PubMed
description Shotgun metagenomics sequencing is a powerful tool for the characterization of complex biological matrices, enabling analysis of prokaryotic and eukaryotic organisms and viruses in a single experiment, with the possibility of reconstructing de novo the whole metagenome or a set of genes of interest. One of the main factors limiting the use of shotgun metagenomics on wide scale projects is the high cost associated with the approach. We set out to determine if it is possible to use shallow shotgun metagenomics to characterize complex biological matrices while reducing costs. We used a staggered mock community to estimate the optimal threshold for species detection. We measured the variation of several summary statistics simulating a decrease in sequencing depth by randomly subsampling a number of reads. The main statistics that were compared are diversity estimates, species abundance, and ability of reconstructing de novo the metagenome in terms of length and completeness. Our results show that diversity indices of complex prokaryotic, eukaryotic and viral communities can be accurately estimated with 500,000 reads or less, although particularly complex samples may require 1,000,000 reads. On the contrary, any task involving the reconstruction of the metagenome performed poorly, even with the largest simulated subsample (1,000,000 reads). The length of the reconstructed assembly was smaller than the length obtained with the full dataset, and the proportion of conserved genes that were identified in the meta-genome was drastically reduced compared to the full sample. Shallow shotgun metagenomics can be a useful tool to describe the structure of complex matrices, but it is not adequate to reconstruct—even partially—the metagenome.
format Online
Article
Text
id pubmed-7059852
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-70598522020-03-16 Do you cov me? Effect of coverage reduction on metagenome shotgun sequencing studies Cattonaro, Federica Spadotto, Alessandro Radovic, Slobodanka Marroni, Fabio F1000Res Method Article Shotgun metagenomics sequencing is a powerful tool for the characterization of complex biological matrices, enabling analysis of prokaryotic and eukaryotic organisms and viruses in a single experiment, with the possibility of reconstructing de novo the whole metagenome or a set of genes of interest. One of the main factors limiting the use of shotgun metagenomics on wide scale projects is the high cost associated with the approach. We set out to determine if it is possible to use shallow shotgun metagenomics to characterize complex biological matrices while reducing costs. We used a staggered mock community to estimate the optimal threshold for species detection. We measured the variation of several summary statistics simulating a decrease in sequencing depth by randomly subsampling a number of reads. The main statistics that were compared are diversity estimates, species abundance, and ability of reconstructing de novo the metagenome in terms of length and completeness. Our results show that diversity indices of complex prokaryotic, eukaryotic and viral communities can be accurately estimated with 500,000 reads or less, although particularly complex samples may require 1,000,000 reads. On the contrary, any task involving the reconstruction of the metagenome performed poorly, even with the largest simulated subsample (1,000,000 reads). The length of the reconstructed assembly was smaller than the length obtained with the full dataset, and the proportion of conserved genes that were identified in the meta-genome was drastically reduced compared to the full sample. Shallow shotgun metagenomics can be a useful tool to describe the structure of complex matrices, but it is not adequate to reconstruct—even partially—the metagenome. F1000 Research Limited 2020-01-22 /pmc/articles/PMC7059852/ /pubmed/32185014 http://dx.doi.org/10.12688/f1000research.16804.4 Text en Copyright: © 2020 Cattonaro F et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Method Article
Cattonaro, Federica
Spadotto, Alessandro
Radovic, Slobodanka
Marroni, Fabio
Do you cov me? Effect of coverage reduction on metagenome shotgun sequencing studies
title Do you cov me? Effect of coverage reduction on metagenome shotgun sequencing studies
title_full Do you cov me? Effect of coverage reduction on metagenome shotgun sequencing studies
title_fullStr Do you cov me? Effect of coverage reduction on metagenome shotgun sequencing studies
title_full_unstemmed Do you cov me? Effect of coverage reduction on metagenome shotgun sequencing studies
title_short Do you cov me? Effect of coverage reduction on metagenome shotgun sequencing studies
title_sort do you cov me? effect of coverage reduction on metagenome shotgun sequencing studies
topic Method Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7059852/
https://www.ncbi.nlm.nih.gov/pubmed/32185014
http://dx.doi.org/10.12688/f1000research.16804.4
work_keys_str_mv AT cattonarofederica doyoucovmeeffectofcoveragereductiononmetagenomeshotgunsequencingstudies
AT spadottoalessandro doyoucovmeeffectofcoveragereductiononmetagenomeshotgunsequencingstudies
AT radovicslobodanka doyoucovmeeffectofcoveragereductiononmetagenomeshotgunsequencingstudies
AT marronifabio doyoucovmeeffectofcoveragereductiononmetagenomeshotgunsequencingstudies