Cargando…

A priori estimation of sequencing effort in complex microbial metatranscriptomes

Metatranscriptome analysis or the analysis of the expression profiles of whole microbial communities has the additional challenge of dealing with a complex system with dozens of different organisms expressing genes simultaneously. An underlying issue for virtually all metatranscriptomic sequencing e...

Descripción completa

Detalles Bibliográficos
Autores principales: Monleon‐Getino, Toni, Frias‐Lopez, Jorge
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7713979/
https://www.ncbi.nlm.nih.gov/pubmed/33304545
http://dx.doi.org/10.1002/ece3.6941
_version_ 1783618657375485952
author Monleon‐Getino, Toni
Frias‐Lopez, Jorge
author_facet Monleon‐Getino, Toni
Frias‐Lopez, Jorge
author_sort Monleon‐Getino, Toni
collection PubMed
description Metatranscriptome analysis or the analysis of the expression profiles of whole microbial communities has the additional challenge of dealing with a complex system with dozens of different organisms expressing genes simultaneously. An underlying issue for virtually all metatranscriptomic sequencing experiments is how to allocate the limited sequencing budget while guaranteeing that the libraries have sufficient depth to cover the breadth of expression of the community. Estimating the required sequencing depth to effectively sample the target metatranscriptome using RNA‐seq is an essential first step to obtain robust results in subsequent analysis and to avoid overexpansion, once the information contained in the library reaches saturation. Here, we present a method to calculate the sequencing effort using a simulated series of metatranscriptomic/metagenomic matrices. This method is based on an extrapolation rarefaction curve using a Weibull growth model to estimate the maximum number of observed genes as a function of sequencing depth. This approach allowed us to compute the effort at different confidence intervals and to obtain an approximate a priori effort based on an initial fraction of sequences. The analytical pipeline presented here may be successfully used for the in‐depth and time‐effective characterization of complex microbial communities, representing a useful tool for the microbiome research community.
format Online
Article
Text
id pubmed-7713979
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-77139792020-12-09 A priori estimation of sequencing effort in complex microbial metatranscriptomes Monleon‐Getino, Toni Frias‐Lopez, Jorge Ecol Evol Original Research Metatranscriptome analysis or the analysis of the expression profiles of whole microbial communities has the additional challenge of dealing with a complex system with dozens of different organisms expressing genes simultaneously. An underlying issue for virtually all metatranscriptomic sequencing experiments is how to allocate the limited sequencing budget while guaranteeing that the libraries have sufficient depth to cover the breadth of expression of the community. Estimating the required sequencing depth to effectively sample the target metatranscriptome using RNA‐seq is an essential first step to obtain robust results in subsequent analysis and to avoid overexpansion, once the information contained in the library reaches saturation. Here, we present a method to calculate the sequencing effort using a simulated series of metatranscriptomic/metagenomic matrices. This method is based on an extrapolation rarefaction curve using a Weibull growth model to estimate the maximum number of observed genes as a function of sequencing depth. This approach allowed us to compute the effort at different confidence intervals and to obtain an approximate a priori effort based on an initial fraction of sequences. The analytical pipeline presented here may be successfully used for the in‐depth and time‐effective characterization of complex microbial communities, representing a useful tool for the microbiome research community. John Wiley and Sons Inc. 2020-11-05 /pmc/articles/PMC7713979/ /pubmed/33304545 http://dx.doi.org/10.1002/ece3.6941 Text en © 2020 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Research
Monleon‐Getino, Toni
Frias‐Lopez, Jorge
A priori estimation of sequencing effort in complex microbial metatranscriptomes
title A priori estimation of sequencing effort in complex microbial metatranscriptomes
title_full A priori estimation of sequencing effort in complex microbial metatranscriptomes
title_fullStr A priori estimation of sequencing effort in complex microbial metatranscriptomes
title_full_unstemmed A priori estimation of sequencing effort in complex microbial metatranscriptomes
title_short A priori estimation of sequencing effort in complex microbial metatranscriptomes
title_sort priori estimation of sequencing effort in complex microbial metatranscriptomes
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7713979/
https://www.ncbi.nlm.nih.gov/pubmed/33304545
http://dx.doi.org/10.1002/ece3.6941
work_keys_str_mv AT monleongetinotoni aprioriestimationofsequencingeffortincomplexmicrobialmetatranscriptomes
AT friaslopezjorge aprioriestimationofsequencingeffortincomplexmicrobialmetatranscriptomes
AT monleongetinotoni prioriestimationofsequencingeffortincomplexmicrobialmetatranscriptomes
AT friaslopezjorge prioriestimationofsequencingeffortincomplexmicrobialmetatranscriptomes