Cargando…
A priori estimation of sequencing effort in complex microbial metatranscriptomes
Metatranscriptome analysis or the analysis of the expression profiles of whole microbial communities has the additional challenge of dealing with a complex system with dozens of different organisms expressing genes simultaneously. An underlying issue for virtually all metatranscriptomic sequencing e...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7713979/ https://www.ncbi.nlm.nih.gov/pubmed/33304545 http://dx.doi.org/10.1002/ece3.6941 |
_version_ | 1783618657375485952 |
---|---|
author | Monleon‐Getino, Toni Frias‐Lopez, Jorge |
author_facet | Monleon‐Getino, Toni Frias‐Lopez, Jorge |
author_sort | Monleon‐Getino, Toni |
collection | PubMed |
description | Metatranscriptome analysis or the analysis of the expression profiles of whole microbial communities has the additional challenge of dealing with a complex system with dozens of different organisms expressing genes simultaneously. An underlying issue for virtually all metatranscriptomic sequencing experiments is how to allocate the limited sequencing budget while guaranteeing that the libraries have sufficient depth to cover the breadth of expression of the community. Estimating the required sequencing depth to effectively sample the target metatranscriptome using RNA‐seq is an essential first step to obtain robust results in subsequent analysis and to avoid overexpansion, once the information contained in the library reaches saturation. Here, we present a method to calculate the sequencing effort using a simulated series of metatranscriptomic/metagenomic matrices. This method is based on an extrapolation rarefaction curve using a Weibull growth model to estimate the maximum number of observed genes as a function of sequencing depth. This approach allowed us to compute the effort at different confidence intervals and to obtain an approximate a priori effort based on an initial fraction of sequences. The analytical pipeline presented here may be successfully used for the in‐depth and time‐effective characterization of complex microbial communities, representing a useful tool for the microbiome research community. |
format | Online Article Text |
id | pubmed-7713979 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-77139792020-12-09 A priori estimation of sequencing effort in complex microbial metatranscriptomes Monleon‐Getino, Toni Frias‐Lopez, Jorge Ecol Evol Original Research Metatranscriptome analysis or the analysis of the expression profiles of whole microbial communities has the additional challenge of dealing with a complex system with dozens of different organisms expressing genes simultaneously. An underlying issue for virtually all metatranscriptomic sequencing experiments is how to allocate the limited sequencing budget while guaranteeing that the libraries have sufficient depth to cover the breadth of expression of the community. Estimating the required sequencing depth to effectively sample the target metatranscriptome using RNA‐seq is an essential first step to obtain robust results in subsequent analysis and to avoid overexpansion, once the information contained in the library reaches saturation. Here, we present a method to calculate the sequencing effort using a simulated series of metatranscriptomic/metagenomic matrices. This method is based on an extrapolation rarefaction curve using a Weibull growth model to estimate the maximum number of observed genes as a function of sequencing depth. This approach allowed us to compute the effort at different confidence intervals and to obtain an approximate a priori effort based on an initial fraction of sequences. The analytical pipeline presented here may be successfully used for the in‐depth and time‐effective characterization of complex microbial communities, representing a useful tool for the microbiome research community. John Wiley and Sons Inc. 2020-11-05 /pmc/articles/PMC7713979/ /pubmed/33304545 http://dx.doi.org/10.1002/ece3.6941 Text en © 2020 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Research Monleon‐Getino, Toni Frias‐Lopez, Jorge A priori estimation of sequencing effort in complex microbial metatranscriptomes |
title | A priori estimation of sequencing effort in complex microbial metatranscriptomes |
title_full | A priori estimation of sequencing effort in complex microbial metatranscriptomes |
title_fullStr | A priori estimation of sequencing effort in complex microbial metatranscriptomes |
title_full_unstemmed | A priori estimation of sequencing effort in complex microbial metatranscriptomes |
title_short | A priori estimation of sequencing effort in complex microbial metatranscriptomes |
title_sort | priori estimation of sequencing effort in complex microbial metatranscriptomes |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7713979/ https://www.ncbi.nlm.nih.gov/pubmed/33304545 http://dx.doi.org/10.1002/ece3.6941 |
work_keys_str_mv | AT monleongetinotoni aprioriestimationofsequencingeffortincomplexmicrobialmetatranscriptomes AT friaslopezjorge aprioriestimationofsequencingeffortincomplexmicrobialmetatranscriptomes AT monleongetinotoni prioriestimationofsequencingeffortincomplexmicrobialmetatranscriptomes AT friaslopezjorge prioriestimationofsequencingeffortincomplexmicrobialmetatranscriptomes |