Cargando…
Statistical approaches for differential expression analysis in metatranscriptomics
MOTIVATION: Metatranscriptomics (MTX) has become an increasingly practical way to profile the functional activity of microbial communities in situ. However, MTX remains underutilized due to experimental and computational limitations. The latter are complicated by non-independent changes in both RNA...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275336/ https://www.ncbi.nlm.nih.gov/pubmed/34252963 http://dx.doi.org/10.1093/bioinformatics/btab327 |
_version_ | 1783721692817784832 |
---|---|
author | Zhang, Yancong Thompson, Kelsey N Huttenhower, Curtis Franzosa, Eric A |
author_facet | Zhang, Yancong Thompson, Kelsey N Huttenhower, Curtis Franzosa, Eric A |
author_sort | Zhang, Yancong |
collection | PubMed |
description | MOTIVATION: Metatranscriptomics (MTX) has become an increasingly practical way to profile the functional activity of microbial communities in situ. However, MTX remains underutilized due to experimental and computational limitations. The latter are complicated by non-independent changes in both RNA transcript levels and their underlying genomic DNA copies (as microbes simultaneously change their overall abundance in the population and regulate individual transcripts), genetic plasticity (as whole loci are frequently gained and lost in microbial lineages) and measurement compositionality and zero-inflation. Here, we present a systematic evaluation of and recommendations for differential expression (DE) analysis in MTX. RESULTS: We designed and assessed six statistical models for DE discovery in MTX that incorporate different combinations of DNA and RNA normalization and assumptions about the underlying changes of gene copies or species abundance within communities. We evaluated these models on multiple simulated and real multi-omic datasets. Models adjusting transcripts relative to their encoding gene copies as a covariate were significantly more accurate in identifying DE from MTX in both simulated and real datasets. Moreover, we show that when paired DNA measurements (metagenomic data) are not available, models normalizing MTX measurements within-species while also adjusting for total-species RNA balance sensitivity, specificity and interpretability of DE detection, as does filtering likely technical zeros. The efficiency and accuracy of these models pave the way for more effective MTX-based DE discovery in microbial communities. AVAILABILITY AND IMPLEMENTATION: The analysis code and synthetic datasets used in this evaluation are available online at http://huttenhower.sph.harvard.edu/mtx2021. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-8275336 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-82753362021-07-13 Statistical approaches for differential expression analysis in metatranscriptomics Zhang, Yancong Thompson, Kelsey N Huttenhower, Curtis Franzosa, Eric A Bioinformatics Bioinformatics of Microbes and Microbiomes MOTIVATION: Metatranscriptomics (MTX) has become an increasingly practical way to profile the functional activity of microbial communities in situ. However, MTX remains underutilized due to experimental and computational limitations. The latter are complicated by non-independent changes in both RNA transcript levels and their underlying genomic DNA copies (as microbes simultaneously change their overall abundance in the population and regulate individual transcripts), genetic plasticity (as whole loci are frequently gained and lost in microbial lineages) and measurement compositionality and zero-inflation. Here, we present a systematic evaluation of and recommendations for differential expression (DE) analysis in MTX. RESULTS: We designed and assessed six statistical models for DE discovery in MTX that incorporate different combinations of DNA and RNA normalization and assumptions about the underlying changes of gene copies or species abundance within communities. We evaluated these models on multiple simulated and real multi-omic datasets. Models adjusting transcripts relative to their encoding gene copies as a covariate were significantly more accurate in identifying DE from MTX in both simulated and real datasets. Moreover, we show that when paired DNA measurements (metagenomic data) are not available, models normalizing MTX measurements within-species while also adjusting for total-species RNA balance sensitivity, specificity and interpretability of DE detection, as does filtering likely technical zeros. The efficiency and accuracy of these models pave the way for more effective MTX-based DE discovery in microbial communities. AVAILABILITY AND IMPLEMENTATION: The analysis code and synthetic datasets used in this evaluation are available online at http://huttenhower.sph.harvard.edu/mtx2021. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-07-12 /pmc/articles/PMC8275336/ /pubmed/34252963 http://dx.doi.org/10.1093/bioinformatics/btab327 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Bioinformatics of Microbes and Microbiomes Zhang, Yancong Thompson, Kelsey N Huttenhower, Curtis Franzosa, Eric A Statistical approaches for differential expression analysis in metatranscriptomics |
title | Statistical approaches for differential expression analysis in metatranscriptomics |
title_full | Statistical approaches for differential expression analysis in metatranscriptomics |
title_fullStr | Statistical approaches for differential expression analysis in metatranscriptomics |
title_full_unstemmed | Statistical approaches for differential expression analysis in metatranscriptomics |
title_short | Statistical approaches for differential expression analysis in metatranscriptomics |
title_sort | statistical approaches for differential expression analysis in metatranscriptomics |
topic | Bioinformatics of Microbes and Microbiomes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275336/ https://www.ncbi.nlm.nih.gov/pubmed/34252963 http://dx.doi.org/10.1093/bioinformatics/btab327 |
work_keys_str_mv | AT zhangyancong statisticalapproachesfordifferentialexpressionanalysisinmetatranscriptomics AT thompsonkelseyn statisticalapproachesfordifferentialexpressionanalysisinmetatranscriptomics AT huttenhowercurtis statisticalapproachesfordifferentialexpressionanalysisinmetatranscriptomics AT franzosaerica statisticalapproachesfordifferentialexpressionanalysisinmetatranscriptomics |