Cargando…
An application of compositional data analysis to multiomic time-series data
Compositional data analysis (CoDA) methods have increased in popularity as a new framework for analyzing next-generation sequencing (NGS) data. CoDA methods, such as the centered log-ratio (clr) transformation, adjust for the compositional nature of NGS counts, which is not addressed by traditional...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7671389/ https://www.ncbi.nlm.nih.gov/pubmed/33575625 http://dx.doi.org/10.1093/nargab/lqaa079 |
_version_ | 1783610920000290816 |
---|---|
author | Sisk-Hackworth, Laura Kelley, Scott T |
author_facet | Sisk-Hackworth, Laura Kelley, Scott T |
author_sort | Sisk-Hackworth, Laura |
collection | PubMed |
description | Compositional data analysis (CoDA) methods have increased in popularity as a new framework for analyzing next-generation sequencing (NGS) data. CoDA methods, such as the centered log-ratio (clr) transformation, adjust for the compositional nature of NGS counts, which is not addressed by traditional normalization methods. CoDA has only been sparsely applied to NGS data generated from microbial communities or to multiple ‘omics’ datasets. In this study, we applied CoDA methods to analyze NGS and untargeted metabolomic datasets obtained from bacterial and fungal communities. Specifically, we used clr transformation to reanalyze NGS amplicon and metabolomics data from a study investigating the effects of building material type, moisture and time on microbial and metabolomic diversity. Compared to analysis of untransformed data, analysis of clr-transformed data revealed novel relationships and stronger associations between sample conditions and microbial and metabolic community profiles. |
format | Online Article Text |
id | pubmed-7671389 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-76713892021-02-10 An application of compositional data analysis to multiomic time-series data Sisk-Hackworth, Laura Kelley, Scott T NAR Genom Bioinform Methart Compositional data analysis (CoDA) methods have increased in popularity as a new framework for analyzing next-generation sequencing (NGS) data. CoDA methods, such as the centered log-ratio (clr) transformation, adjust for the compositional nature of NGS counts, which is not addressed by traditional normalization methods. CoDA has only been sparsely applied to NGS data generated from microbial communities or to multiple ‘omics’ datasets. In this study, we applied CoDA methods to analyze NGS and untargeted metabolomic datasets obtained from bacterial and fungal communities. Specifically, we used clr transformation to reanalyze NGS amplicon and metabolomics data from a study investigating the effects of building material type, moisture and time on microbial and metabolomic diversity. Compared to analysis of untransformed data, analysis of clr-transformed data revealed novel relationships and stronger associations between sample conditions and microbial and metabolic community profiles. Oxford University Press 2020-10-02 /pmc/articles/PMC7671389/ /pubmed/33575625 http://dx.doi.org/10.1093/nargab/lqaa079 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methart Sisk-Hackworth, Laura Kelley, Scott T An application of compositional data analysis to multiomic time-series data |
title | An application of compositional data analysis to multiomic time-series data |
title_full | An application of compositional data analysis to multiomic time-series data |
title_fullStr | An application of compositional data analysis to multiomic time-series data |
title_full_unstemmed | An application of compositional data analysis to multiomic time-series data |
title_short | An application of compositional data analysis to multiomic time-series data |
title_sort | application of compositional data analysis to multiomic time-series data |
topic | Methart |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7671389/ https://www.ncbi.nlm.nih.gov/pubmed/33575625 http://dx.doi.org/10.1093/nargab/lqaa079 |
work_keys_str_mv | AT siskhackworthlaura anapplicationofcompositionaldataanalysistomultiomictimeseriesdata AT kelleyscottt anapplicationofcompositionaldataanalysistomultiomictimeseriesdata AT siskhackworthlaura applicationofcompositionaldataanalysistomultiomictimeseriesdata AT kelleyscottt applicationofcompositionaldataanalysistomultiomictimeseriesdata |