Cargando…

An application of compositional data analysis to multiomic time-series data

Compositional data analysis (CoDA) methods have increased in popularity as a new framework for analyzing next-generation sequencing (NGS) data. CoDA methods, such as the centered log-ratio (clr) transformation, adjust for the compositional nature of NGS counts, which is not addressed by traditional...

Descripción completa

Detalles Bibliográficos
Autores principales: Sisk-Hackworth, Laura, Kelley, Scott T
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7671389/
https://www.ncbi.nlm.nih.gov/pubmed/33575625
http://dx.doi.org/10.1093/nargab/lqaa079
_version_ 1783610920000290816
author Sisk-Hackworth, Laura
Kelley, Scott T
author_facet Sisk-Hackworth, Laura
Kelley, Scott T
author_sort Sisk-Hackworth, Laura
collection PubMed
description Compositional data analysis (CoDA) methods have increased in popularity as a new framework for analyzing next-generation sequencing (NGS) data. CoDA methods, such as the centered log-ratio (clr) transformation, adjust for the compositional nature of NGS counts, which is not addressed by traditional normalization methods. CoDA has only been sparsely applied to NGS data generated from microbial communities or to multiple ‘omics’ datasets. In this study, we applied CoDA methods to analyze NGS and untargeted metabolomic datasets obtained from bacterial and fungal communities. Specifically, we used clr transformation to reanalyze NGS amplicon and metabolomics data from a study investigating the effects of building material type, moisture and time on microbial and metabolomic diversity. Compared to analysis of untransformed data, analysis of clr-transformed data revealed novel relationships and stronger associations between sample conditions and microbial and metabolic community profiles.
format Online
Article
Text
id pubmed-7671389
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-76713892021-02-10 An application of compositional data analysis to multiomic time-series data Sisk-Hackworth, Laura Kelley, Scott T NAR Genom Bioinform Methart Compositional data analysis (CoDA) methods have increased in popularity as a new framework for analyzing next-generation sequencing (NGS) data. CoDA methods, such as the centered log-ratio (clr) transformation, adjust for the compositional nature of NGS counts, which is not addressed by traditional normalization methods. CoDA has only been sparsely applied to NGS data generated from microbial communities or to multiple ‘omics’ datasets. In this study, we applied CoDA methods to analyze NGS and untargeted metabolomic datasets obtained from bacterial and fungal communities. Specifically, we used clr transformation to reanalyze NGS amplicon and metabolomics data from a study investigating the effects of building material type, moisture and time on microbial and metabolomic diversity. Compared to analysis of untransformed data, analysis of clr-transformed data revealed novel relationships and stronger associations between sample conditions and microbial and metabolic community profiles. Oxford University Press 2020-10-02 /pmc/articles/PMC7671389/ /pubmed/33575625 http://dx.doi.org/10.1093/nargab/lqaa079 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methart
Sisk-Hackworth, Laura
Kelley, Scott T
An application of compositional data analysis to multiomic time-series data
title An application of compositional data analysis to multiomic time-series data
title_full An application of compositional data analysis to multiomic time-series data
title_fullStr An application of compositional data analysis to multiomic time-series data
title_full_unstemmed An application of compositional data analysis to multiomic time-series data
title_short An application of compositional data analysis to multiomic time-series data
title_sort application of compositional data analysis to multiomic time-series data
topic Methart
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7671389/
https://www.ncbi.nlm.nih.gov/pubmed/33575625
http://dx.doi.org/10.1093/nargab/lqaa079
work_keys_str_mv AT siskhackworthlaura anapplicationofcompositionaldataanalysistomultiomictimeseriesdata
AT kelleyscottt anapplicationofcompositionaldataanalysistomultiomictimeseriesdata
AT siskhackworthlaura applicationofcompositionaldataanalysistomultiomictimeseriesdata
AT kelleyscottt applicationofcompositionaldataanalysistomultiomictimeseriesdata