Cargando…
Multivariable association discovery in population-scale meta-omics studies
It is challenging to associate features such as human health outcomes, diet, environmental conditions, or other metadata to microbial community measurements, due in part to their quantitative properties. Microbiome multi-omics are typically noisy, sparse (zero-inflated), high-dimensional, extremely...
Autores principales: | , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8714082/ https://www.ncbi.nlm.nih.gov/pubmed/34784344 http://dx.doi.org/10.1371/journal.pcbi.1009442 |
_version_ | 1784623842833989632 |
---|---|
author | Mallick, Himel Rahnavard, Ali McIver, Lauren J. Ma, Siyuan Zhang, Yancong Nguyen, Long H. Tickle, Timothy L. Weingart, George Ren, Boyu Schwager, Emma H. Chatterjee, Suvo Thompson, Kelsey N. Wilkinson, Jeremy E. Subramanian, Ayshwarya Lu, Yiren Waldron, Levi Paulson, Joseph N. Franzosa, Eric A. Bravo, Hector Corrada Huttenhower, Curtis |
author_facet | Mallick, Himel Rahnavard, Ali McIver, Lauren J. Ma, Siyuan Zhang, Yancong Nguyen, Long H. Tickle, Timothy L. Weingart, George Ren, Boyu Schwager, Emma H. Chatterjee, Suvo Thompson, Kelsey N. Wilkinson, Jeremy E. Subramanian, Ayshwarya Lu, Yiren Waldron, Levi Paulson, Joseph N. Franzosa, Eric A. Bravo, Hector Corrada Huttenhower, Curtis |
author_sort | Mallick, Himel |
collection | PubMed |
description | It is challenging to associate features such as human health outcomes, diet, environmental conditions, or other metadata to microbial community measurements, due in part to their quantitative properties. Microbiome multi-omics are typically noisy, sparse (zero-inflated), high-dimensional, extremely non-normal, and often in the form of count or compositional measurements. Here we introduce an optimized combination of novel and established methodology to assess multivariable association of microbial community features with complex metadata in population-scale observational studies. Our approach, MaAsLin 2 (Microbiome Multivariable Associations with Linear Models), uses generalized linear and mixed models to accommodate a wide variety of modern epidemiological studies, including cross-sectional and longitudinal designs, as well as a variety of data types (e.g., counts and relative abundances) with or without covariates and repeated measurements. To construct this method, we conducted a large-scale evaluation of a broad range of scenarios under which straightforward identification of meta-omics associations can be challenging. These simulation studies reveal that MaAsLin 2’s linear model preserves statistical power in the presence of repeated measures and multiple covariates, while accounting for the nuances of meta-omics features and controlling false discovery. We also applied MaAsLin 2 to a microbial multi-omics dataset from the Integrative Human Microbiome (HMP2) project which, in addition to reproducing established results, revealed a unique, integrated landscape of inflammatory bowel diseases (IBD) across multiple time points and omics profiles. |
format | Online Article Text |
id | pubmed-8714082 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-87140822021-12-29 Multivariable association discovery in population-scale meta-omics studies Mallick, Himel Rahnavard, Ali McIver, Lauren J. Ma, Siyuan Zhang, Yancong Nguyen, Long H. Tickle, Timothy L. Weingart, George Ren, Boyu Schwager, Emma H. Chatterjee, Suvo Thompson, Kelsey N. Wilkinson, Jeremy E. Subramanian, Ayshwarya Lu, Yiren Waldron, Levi Paulson, Joseph N. Franzosa, Eric A. Bravo, Hector Corrada Huttenhower, Curtis PLoS Comput Biol Research Article It is challenging to associate features such as human health outcomes, diet, environmental conditions, or other metadata to microbial community measurements, due in part to their quantitative properties. Microbiome multi-omics are typically noisy, sparse (zero-inflated), high-dimensional, extremely non-normal, and often in the form of count or compositional measurements. Here we introduce an optimized combination of novel and established methodology to assess multivariable association of microbial community features with complex metadata in population-scale observational studies. Our approach, MaAsLin 2 (Microbiome Multivariable Associations with Linear Models), uses generalized linear and mixed models to accommodate a wide variety of modern epidemiological studies, including cross-sectional and longitudinal designs, as well as a variety of data types (e.g., counts and relative abundances) with or without covariates and repeated measurements. To construct this method, we conducted a large-scale evaluation of a broad range of scenarios under which straightforward identification of meta-omics associations can be challenging. These simulation studies reveal that MaAsLin 2’s linear model preserves statistical power in the presence of repeated measures and multiple covariates, while accounting for the nuances of meta-omics features and controlling false discovery. We also applied MaAsLin 2 to a microbial multi-omics dataset from the Integrative Human Microbiome (HMP2) project which, in addition to reproducing established results, revealed a unique, integrated landscape of inflammatory bowel diseases (IBD) across multiple time points and omics profiles. Public Library of Science 2021-11-16 /pmc/articles/PMC8714082/ /pubmed/34784344 http://dx.doi.org/10.1371/journal.pcbi.1009442 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication. |
spellingShingle | Research Article Mallick, Himel Rahnavard, Ali McIver, Lauren J. Ma, Siyuan Zhang, Yancong Nguyen, Long H. Tickle, Timothy L. Weingart, George Ren, Boyu Schwager, Emma H. Chatterjee, Suvo Thompson, Kelsey N. Wilkinson, Jeremy E. Subramanian, Ayshwarya Lu, Yiren Waldron, Levi Paulson, Joseph N. Franzosa, Eric A. Bravo, Hector Corrada Huttenhower, Curtis Multivariable association discovery in population-scale meta-omics studies |
title | Multivariable association discovery in population-scale meta-omics studies |
title_full | Multivariable association discovery in population-scale meta-omics studies |
title_fullStr | Multivariable association discovery in population-scale meta-omics studies |
title_full_unstemmed | Multivariable association discovery in population-scale meta-omics studies |
title_short | Multivariable association discovery in population-scale meta-omics studies |
title_sort | multivariable association discovery in population-scale meta-omics studies |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8714082/ https://www.ncbi.nlm.nih.gov/pubmed/34784344 http://dx.doi.org/10.1371/journal.pcbi.1009442 |
work_keys_str_mv | AT mallickhimel multivariableassociationdiscoveryinpopulationscalemetaomicsstudies AT rahnavardali multivariableassociationdiscoveryinpopulationscalemetaomicsstudies AT mciverlaurenj multivariableassociationdiscoveryinpopulationscalemetaomicsstudies AT masiyuan multivariableassociationdiscoveryinpopulationscalemetaomicsstudies AT zhangyancong multivariableassociationdiscoveryinpopulationscalemetaomicsstudies AT nguyenlongh multivariableassociationdiscoveryinpopulationscalemetaomicsstudies AT tickletimothyl multivariableassociationdiscoveryinpopulationscalemetaomicsstudies AT weingartgeorge multivariableassociationdiscoveryinpopulationscalemetaomicsstudies AT renboyu multivariableassociationdiscoveryinpopulationscalemetaomicsstudies AT schwageremmah multivariableassociationdiscoveryinpopulationscalemetaomicsstudies AT chatterjeesuvo multivariableassociationdiscoveryinpopulationscalemetaomicsstudies AT thompsonkelseyn multivariableassociationdiscoveryinpopulationscalemetaomicsstudies AT wilkinsonjeremye multivariableassociationdiscoveryinpopulationscalemetaomicsstudies AT subramanianayshwarya multivariableassociationdiscoveryinpopulationscalemetaomicsstudies AT luyiren multivariableassociationdiscoveryinpopulationscalemetaomicsstudies AT waldronlevi multivariableassociationdiscoveryinpopulationscalemetaomicsstudies AT paulsonjosephn multivariableassociationdiscoveryinpopulationscalemetaomicsstudies AT franzosaerica multivariableassociationdiscoveryinpopulationscalemetaomicsstudies AT bravohectorcorrada multivariableassociationdiscoveryinpopulationscalemetaomicsstudies AT huttenhowercurtis multivariableassociationdiscoveryinpopulationscalemetaomicsstudies |