Cargando…

Multivariable association discovery in population-scale meta-omics studies

It is challenging to associate features such as human health outcomes, diet, environmental conditions, or other metadata to microbial community measurements, due in part to their quantitative properties. Microbiome multi-omics are typically noisy, sparse (zero-inflated), high-dimensional, extremely...

Descripción completa

Detalles Bibliográficos
Autores principales: Mallick, Himel, Rahnavard, Ali, McIver, Lauren J., Ma, Siyuan, Zhang, Yancong, Nguyen, Long H., Tickle, Timothy L., Weingart, George, Ren, Boyu, Schwager, Emma H., Chatterjee, Suvo, Thompson, Kelsey N., Wilkinson, Jeremy E., Subramanian, Ayshwarya, Lu, Yiren, Waldron, Levi, Paulson, Joseph N., Franzosa, Eric A., Bravo, Hector Corrada, Huttenhower, Curtis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8714082/
https://www.ncbi.nlm.nih.gov/pubmed/34784344
http://dx.doi.org/10.1371/journal.pcbi.1009442
_version_ 1784623842833989632
author Mallick, Himel
Rahnavard, Ali
McIver, Lauren J.
Ma, Siyuan
Zhang, Yancong
Nguyen, Long H.
Tickle, Timothy L.
Weingart, George
Ren, Boyu
Schwager, Emma H.
Chatterjee, Suvo
Thompson, Kelsey N.
Wilkinson, Jeremy E.
Subramanian, Ayshwarya
Lu, Yiren
Waldron, Levi
Paulson, Joseph N.
Franzosa, Eric A.
Bravo, Hector Corrada
Huttenhower, Curtis
author_facet Mallick, Himel
Rahnavard, Ali
McIver, Lauren J.
Ma, Siyuan
Zhang, Yancong
Nguyen, Long H.
Tickle, Timothy L.
Weingart, George
Ren, Boyu
Schwager, Emma H.
Chatterjee, Suvo
Thompson, Kelsey N.
Wilkinson, Jeremy E.
Subramanian, Ayshwarya
Lu, Yiren
Waldron, Levi
Paulson, Joseph N.
Franzosa, Eric A.
Bravo, Hector Corrada
Huttenhower, Curtis
author_sort Mallick, Himel
collection PubMed
description It is challenging to associate features such as human health outcomes, diet, environmental conditions, or other metadata to microbial community measurements, due in part to their quantitative properties. Microbiome multi-omics are typically noisy, sparse (zero-inflated), high-dimensional, extremely non-normal, and often in the form of count or compositional measurements. Here we introduce an optimized combination of novel and established methodology to assess multivariable association of microbial community features with complex metadata in population-scale observational studies. Our approach, MaAsLin 2 (Microbiome Multivariable Associations with Linear Models), uses generalized linear and mixed models to accommodate a wide variety of modern epidemiological studies, including cross-sectional and longitudinal designs, as well as a variety of data types (e.g., counts and relative abundances) with or without covariates and repeated measurements. To construct this method, we conducted a large-scale evaluation of a broad range of scenarios under which straightforward identification of meta-omics associations can be challenging. These simulation studies reveal that MaAsLin 2’s linear model preserves statistical power in the presence of repeated measures and multiple covariates, while accounting for the nuances of meta-omics features and controlling false discovery. We also applied MaAsLin 2 to a microbial multi-omics dataset from the Integrative Human Microbiome (HMP2) project which, in addition to reproducing established results, revealed a unique, integrated landscape of inflammatory bowel diseases (IBD) across multiple time points and omics profiles.
format Online
Article
Text
id pubmed-8714082
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-87140822021-12-29 Multivariable association discovery in population-scale meta-omics studies Mallick, Himel Rahnavard, Ali McIver, Lauren J. Ma, Siyuan Zhang, Yancong Nguyen, Long H. Tickle, Timothy L. Weingart, George Ren, Boyu Schwager, Emma H. Chatterjee, Suvo Thompson, Kelsey N. Wilkinson, Jeremy E. Subramanian, Ayshwarya Lu, Yiren Waldron, Levi Paulson, Joseph N. Franzosa, Eric A. Bravo, Hector Corrada Huttenhower, Curtis PLoS Comput Biol Research Article It is challenging to associate features such as human health outcomes, diet, environmental conditions, or other metadata to microbial community measurements, due in part to their quantitative properties. Microbiome multi-omics are typically noisy, sparse (zero-inflated), high-dimensional, extremely non-normal, and often in the form of count or compositional measurements. Here we introduce an optimized combination of novel and established methodology to assess multivariable association of microbial community features with complex metadata in population-scale observational studies. Our approach, MaAsLin 2 (Microbiome Multivariable Associations with Linear Models), uses generalized linear and mixed models to accommodate a wide variety of modern epidemiological studies, including cross-sectional and longitudinal designs, as well as a variety of data types (e.g., counts and relative abundances) with or without covariates and repeated measurements. To construct this method, we conducted a large-scale evaluation of a broad range of scenarios under which straightforward identification of meta-omics associations can be challenging. These simulation studies reveal that MaAsLin 2’s linear model preserves statistical power in the presence of repeated measures and multiple covariates, while accounting for the nuances of meta-omics features and controlling false discovery. We also applied MaAsLin 2 to a microbial multi-omics dataset from the Integrative Human Microbiome (HMP2) project which, in addition to reproducing established results, revealed a unique, integrated landscape of inflammatory bowel diseases (IBD) across multiple time points and omics profiles. Public Library of Science 2021-11-16 /pmc/articles/PMC8714082/ /pubmed/34784344 http://dx.doi.org/10.1371/journal.pcbi.1009442 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Mallick, Himel
Rahnavard, Ali
McIver, Lauren J.
Ma, Siyuan
Zhang, Yancong
Nguyen, Long H.
Tickle, Timothy L.
Weingart, George
Ren, Boyu
Schwager, Emma H.
Chatterjee, Suvo
Thompson, Kelsey N.
Wilkinson, Jeremy E.
Subramanian, Ayshwarya
Lu, Yiren
Waldron, Levi
Paulson, Joseph N.
Franzosa, Eric A.
Bravo, Hector Corrada
Huttenhower, Curtis
Multivariable association discovery in population-scale meta-omics studies
title Multivariable association discovery in population-scale meta-omics studies
title_full Multivariable association discovery in population-scale meta-omics studies
title_fullStr Multivariable association discovery in population-scale meta-omics studies
title_full_unstemmed Multivariable association discovery in population-scale meta-omics studies
title_short Multivariable association discovery in population-scale meta-omics studies
title_sort multivariable association discovery in population-scale meta-omics studies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8714082/
https://www.ncbi.nlm.nih.gov/pubmed/34784344
http://dx.doi.org/10.1371/journal.pcbi.1009442
work_keys_str_mv AT mallickhimel multivariableassociationdiscoveryinpopulationscalemetaomicsstudies
AT rahnavardali multivariableassociationdiscoveryinpopulationscalemetaomicsstudies
AT mciverlaurenj multivariableassociationdiscoveryinpopulationscalemetaomicsstudies
AT masiyuan multivariableassociationdiscoveryinpopulationscalemetaomicsstudies
AT zhangyancong multivariableassociationdiscoveryinpopulationscalemetaomicsstudies
AT nguyenlongh multivariableassociationdiscoveryinpopulationscalemetaomicsstudies
AT tickletimothyl multivariableassociationdiscoveryinpopulationscalemetaomicsstudies
AT weingartgeorge multivariableassociationdiscoveryinpopulationscalemetaomicsstudies
AT renboyu multivariableassociationdiscoveryinpopulationscalemetaomicsstudies
AT schwageremmah multivariableassociationdiscoveryinpopulationscalemetaomicsstudies
AT chatterjeesuvo multivariableassociationdiscoveryinpopulationscalemetaomicsstudies
AT thompsonkelseyn multivariableassociationdiscoveryinpopulationscalemetaomicsstudies
AT wilkinsonjeremye multivariableassociationdiscoveryinpopulationscalemetaomicsstudies
AT subramanianayshwarya multivariableassociationdiscoveryinpopulationscalemetaomicsstudies
AT luyiren multivariableassociationdiscoveryinpopulationscalemetaomicsstudies
AT waldronlevi multivariableassociationdiscoveryinpopulationscalemetaomicsstudies
AT paulsonjosephn multivariableassociationdiscoveryinpopulationscalemetaomicsstudies
AT franzosaerica multivariableassociationdiscoveryinpopulationscalemetaomicsstudies
AT bravohectorcorrada multivariableassociationdiscoveryinpopulationscalemetaomicsstudies
AT huttenhowercurtis multivariableassociationdiscoveryinpopulationscalemetaomicsstudies