Cargando…
Measuring associations between the microbiota and repeated measures of continuous clinical variables using a lasso-penalized generalized linear mixed model
BACKGROUND: Human microbiome studies in clinical settings generally focus on distinguishing the microbiota in health from that in disease at a specific point in time. However, microbiome samples may be associated with disease severity or continuous clinical health indicators that are often assessed...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6003033/ https://www.ncbi.nlm.nih.gov/pubmed/29983746 http://dx.doi.org/10.1186/s13040-018-0173-9 |
_version_ | 1783332293091262464 |
---|---|
author | Tipton, Laura Cuenco, Karen T. Huang, Laurence Greenblatt, Ruth M. Kleerup, Eric Sciurba, Frank Duncan, Steven R. Donahoe, Michael P. Morris, Alison Ghedin, Elodie |
author_facet | Tipton, Laura Cuenco, Karen T. Huang, Laurence Greenblatt, Ruth M. Kleerup, Eric Sciurba, Frank Duncan, Steven R. Donahoe, Michael P. Morris, Alison Ghedin, Elodie |
author_sort | Tipton, Laura |
collection | PubMed |
description | BACKGROUND: Human microbiome studies in clinical settings generally focus on distinguishing the microbiota in health from that in disease at a specific point in time. However, microbiome samples may be associated with disease severity or continuous clinical health indicators that are often assessed at multiple time points. While the temporal data from clinical and microbiome samples may be informative, analysis of this type of data can be problematic for standard statistical methods. RESULTS: To identify associations between microbiota and continuous clinical variables measured repeatedly in two studies of the respiratory tract, we adapted a statistical method, the lasso-penalized generalized linear mixed model (LassoGLMM). LassoGLMM can screen for associated clinical variables, incorporate repeated measures of individuals, and address the large number of species found in the microbiome. As is common in microbiome studies, when the number of variables is an order of magnitude larger than the number of samples LassoGLMM can be imperfect in its variable selection. We overcome this limitation by adding a pre-screening step to reduce the number of variables evaluated in the model. We assessed the use of this adapted two-stage LassoGLMM for its ability to determine which microbes are associated with continuous repeated clinical measures. We found associations (retaining a non-zero coefficient in the LassoGLMM) between 10 laboratory measurements and 43 bacterial genera in the oral microbiota, and between 2 cytokines and 3 bacterial genera in the lung. We compared our associations with those identified by the Wilcoxon test after dichotomizing our outcomes and identified a non-significant trend towards differential abundance between high and low outcomes. Our two-step LassoGLMM explained more of the variance seen in the outcome of interest than other variants of the LassoGLMM method. CONCLUSIONS: We demonstrated a method that can account for the large number of genera detected in microbiome studies and repeated measures of clinical or longitudinal studies, allowing for the detection of strong associations between microbes and clinical measures. By incorporating the design strengths of repeated measurements and a prescreening step to aid variable selection, our two-step LassoGLMM will be a useful analytic method for investigating relationships between microbes and repeatedly measured continuous outcomes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13040-018-0173-9) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6003033 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-60030332018-07-06 Measuring associations between the microbiota and repeated measures of continuous clinical variables using a lasso-penalized generalized linear mixed model Tipton, Laura Cuenco, Karen T. Huang, Laurence Greenblatt, Ruth M. Kleerup, Eric Sciurba, Frank Duncan, Steven R. Donahoe, Michael P. Morris, Alison Ghedin, Elodie BioData Min Research BACKGROUND: Human microbiome studies in clinical settings generally focus on distinguishing the microbiota in health from that in disease at a specific point in time. However, microbiome samples may be associated with disease severity or continuous clinical health indicators that are often assessed at multiple time points. While the temporal data from clinical and microbiome samples may be informative, analysis of this type of data can be problematic for standard statistical methods. RESULTS: To identify associations between microbiota and continuous clinical variables measured repeatedly in two studies of the respiratory tract, we adapted a statistical method, the lasso-penalized generalized linear mixed model (LassoGLMM). LassoGLMM can screen for associated clinical variables, incorporate repeated measures of individuals, and address the large number of species found in the microbiome. As is common in microbiome studies, when the number of variables is an order of magnitude larger than the number of samples LassoGLMM can be imperfect in its variable selection. We overcome this limitation by adding a pre-screening step to reduce the number of variables evaluated in the model. We assessed the use of this adapted two-stage LassoGLMM for its ability to determine which microbes are associated with continuous repeated clinical measures. We found associations (retaining a non-zero coefficient in the LassoGLMM) between 10 laboratory measurements and 43 bacterial genera in the oral microbiota, and between 2 cytokines and 3 bacterial genera in the lung. We compared our associations with those identified by the Wilcoxon test after dichotomizing our outcomes and identified a non-significant trend towards differential abundance between high and low outcomes. Our two-step LassoGLMM explained more of the variance seen in the outcome of interest than other variants of the LassoGLMM method. CONCLUSIONS: We demonstrated a method that can account for the large number of genera detected in microbiome studies and repeated measures of clinical or longitudinal studies, allowing for the detection of strong associations between microbes and clinical measures. By incorporating the design strengths of repeated measurements and a prescreening step to aid variable selection, our two-step LassoGLMM will be a useful analytic method for investigating relationships between microbes and repeatedly measured continuous outcomes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13040-018-0173-9) contains supplementary material, which is available to authorized users. BioMed Central 2018-06-15 /pmc/articles/PMC6003033/ /pubmed/29983746 http://dx.doi.org/10.1186/s13040-018-0173-9 Text en © The Author(s). 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Tipton, Laura Cuenco, Karen T. Huang, Laurence Greenblatt, Ruth M. Kleerup, Eric Sciurba, Frank Duncan, Steven R. Donahoe, Michael P. Morris, Alison Ghedin, Elodie Measuring associations between the microbiota and repeated measures of continuous clinical variables using a lasso-penalized generalized linear mixed model |
title | Measuring associations between the microbiota and repeated measures of continuous clinical variables using a lasso-penalized generalized linear mixed model |
title_full | Measuring associations between the microbiota and repeated measures of continuous clinical variables using a lasso-penalized generalized linear mixed model |
title_fullStr | Measuring associations between the microbiota and repeated measures of continuous clinical variables using a lasso-penalized generalized linear mixed model |
title_full_unstemmed | Measuring associations between the microbiota and repeated measures of continuous clinical variables using a lasso-penalized generalized linear mixed model |
title_short | Measuring associations between the microbiota and repeated measures of continuous clinical variables using a lasso-penalized generalized linear mixed model |
title_sort | measuring associations between the microbiota and repeated measures of continuous clinical variables using a lasso-penalized generalized linear mixed model |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6003033/ https://www.ncbi.nlm.nih.gov/pubmed/29983746 http://dx.doi.org/10.1186/s13040-018-0173-9 |
work_keys_str_mv | AT tiptonlaura measuringassociationsbetweenthemicrobiotaandrepeatedmeasuresofcontinuousclinicalvariablesusingalassopenalizedgeneralizedlinearmixedmodel AT cuencokarent measuringassociationsbetweenthemicrobiotaandrepeatedmeasuresofcontinuousclinicalvariablesusingalassopenalizedgeneralizedlinearmixedmodel AT huanglaurence measuringassociationsbetweenthemicrobiotaandrepeatedmeasuresofcontinuousclinicalvariablesusingalassopenalizedgeneralizedlinearmixedmodel AT greenblattruthm measuringassociationsbetweenthemicrobiotaandrepeatedmeasuresofcontinuousclinicalvariablesusingalassopenalizedgeneralizedlinearmixedmodel AT kleeruperic measuringassociationsbetweenthemicrobiotaandrepeatedmeasuresofcontinuousclinicalvariablesusingalassopenalizedgeneralizedlinearmixedmodel AT sciurbafrank measuringassociationsbetweenthemicrobiotaandrepeatedmeasuresofcontinuousclinicalvariablesusingalassopenalizedgeneralizedlinearmixedmodel AT duncanstevenr measuringassociationsbetweenthemicrobiotaandrepeatedmeasuresofcontinuousclinicalvariablesusingalassopenalizedgeneralizedlinearmixedmodel AT donahoemichaelp measuringassociationsbetweenthemicrobiotaandrepeatedmeasuresofcontinuousclinicalvariablesusingalassopenalizedgeneralizedlinearmixedmodel AT morrisalison measuringassociationsbetweenthemicrobiotaandrepeatedmeasuresofcontinuousclinicalvariablesusingalassopenalizedgeneralizedlinearmixedmodel AT ghedinelodie measuringassociationsbetweenthemicrobiotaandrepeatedmeasuresofcontinuousclinicalvariablesusingalassopenalizedgeneralizedlinearmixedmodel |