Cargando…

Measuring associations between the microbiota and repeated measures of continuous clinical variables using a lasso-penalized generalized linear mixed model

BACKGROUND: Human microbiome studies in clinical settings generally focus on distinguishing the microbiota in health from that in disease at a specific point in time. However, microbiome samples may be associated with disease severity or continuous clinical health indicators that are often assessed...

Descripción completa

Detalles Bibliográficos
Autores principales: Tipton, Laura, Cuenco, Karen T., Huang, Laurence, Greenblatt, Ruth M., Kleerup, Eric, Sciurba, Frank, Duncan, Steven R., Donahoe, Michael P., Morris, Alison, Ghedin, Elodie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6003033/
https://www.ncbi.nlm.nih.gov/pubmed/29983746
http://dx.doi.org/10.1186/s13040-018-0173-9
_version_ 1783332293091262464
author Tipton, Laura
Cuenco, Karen T.
Huang, Laurence
Greenblatt, Ruth M.
Kleerup, Eric
Sciurba, Frank
Duncan, Steven R.
Donahoe, Michael P.
Morris, Alison
Ghedin, Elodie
author_facet Tipton, Laura
Cuenco, Karen T.
Huang, Laurence
Greenblatt, Ruth M.
Kleerup, Eric
Sciurba, Frank
Duncan, Steven R.
Donahoe, Michael P.
Morris, Alison
Ghedin, Elodie
author_sort Tipton, Laura
collection PubMed
description BACKGROUND: Human microbiome studies in clinical settings generally focus on distinguishing the microbiota in health from that in disease at a specific point in time. However, microbiome samples may be associated with disease severity or continuous clinical health indicators that are often assessed at multiple time points. While the temporal data from clinical and microbiome samples may be informative, analysis of this type of data can be problematic for standard statistical methods. RESULTS: To identify associations between microbiota and continuous clinical variables measured repeatedly in two studies of the respiratory tract, we adapted a statistical method, the lasso-penalized generalized linear mixed model (LassoGLMM). LassoGLMM can screen for associated clinical variables, incorporate repeated measures of individuals, and address the large number of species found in the microbiome. As is common in microbiome studies, when the number of variables is an order of magnitude larger than the number of samples LassoGLMM can be imperfect in its variable selection. We overcome this limitation by adding a pre-screening step to reduce the number of variables evaluated in the model. We assessed the use of this adapted two-stage LassoGLMM for its ability to determine which microbes are associated with continuous repeated clinical measures. We found associations (retaining a non-zero coefficient in the LassoGLMM) between 10 laboratory measurements and 43 bacterial genera in the oral microbiota, and between 2 cytokines and 3 bacterial genera in the lung. We compared our associations with those identified by the Wilcoxon test after dichotomizing our outcomes and identified a non-significant trend towards differential abundance between high and low outcomes. Our two-step LassoGLMM explained more of the variance seen in the outcome of interest than other variants of the LassoGLMM method. CONCLUSIONS: We demonstrated a method that can account for the large number of genera detected in microbiome studies and repeated measures of clinical or longitudinal studies, allowing for the detection of strong associations between microbes and clinical measures. By incorporating the design strengths of repeated measurements and a prescreening step to aid variable selection, our two-step LassoGLMM will be a useful analytic method for investigating relationships between microbes and repeatedly measured continuous outcomes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13040-018-0173-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6003033
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-60030332018-07-06 Measuring associations between the microbiota and repeated measures of continuous clinical variables using a lasso-penalized generalized linear mixed model Tipton, Laura Cuenco, Karen T. Huang, Laurence Greenblatt, Ruth M. Kleerup, Eric Sciurba, Frank Duncan, Steven R. Donahoe, Michael P. Morris, Alison Ghedin, Elodie BioData Min Research BACKGROUND: Human microbiome studies in clinical settings generally focus on distinguishing the microbiota in health from that in disease at a specific point in time. However, microbiome samples may be associated with disease severity or continuous clinical health indicators that are often assessed at multiple time points. While the temporal data from clinical and microbiome samples may be informative, analysis of this type of data can be problematic for standard statistical methods. RESULTS: To identify associations between microbiota and continuous clinical variables measured repeatedly in two studies of the respiratory tract, we adapted a statistical method, the lasso-penalized generalized linear mixed model (LassoGLMM). LassoGLMM can screen for associated clinical variables, incorporate repeated measures of individuals, and address the large number of species found in the microbiome. As is common in microbiome studies, when the number of variables is an order of magnitude larger than the number of samples LassoGLMM can be imperfect in its variable selection. We overcome this limitation by adding a pre-screening step to reduce the number of variables evaluated in the model. We assessed the use of this adapted two-stage LassoGLMM for its ability to determine which microbes are associated with continuous repeated clinical measures. We found associations (retaining a non-zero coefficient in the LassoGLMM) between 10 laboratory measurements and 43 bacterial genera in the oral microbiota, and between 2 cytokines and 3 bacterial genera in the lung. We compared our associations with those identified by the Wilcoxon test after dichotomizing our outcomes and identified a non-significant trend towards differential abundance between high and low outcomes. Our two-step LassoGLMM explained more of the variance seen in the outcome of interest than other variants of the LassoGLMM method. CONCLUSIONS: We demonstrated a method that can account for the large number of genera detected in microbiome studies and repeated measures of clinical or longitudinal studies, allowing for the detection of strong associations between microbes and clinical measures. By incorporating the design strengths of repeated measurements and a prescreening step to aid variable selection, our two-step LassoGLMM will be a useful analytic method for investigating relationships between microbes and repeatedly measured continuous outcomes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13040-018-0173-9) contains supplementary material, which is available to authorized users. BioMed Central 2018-06-15 /pmc/articles/PMC6003033/ /pubmed/29983746 http://dx.doi.org/10.1186/s13040-018-0173-9 Text en © The Author(s). 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Tipton, Laura
Cuenco, Karen T.
Huang, Laurence
Greenblatt, Ruth M.
Kleerup, Eric
Sciurba, Frank
Duncan, Steven R.
Donahoe, Michael P.
Morris, Alison
Ghedin, Elodie
Measuring associations between the microbiota and repeated measures of continuous clinical variables using a lasso-penalized generalized linear mixed model
title Measuring associations between the microbiota and repeated measures of continuous clinical variables using a lasso-penalized generalized linear mixed model
title_full Measuring associations between the microbiota and repeated measures of continuous clinical variables using a lasso-penalized generalized linear mixed model
title_fullStr Measuring associations between the microbiota and repeated measures of continuous clinical variables using a lasso-penalized generalized linear mixed model
title_full_unstemmed Measuring associations between the microbiota and repeated measures of continuous clinical variables using a lasso-penalized generalized linear mixed model
title_short Measuring associations between the microbiota and repeated measures of continuous clinical variables using a lasso-penalized generalized linear mixed model
title_sort measuring associations between the microbiota and repeated measures of continuous clinical variables using a lasso-penalized generalized linear mixed model
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6003033/
https://www.ncbi.nlm.nih.gov/pubmed/29983746
http://dx.doi.org/10.1186/s13040-018-0173-9
work_keys_str_mv AT tiptonlaura measuringassociationsbetweenthemicrobiotaandrepeatedmeasuresofcontinuousclinicalvariablesusingalassopenalizedgeneralizedlinearmixedmodel
AT cuencokarent measuringassociationsbetweenthemicrobiotaandrepeatedmeasuresofcontinuousclinicalvariablesusingalassopenalizedgeneralizedlinearmixedmodel
AT huanglaurence measuringassociationsbetweenthemicrobiotaandrepeatedmeasuresofcontinuousclinicalvariablesusingalassopenalizedgeneralizedlinearmixedmodel
AT greenblattruthm measuringassociationsbetweenthemicrobiotaandrepeatedmeasuresofcontinuousclinicalvariablesusingalassopenalizedgeneralizedlinearmixedmodel
AT kleeruperic measuringassociationsbetweenthemicrobiotaandrepeatedmeasuresofcontinuousclinicalvariablesusingalassopenalizedgeneralizedlinearmixedmodel
AT sciurbafrank measuringassociationsbetweenthemicrobiotaandrepeatedmeasuresofcontinuousclinicalvariablesusingalassopenalizedgeneralizedlinearmixedmodel
AT duncanstevenr measuringassociationsbetweenthemicrobiotaandrepeatedmeasuresofcontinuousclinicalvariablesusingalassopenalizedgeneralizedlinearmixedmodel
AT donahoemichaelp measuringassociationsbetweenthemicrobiotaandrepeatedmeasuresofcontinuousclinicalvariablesusingalassopenalizedgeneralizedlinearmixedmodel
AT morrisalison measuringassociationsbetweenthemicrobiotaandrepeatedmeasuresofcontinuousclinicalvariablesusingalassopenalizedgeneralizedlinearmixedmodel
AT ghedinelodie measuringassociationsbetweenthemicrobiotaandrepeatedmeasuresofcontinuousclinicalvariablesusingalassopenalizedgeneralizedlinearmixedmodel