Cargando…

A mixed-model approach for estimating drivers of microbiota community composition and differential taxonomic abundance

Next-generation sequencing (NGS) and metabarcoding approaches are increasingly applied to wild animal populations, but there is a disconnect between the widely applied generalized linear mixed model (GLMM) approaches commonly used to study phenotypic variation and the statistical toolkit from commun...

Descripción completa

Detalles Bibliográficos
Autores principales: Sweeny, Amy R., Lemon, Hannah, Ibrahim, Anan, Watt, Kathryn A., Wilson, Kenneth, Childs, Dylan Z., Nussey, Daniel H., Free, Andrew, McNally, Luke
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10469806/
https://www.ncbi.nlm.nih.gov/pubmed/37489890
http://dx.doi.org/10.1128/msystems.00040-23
_version_ 1785099527645036544
author Sweeny, Amy R.
Lemon, Hannah
Ibrahim, Anan
Watt, Kathryn A.
Wilson, Kenneth
Childs, Dylan Z.
Nussey, Daniel H.
Free, Andrew
McNally, Luke
author_facet Sweeny, Amy R.
Lemon, Hannah
Ibrahim, Anan
Watt, Kathryn A.
Wilson, Kenneth
Childs, Dylan Z.
Nussey, Daniel H.
Free, Andrew
McNally, Luke
author_sort Sweeny, Amy R.
collection PubMed
description Next-generation sequencing (NGS) and metabarcoding approaches are increasingly applied to wild animal populations, but there is a disconnect between the widely applied generalized linear mixed model (GLMM) approaches commonly used to study phenotypic variation and the statistical toolkit from community ecology typically applied to metabarcoding data. Here, we describe the suitability of a novel GLMM-based approach for analyzing the taxon-specific sequence read counts derived from standard metabarcoding data. This approach allows decomposition of the contribution of different drivers to variation in community composition (e.g., age, season, individual) via interaction terms in the model random-effects structure. We provide guidance to implementing this approach and show how these models can identify how responsible specific taxonomic groups are for the effects attributed to different drivers. We applied this approach to two cross-sectional data sets from the Soay sheep population of St. Kilda. GLMMs showed agreement with dissimilarity-based approaches highlighting the substantial contribution of age and minimal contribution of season to microbiota community compositions, and simultaneously estimated the contribution of other technical and biological factors. We further used model predictions to show that age effects were principally due to increases in taxa of the phylum Bacteroidetes and declines in taxa of the phylum Firmicutes. This approach offers a powerful means for understanding the influence of drivers of community structure derived from metabarcoding data. We discuss how our approach could be readily adapted to allow researchers to estimate contributions of additional factors such as host or microbe phylogeny to answer emerging questions surrounding the ecological and evolutionary roles of within-host communities. IMPORTANCE: NGS and fecal metabarcoding methods have provided powerful opportunities to study the wild gut microbiome. A wealth of data is, therefore, amassing across wild systems, generating the need for analytical approaches that can appropriately investigate simultaneous factors at the host and environmental scale that determine the composition of these communities. Here, we describe a generalized linear mixed-effects model (GLMM) approach to analyze read count data from metabarcoding of the gut microbiota, allowing us to quantify the contributions of multiple host and environmental factors to within-host community structure. Our approach provides outputs that are familiar to a majority of field ecologists and can be run using any standard mixed-effects modeling packages. We illustrate this approach using two metabarcoding data sets from the Soay sheep population of St. Kilda investigating age and season effects as worked examples.
format Online
Article
Text
id pubmed-10469806
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-104698062023-09-01 A mixed-model approach for estimating drivers of microbiota community composition and differential taxonomic abundance Sweeny, Amy R. Lemon, Hannah Ibrahim, Anan Watt, Kathryn A. Wilson, Kenneth Childs, Dylan Z. Nussey, Daniel H. Free, Andrew McNally, Luke mSystems Research Article Next-generation sequencing (NGS) and metabarcoding approaches are increasingly applied to wild animal populations, but there is a disconnect between the widely applied generalized linear mixed model (GLMM) approaches commonly used to study phenotypic variation and the statistical toolkit from community ecology typically applied to metabarcoding data. Here, we describe the suitability of a novel GLMM-based approach for analyzing the taxon-specific sequence read counts derived from standard metabarcoding data. This approach allows decomposition of the contribution of different drivers to variation in community composition (e.g., age, season, individual) via interaction terms in the model random-effects structure. We provide guidance to implementing this approach and show how these models can identify how responsible specific taxonomic groups are for the effects attributed to different drivers. We applied this approach to two cross-sectional data sets from the Soay sheep population of St. Kilda. GLMMs showed agreement with dissimilarity-based approaches highlighting the substantial contribution of age and minimal contribution of season to microbiota community compositions, and simultaneously estimated the contribution of other technical and biological factors. We further used model predictions to show that age effects were principally due to increases in taxa of the phylum Bacteroidetes and declines in taxa of the phylum Firmicutes. This approach offers a powerful means for understanding the influence of drivers of community structure derived from metabarcoding data. We discuss how our approach could be readily adapted to allow researchers to estimate contributions of additional factors such as host or microbe phylogeny to answer emerging questions surrounding the ecological and evolutionary roles of within-host communities. IMPORTANCE: NGS and fecal metabarcoding methods have provided powerful opportunities to study the wild gut microbiome. A wealth of data is, therefore, amassing across wild systems, generating the need for analytical approaches that can appropriately investigate simultaneous factors at the host and environmental scale that determine the composition of these communities. Here, we describe a generalized linear mixed-effects model (GLMM) approach to analyze read count data from metabarcoding of the gut microbiota, allowing us to quantify the contributions of multiple host and environmental factors to within-host community structure. Our approach provides outputs that are familiar to a majority of field ecologists and can be run using any standard mixed-effects modeling packages. We illustrate this approach using two metabarcoding data sets from the Soay sheep population of St. Kilda investigating age and season effects as worked examples. American Society for Microbiology 2023-07-25 /pmc/articles/PMC10469806/ /pubmed/37489890 http://dx.doi.org/10.1128/msystems.00040-23 Text en Copyright © 2023 Sweeny et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research Article
Sweeny, Amy R.
Lemon, Hannah
Ibrahim, Anan
Watt, Kathryn A.
Wilson, Kenneth
Childs, Dylan Z.
Nussey, Daniel H.
Free, Andrew
McNally, Luke
A mixed-model approach for estimating drivers of microbiota community composition and differential taxonomic abundance
title A mixed-model approach for estimating drivers of microbiota community composition and differential taxonomic abundance
title_full A mixed-model approach for estimating drivers of microbiota community composition and differential taxonomic abundance
title_fullStr A mixed-model approach for estimating drivers of microbiota community composition and differential taxonomic abundance
title_full_unstemmed A mixed-model approach for estimating drivers of microbiota community composition and differential taxonomic abundance
title_short A mixed-model approach for estimating drivers of microbiota community composition and differential taxonomic abundance
title_sort mixed-model approach for estimating drivers of microbiota community composition and differential taxonomic abundance
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10469806/
https://www.ncbi.nlm.nih.gov/pubmed/37489890
http://dx.doi.org/10.1128/msystems.00040-23
work_keys_str_mv AT sweenyamyr amixedmodelapproachforestimatingdriversofmicrobiotacommunitycompositionanddifferentialtaxonomicabundance
AT lemonhannah amixedmodelapproachforestimatingdriversofmicrobiotacommunitycompositionanddifferentialtaxonomicabundance
AT ibrahimanan amixedmodelapproachforestimatingdriversofmicrobiotacommunitycompositionanddifferentialtaxonomicabundance
AT wattkathryna amixedmodelapproachforestimatingdriversofmicrobiotacommunitycompositionanddifferentialtaxonomicabundance
AT wilsonkenneth amixedmodelapproachforestimatingdriversofmicrobiotacommunitycompositionanddifferentialtaxonomicabundance
AT childsdylanz amixedmodelapproachforestimatingdriversofmicrobiotacommunitycompositionanddifferentialtaxonomicabundance
AT nusseydanielh amixedmodelapproachforestimatingdriversofmicrobiotacommunitycompositionanddifferentialtaxonomicabundance
AT freeandrew amixedmodelapproachforestimatingdriversofmicrobiotacommunitycompositionanddifferentialtaxonomicabundance
AT mcnallyluke amixedmodelapproachforestimatingdriversofmicrobiotacommunitycompositionanddifferentialtaxonomicabundance
AT sweenyamyr mixedmodelapproachforestimatingdriversofmicrobiotacommunitycompositionanddifferentialtaxonomicabundance
AT lemonhannah mixedmodelapproachforestimatingdriversofmicrobiotacommunitycompositionanddifferentialtaxonomicabundance
AT ibrahimanan mixedmodelapproachforestimatingdriversofmicrobiotacommunitycompositionanddifferentialtaxonomicabundance
AT wattkathryna mixedmodelapproachforestimatingdriversofmicrobiotacommunitycompositionanddifferentialtaxonomicabundance
AT wilsonkenneth mixedmodelapproachforestimatingdriversofmicrobiotacommunitycompositionanddifferentialtaxonomicabundance
AT childsdylanz mixedmodelapproachforestimatingdriversofmicrobiotacommunitycompositionanddifferentialtaxonomicabundance
AT nusseydanielh mixedmodelapproachforestimatingdriversofmicrobiotacommunitycompositionanddifferentialtaxonomicabundance
AT freeandrew mixedmodelapproachforestimatingdriversofmicrobiotacommunitycompositionanddifferentialtaxonomicabundance
AT mcnallyluke mixedmodelapproachforestimatingdriversofmicrobiotacommunitycompositionanddifferentialtaxonomicabundance