Cargando…

Methane prediction equations including genera of rumen bacteria as predictor variables improve prediction accuracy

Methane (CH(4)) emissions from ruminants are of a significant environmental concern, necessitating accurate prediction for emission inventories. Existing models rely solely on dietary and host animal-related data, ignoring the predicting power of rumen microbiota, the source of CH(4). To address thi...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Boyang, Lin, Shili, Moraes, Luis, Firkins, Jeffrey, Hristov, Alexander N., Kebreab, Ermias, Janssen, Peter H., Bannink, André, Bayat, Alireza R., Crompton, Les A., Dijkstra, Jan, Eugène, Maguy A., Kreuzer, Michael, McGee, Mark, Reynolds, Christopher K., Schwarm, Angela, Yáñez-Ruiz, David R., Yu, Zhongtang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10693554/
https://www.ncbi.nlm.nih.gov/pubmed/38042941
http://dx.doi.org/10.1038/s41598-023-48449-y
_version_ 1785153186694168576
author Zhang, Boyang
Lin, Shili
Moraes, Luis
Firkins, Jeffrey
Hristov, Alexander N.
Kebreab, Ermias
Janssen, Peter H.
Bannink, André
Bayat, Alireza R.
Crompton, Les A.
Dijkstra, Jan
Eugène, Maguy A.
Kreuzer, Michael
McGee, Mark
Reynolds, Christopher K.
Schwarm, Angela
Yáñez-Ruiz, David R.
Yu, Zhongtang
author_facet Zhang, Boyang
Lin, Shili
Moraes, Luis
Firkins, Jeffrey
Hristov, Alexander N.
Kebreab, Ermias
Janssen, Peter H.
Bannink, André
Bayat, Alireza R.
Crompton, Les A.
Dijkstra, Jan
Eugène, Maguy A.
Kreuzer, Michael
McGee, Mark
Reynolds, Christopher K.
Schwarm, Angela
Yáñez-Ruiz, David R.
Yu, Zhongtang
author_sort Zhang, Boyang
collection PubMed
description Methane (CH(4)) emissions from ruminants are of a significant environmental concern, necessitating accurate prediction for emission inventories. Existing models rely solely on dietary and host animal-related data, ignoring the predicting power of rumen microbiota, the source of CH(4). To address this limitation, we developed novel CH(4) prediction models incorporating rumen microbes as predictors, alongside animal- and feed-related predictors using four statistical/machine learning (ML) methods. These include random forest combined with boosting (RF-B), least absolute shrinkage and selection operator (LASSO), generalized linear mixed model with LASSO (glmmLasso), and smoothly clipped absolute deviation (SCAD) implemented on linear mixed models. With a sheep dataset (218 observations) of both animal data and rumen microbiota data (relative sequence abundance of 330 genera of rumen bacteria, archaea, protozoa, and fungi), we developed linear mixed models to predict CH(4) production (g CH(4)/animal·d, ANIM-B models) and CH(4) yield (g CH(4)/kg of dry matter intake, DMI-B models). We also developed models solely based on animal-related data. Prediction performance was evaluated 200 times with random data splits, while fitting performance was assessed without data splitting. The inclusion of microbial predictors improved the models, as indicated by decreased root mean square prediction error (RMSPE) and mean absolute error (MAE), and increased Lin’s concordance correlation coefficient (CCC). Both glmmLasso and SCAD reduced the Akaike information criterion (AIC) and Bayesian information criterion (BIC) for both the ANIM-B and the DMI-B models, while the other two ML methods had mixed outcomes. By balancing prediction performance and fitting performance, we obtained one ANIM-B model (containing 10 genera of bacteria and 3 animal data) fitted using glmmLasso and one DMI-B model (5 genera of bacteria and 1 animal datum) fitted using SCAD. This study highlights the importance of incorporating rumen microbiota data in CH(4) prediction models to enhance accuracy and robustness. Additionally, ML methods facilitate the selection of microbial predictors from high-dimensional metataxonomic data of the rumen microbiota without overfitting. Moreover, the identified microbial predictors can serve as biomarkers of CH(4) emissions from sheep, providing valuable insights for future research and mitigation strategies.
format Online
Article
Text
id pubmed-10693554
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-106935542023-12-04 Methane prediction equations including genera of rumen bacteria as predictor variables improve prediction accuracy Zhang, Boyang Lin, Shili Moraes, Luis Firkins, Jeffrey Hristov, Alexander N. Kebreab, Ermias Janssen, Peter H. Bannink, André Bayat, Alireza R. Crompton, Les A. Dijkstra, Jan Eugène, Maguy A. Kreuzer, Michael McGee, Mark Reynolds, Christopher K. Schwarm, Angela Yáñez-Ruiz, David R. Yu, Zhongtang Sci Rep Article Methane (CH(4)) emissions from ruminants are of a significant environmental concern, necessitating accurate prediction for emission inventories. Existing models rely solely on dietary and host animal-related data, ignoring the predicting power of rumen microbiota, the source of CH(4). To address this limitation, we developed novel CH(4) prediction models incorporating rumen microbes as predictors, alongside animal- and feed-related predictors using four statistical/machine learning (ML) methods. These include random forest combined with boosting (RF-B), least absolute shrinkage and selection operator (LASSO), generalized linear mixed model with LASSO (glmmLasso), and smoothly clipped absolute deviation (SCAD) implemented on linear mixed models. With a sheep dataset (218 observations) of both animal data and rumen microbiota data (relative sequence abundance of 330 genera of rumen bacteria, archaea, protozoa, and fungi), we developed linear mixed models to predict CH(4) production (g CH(4)/animal·d, ANIM-B models) and CH(4) yield (g CH(4)/kg of dry matter intake, DMI-B models). We also developed models solely based on animal-related data. Prediction performance was evaluated 200 times with random data splits, while fitting performance was assessed without data splitting. The inclusion of microbial predictors improved the models, as indicated by decreased root mean square prediction error (RMSPE) and mean absolute error (MAE), and increased Lin’s concordance correlation coefficient (CCC). Both glmmLasso and SCAD reduced the Akaike information criterion (AIC) and Bayesian information criterion (BIC) for both the ANIM-B and the DMI-B models, while the other two ML methods had mixed outcomes. By balancing prediction performance and fitting performance, we obtained one ANIM-B model (containing 10 genera of bacteria and 3 animal data) fitted using glmmLasso and one DMI-B model (5 genera of bacteria and 1 animal datum) fitted using SCAD. This study highlights the importance of incorporating rumen microbiota data in CH(4) prediction models to enhance accuracy and robustness. Additionally, ML methods facilitate the selection of microbial predictors from high-dimensional metataxonomic data of the rumen microbiota without overfitting. Moreover, the identified microbial predictors can serve as biomarkers of CH(4) emissions from sheep, providing valuable insights for future research and mitigation strategies. Nature Publishing Group UK 2023-12-02 /pmc/articles/PMC10693554/ /pubmed/38042941 http://dx.doi.org/10.1038/s41598-023-48449-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Zhang, Boyang
Lin, Shili
Moraes, Luis
Firkins, Jeffrey
Hristov, Alexander N.
Kebreab, Ermias
Janssen, Peter H.
Bannink, André
Bayat, Alireza R.
Crompton, Les A.
Dijkstra, Jan
Eugène, Maguy A.
Kreuzer, Michael
McGee, Mark
Reynolds, Christopher K.
Schwarm, Angela
Yáñez-Ruiz, David R.
Yu, Zhongtang
Methane prediction equations including genera of rumen bacteria as predictor variables improve prediction accuracy
title Methane prediction equations including genera of rumen bacteria as predictor variables improve prediction accuracy
title_full Methane prediction equations including genera of rumen bacteria as predictor variables improve prediction accuracy
title_fullStr Methane prediction equations including genera of rumen bacteria as predictor variables improve prediction accuracy
title_full_unstemmed Methane prediction equations including genera of rumen bacteria as predictor variables improve prediction accuracy
title_short Methane prediction equations including genera of rumen bacteria as predictor variables improve prediction accuracy
title_sort methane prediction equations including genera of rumen bacteria as predictor variables improve prediction accuracy
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10693554/
https://www.ncbi.nlm.nih.gov/pubmed/38042941
http://dx.doi.org/10.1038/s41598-023-48449-y
work_keys_str_mv AT zhangboyang methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy
AT linshili methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy
AT moraesluis methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy
AT firkinsjeffrey methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy
AT hristovalexandern methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy
AT kebreabermias methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy
AT janssenpeterh methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy
AT banninkandre methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy
AT bayatalirezar methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy
AT cromptonlesa methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy
AT dijkstrajan methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy
AT eugenemaguya methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy
AT kreuzermichael methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy
AT mcgeemark methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy
AT reynoldschristopherk methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy
AT schwarmangela methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy
AT yanezruizdavidr methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy
AT yuzhongtang methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy