Cargando…
Methane prediction equations including genera of rumen bacteria as predictor variables improve prediction accuracy
Methane (CH(4)) emissions from ruminants are of a significant environmental concern, necessitating accurate prediction for emission inventories. Existing models rely solely on dietary and host animal-related data, ignoring the predicting power of rumen microbiota, the source of CH(4). To address thi...
Autores principales: | , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10693554/ https://www.ncbi.nlm.nih.gov/pubmed/38042941 http://dx.doi.org/10.1038/s41598-023-48449-y |
_version_ | 1785153186694168576 |
---|---|
author | Zhang, Boyang Lin, Shili Moraes, Luis Firkins, Jeffrey Hristov, Alexander N. Kebreab, Ermias Janssen, Peter H. Bannink, André Bayat, Alireza R. Crompton, Les A. Dijkstra, Jan Eugène, Maguy A. Kreuzer, Michael McGee, Mark Reynolds, Christopher K. Schwarm, Angela Yáñez-Ruiz, David R. Yu, Zhongtang |
author_facet | Zhang, Boyang Lin, Shili Moraes, Luis Firkins, Jeffrey Hristov, Alexander N. Kebreab, Ermias Janssen, Peter H. Bannink, André Bayat, Alireza R. Crompton, Les A. Dijkstra, Jan Eugène, Maguy A. Kreuzer, Michael McGee, Mark Reynolds, Christopher K. Schwarm, Angela Yáñez-Ruiz, David R. Yu, Zhongtang |
author_sort | Zhang, Boyang |
collection | PubMed |
description | Methane (CH(4)) emissions from ruminants are of a significant environmental concern, necessitating accurate prediction for emission inventories. Existing models rely solely on dietary and host animal-related data, ignoring the predicting power of rumen microbiota, the source of CH(4). To address this limitation, we developed novel CH(4) prediction models incorporating rumen microbes as predictors, alongside animal- and feed-related predictors using four statistical/machine learning (ML) methods. These include random forest combined with boosting (RF-B), least absolute shrinkage and selection operator (LASSO), generalized linear mixed model with LASSO (glmmLasso), and smoothly clipped absolute deviation (SCAD) implemented on linear mixed models. With a sheep dataset (218 observations) of both animal data and rumen microbiota data (relative sequence abundance of 330 genera of rumen bacteria, archaea, protozoa, and fungi), we developed linear mixed models to predict CH(4) production (g CH(4)/animal·d, ANIM-B models) and CH(4) yield (g CH(4)/kg of dry matter intake, DMI-B models). We also developed models solely based on animal-related data. Prediction performance was evaluated 200 times with random data splits, while fitting performance was assessed without data splitting. The inclusion of microbial predictors improved the models, as indicated by decreased root mean square prediction error (RMSPE) and mean absolute error (MAE), and increased Lin’s concordance correlation coefficient (CCC). Both glmmLasso and SCAD reduced the Akaike information criterion (AIC) and Bayesian information criterion (BIC) for both the ANIM-B and the DMI-B models, while the other two ML methods had mixed outcomes. By balancing prediction performance and fitting performance, we obtained one ANIM-B model (containing 10 genera of bacteria and 3 animal data) fitted using glmmLasso and one DMI-B model (5 genera of bacteria and 1 animal datum) fitted using SCAD. This study highlights the importance of incorporating rumen microbiota data in CH(4) prediction models to enhance accuracy and robustness. Additionally, ML methods facilitate the selection of microbial predictors from high-dimensional metataxonomic data of the rumen microbiota without overfitting. Moreover, the identified microbial predictors can serve as biomarkers of CH(4) emissions from sheep, providing valuable insights for future research and mitigation strategies. |
format | Online Article Text |
id | pubmed-10693554 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-106935542023-12-04 Methane prediction equations including genera of rumen bacteria as predictor variables improve prediction accuracy Zhang, Boyang Lin, Shili Moraes, Luis Firkins, Jeffrey Hristov, Alexander N. Kebreab, Ermias Janssen, Peter H. Bannink, André Bayat, Alireza R. Crompton, Les A. Dijkstra, Jan Eugène, Maguy A. Kreuzer, Michael McGee, Mark Reynolds, Christopher K. Schwarm, Angela Yáñez-Ruiz, David R. Yu, Zhongtang Sci Rep Article Methane (CH(4)) emissions from ruminants are of a significant environmental concern, necessitating accurate prediction for emission inventories. Existing models rely solely on dietary and host animal-related data, ignoring the predicting power of rumen microbiota, the source of CH(4). To address this limitation, we developed novel CH(4) prediction models incorporating rumen microbes as predictors, alongside animal- and feed-related predictors using four statistical/machine learning (ML) methods. These include random forest combined with boosting (RF-B), least absolute shrinkage and selection operator (LASSO), generalized linear mixed model with LASSO (glmmLasso), and smoothly clipped absolute deviation (SCAD) implemented on linear mixed models. With a sheep dataset (218 observations) of both animal data and rumen microbiota data (relative sequence abundance of 330 genera of rumen bacteria, archaea, protozoa, and fungi), we developed linear mixed models to predict CH(4) production (g CH(4)/animal·d, ANIM-B models) and CH(4) yield (g CH(4)/kg of dry matter intake, DMI-B models). We also developed models solely based on animal-related data. Prediction performance was evaluated 200 times with random data splits, while fitting performance was assessed without data splitting. The inclusion of microbial predictors improved the models, as indicated by decreased root mean square prediction error (RMSPE) and mean absolute error (MAE), and increased Lin’s concordance correlation coefficient (CCC). Both glmmLasso and SCAD reduced the Akaike information criterion (AIC) and Bayesian information criterion (BIC) for both the ANIM-B and the DMI-B models, while the other two ML methods had mixed outcomes. By balancing prediction performance and fitting performance, we obtained one ANIM-B model (containing 10 genera of bacteria and 3 animal data) fitted using glmmLasso and one DMI-B model (5 genera of bacteria and 1 animal datum) fitted using SCAD. This study highlights the importance of incorporating rumen microbiota data in CH(4) prediction models to enhance accuracy and robustness. Additionally, ML methods facilitate the selection of microbial predictors from high-dimensional metataxonomic data of the rumen microbiota without overfitting. Moreover, the identified microbial predictors can serve as biomarkers of CH(4) emissions from sheep, providing valuable insights for future research and mitigation strategies. Nature Publishing Group UK 2023-12-02 /pmc/articles/PMC10693554/ /pubmed/38042941 http://dx.doi.org/10.1038/s41598-023-48449-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Zhang, Boyang Lin, Shili Moraes, Luis Firkins, Jeffrey Hristov, Alexander N. Kebreab, Ermias Janssen, Peter H. Bannink, André Bayat, Alireza R. Crompton, Les A. Dijkstra, Jan Eugène, Maguy A. Kreuzer, Michael McGee, Mark Reynolds, Christopher K. Schwarm, Angela Yáñez-Ruiz, David R. Yu, Zhongtang Methane prediction equations including genera of rumen bacteria as predictor variables improve prediction accuracy |
title | Methane prediction equations including genera of rumen bacteria as predictor variables improve prediction accuracy |
title_full | Methane prediction equations including genera of rumen bacteria as predictor variables improve prediction accuracy |
title_fullStr | Methane prediction equations including genera of rumen bacteria as predictor variables improve prediction accuracy |
title_full_unstemmed | Methane prediction equations including genera of rumen bacteria as predictor variables improve prediction accuracy |
title_short | Methane prediction equations including genera of rumen bacteria as predictor variables improve prediction accuracy |
title_sort | methane prediction equations including genera of rumen bacteria as predictor variables improve prediction accuracy |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10693554/ https://www.ncbi.nlm.nih.gov/pubmed/38042941 http://dx.doi.org/10.1038/s41598-023-48449-y |
work_keys_str_mv | AT zhangboyang methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy AT linshili methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy AT moraesluis methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy AT firkinsjeffrey methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy AT hristovalexandern methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy AT kebreabermias methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy AT janssenpeterh methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy AT banninkandre methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy AT bayatalirezar methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy AT cromptonlesa methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy AT dijkstrajan methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy AT eugenemaguya methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy AT kreuzermichael methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy AT mcgeemark methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy AT reynoldschristopherk methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy AT schwarmangela methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy AT yanezruizdavidr methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy AT yuzhongtang methanepredictionequationsincludinggeneraofrumenbacteriaaspredictorvariablesimprovepredictionaccuracy |