Cargando…

Improved Metabolite Prediction Using Microbiome Data-Based Elastic Net Models

Microbiome data are becoming increasingly available in large health cohorts, yet metabolomics data are still scant. While many studies generate microbiome data, they lack matched metabolomics data or have considerable missing proportions of metabolites. Since metabolomics is key to understanding mic...

Descripción completa

Detalles Bibliográficos
Autores principales: Xie, Jialiu, Cho, Hunyong, Lin, Bridget M., Pillai, Malvika, Heimisdottir, Lara H., Bandyopadhyay, Dipankar, Zou, Fei, Roach, Jeffrey, Divaris, Kimon, Wu, Di
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8573316/
https://www.ncbi.nlm.nih.gov/pubmed/34760716
http://dx.doi.org/10.3389/fcimb.2021.734416
_version_ 1784595397624201216
author Xie, Jialiu
Cho, Hunyong
Lin, Bridget M.
Pillai, Malvika
Heimisdottir, Lara H.
Bandyopadhyay, Dipankar
Zou, Fei
Roach, Jeffrey
Divaris, Kimon
Wu, Di
author_facet Xie, Jialiu
Cho, Hunyong
Lin, Bridget M.
Pillai, Malvika
Heimisdottir, Lara H.
Bandyopadhyay, Dipankar
Zou, Fei
Roach, Jeffrey
Divaris, Kimon
Wu, Di
author_sort Xie, Jialiu
collection PubMed
description Microbiome data are becoming increasingly available in large health cohorts, yet metabolomics data are still scant. While many studies generate microbiome data, they lack matched metabolomics data or have considerable missing proportions of metabolites. Since metabolomics is key to understanding microbial and general biological activities, the possibility of imputing individual metabolites or inferring metabolomics pathways from microbial taxonomy or metagenomics is intriguing. Importantly, current metabolomics profiling methods such as the HMP Unified Metabolic Analysis Network (HUMAnN) have unknown accuracy and are limited in their ability to predict individual metabolites. To address this gap, we developed a novel metabolite prediction method, and we present its application and evaluation in an oral microbiome study. The new method for predicting metabolites using microbiome data (ENVIM) is based on the elastic net model (ENM). ENVIM introduces an extra step to ENM to consider variable importance (VI) scores, and thus, achieves better prediction power. We investigate the metabolite prediction performance of ENVIM using metagenomic and metatranscriptomic data in a supragingival biofilm multi-omics dataset of 289 children ages 3–5 who were participants of a community-based study of early childhood oral health (ZOE 2.0) in North Carolina, United States. We further validate ENVIM in two additional publicly available multi-omics datasets generated from studies of gut health. We select gene family sets based on variable importance scores and modify the existing ENM strategy used in the MelonnPan prediction software to accommodate the unique features of microbiome and metabolome data. We evaluate metagenomic and metatranscriptomic predictors and compare the prediction performance of ENVIM to the standard ENM employed in MelonnPan. The newly developed ENVIM method showed superior metabolite predictive accuracy than MelonnPan when trained with metatranscriptomics data only, metagenomics data only, or both. Better metabolite prediction is achieved in the gut microbiome compared with the oral microbiome setting. We report the best-predictable compounds in all these three datasets from two different body sites. For example, the metabolites trehalose, maltose, stachyose, and ribose are all well predicted by the supragingival microbiome.
format Online
Article
Text
id pubmed-8573316
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-85733162021-11-09 Improved Metabolite Prediction Using Microbiome Data-Based Elastic Net Models Xie, Jialiu Cho, Hunyong Lin, Bridget M. Pillai, Malvika Heimisdottir, Lara H. Bandyopadhyay, Dipankar Zou, Fei Roach, Jeffrey Divaris, Kimon Wu, Di Front Cell Infect Microbiol Cellular and Infection Microbiology Microbiome data are becoming increasingly available in large health cohorts, yet metabolomics data are still scant. While many studies generate microbiome data, they lack matched metabolomics data or have considerable missing proportions of metabolites. Since metabolomics is key to understanding microbial and general biological activities, the possibility of imputing individual metabolites or inferring metabolomics pathways from microbial taxonomy or metagenomics is intriguing. Importantly, current metabolomics profiling methods such as the HMP Unified Metabolic Analysis Network (HUMAnN) have unknown accuracy and are limited in their ability to predict individual metabolites. To address this gap, we developed a novel metabolite prediction method, and we present its application and evaluation in an oral microbiome study. The new method for predicting metabolites using microbiome data (ENVIM) is based on the elastic net model (ENM). ENVIM introduces an extra step to ENM to consider variable importance (VI) scores, and thus, achieves better prediction power. We investigate the metabolite prediction performance of ENVIM using metagenomic and metatranscriptomic data in a supragingival biofilm multi-omics dataset of 289 children ages 3–5 who were participants of a community-based study of early childhood oral health (ZOE 2.0) in North Carolina, United States. We further validate ENVIM in two additional publicly available multi-omics datasets generated from studies of gut health. We select gene family sets based on variable importance scores and modify the existing ENM strategy used in the MelonnPan prediction software to accommodate the unique features of microbiome and metabolome data. We evaluate metagenomic and metatranscriptomic predictors and compare the prediction performance of ENVIM to the standard ENM employed in MelonnPan. The newly developed ENVIM method showed superior metabolite predictive accuracy than MelonnPan when trained with metatranscriptomics data only, metagenomics data only, or both. Better metabolite prediction is achieved in the gut microbiome compared with the oral microbiome setting. We report the best-predictable compounds in all these three datasets from two different body sites. For example, the metabolites trehalose, maltose, stachyose, and ribose are all well predicted by the supragingival microbiome. Frontiers Media S.A. 2021-10-25 /pmc/articles/PMC8573316/ /pubmed/34760716 http://dx.doi.org/10.3389/fcimb.2021.734416 Text en Copyright © 2021 Xie, Cho, Lin, Pillai, Heimisdottir, Bandyopadhyay, Zou, Roach, Divaris and Wu https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Cellular and Infection Microbiology
Xie, Jialiu
Cho, Hunyong
Lin, Bridget M.
Pillai, Malvika
Heimisdottir, Lara H.
Bandyopadhyay, Dipankar
Zou, Fei
Roach, Jeffrey
Divaris, Kimon
Wu, Di
Improved Metabolite Prediction Using Microbiome Data-Based Elastic Net Models
title Improved Metabolite Prediction Using Microbiome Data-Based Elastic Net Models
title_full Improved Metabolite Prediction Using Microbiome Data-Based Elastic Net Models
title_fullStr Improved Metabolite Prediction Using Microbiome Data-Based Elastic Net Models
title_full_unstemmed Improved Metabolite Prediction Using Microbiome Data-Based Elastic Net Models
title_short Improved Metabolite Prediction Using Microbiome Data-Based Elastic Net Models
title_sort improved metabolite prediction using microbiome data-based elastic net models
topic Cellular and Infection Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8573316/
https://www.ncbi.nlm.nih.gov/pubmed/34760716
http://dx.doi.org/10.3389/fcimb.2021.734416
work_keys_str_mv AT xiejialiu improvedmetabolitepredictionusingmicrobiomedatabasedelasticnetmodels
AT chohunyong improvedmetabolitepredictionusingmicrobiomedatabasedelasticnetmodels
AT linbridgetm improvedmetabolitepredictionusingmicrobiomedatabasedelasticnetmodels
AT pillaimalvika improvedmetabolitepredictionusingmicrobiomedatabasedelasticnetmodels
AT heimisdottirlarah improvedmetabolitepredictionusingmicrobiomedatabasedelasticnetmodels
AT bandyopadhyaydipankar improvedmetabolitepredictionusingmicrobiomedatabasedelasticnetmodels
AT zoufei improvedmetabolitepredictionusingmicrobiomedatabasedelasticnetmodels
AT roachjeffrey improvedmetabolitepredictionusingmicrobiomedatabasedelasticnetmodels
AT divariskimon improvedmetabolitepredictionusingmicrobiomedatabasedelasticnetmodels
AT wudi improvedmetabolitepredictionusingmicrobiomedatabasedelasticnetmodels