Cargando…

Prediction of flavor and retention index for compounds in beer depending on molecular structure using a machine learning method

In order to make a preliminary prediction of flavor and retention index (RI) for compounds in beer, this work applied the machine learning method to modeling depending on molecular structure. Towards this goal, the flavor compounds in beer from existing literature were collected. The database was cl...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yu-Tang, Yang, Zhao-Xia, Piao, Zan-Hao, Xu, Xiao-Juan, Yu, Jun-Hong, Zhang, Ying-Hua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044825/
https://www.ncbi.nlm.nih.gov/pubmed/35494377
http://dx.doi.org/10.1039/d1ra06551c
_version_ 1784695186785304576
author Wang, Yu-Tang
Yang, Zhao-Xia
Piao, Zan-Hao
Xu, Xiao-Juan
Yu, Jun-Hong
Zhang, Ying-Hua
author_facet Wang, Yu-Tang
Yang, Zhao-Xia
Piao, Zan-Hao
Xu, Xiao-Juan
Yu, Jun-Hong
Zhang, Ying-Hua
author_sort Wang, Yu-Tang
collection PubMed
description In order to make a preliminary prediction of flavor and retention index (RI) for compounds in beer, this work applied the machine learning method to modeling depending on molecular structure. Towards this goal, the flavor compounds in beer from existing literature were collected. The database was classified into four groups as aromatic, bitter, sulfury, and others. The RI values on a non-polar SE-30 column and a polar Carbowax 20M column from the National Institute of Standards Technology (NIST) were investigated. The structures were converted to molecular descriptors calculated by molecular operating environment (MOE), ChemoPy and Mordred, respectively. By combining the pretreatment of the descriptors, machine learning models, including support vector machine (SVM), random forest (RF) and k-nearest neighbour (kNN) were utilized for beer flavor models. Principal component regression (PCR), random forest regression (RFR) and partial least squares (PLS) regression were employed to predict the RI. The accuracy of the test set was obtained by SVM, RF, and kNN. Among them, the combination of descriptors calculated by Mordred and RF model afforded the highest accuracy of 0.686. R(2) of the optimal regression model achieved 0.96. The results indicated that the models can be used to predict the flavor of a specific compound in beer and its RI value.
format Online
Article
Text
id pubmed-9044825
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-90448252022-04-28 Prediction of flavor and retention index for compounds in beer depending on molecular structure using a machine learning method Wang, Yu-Tang Yang, Zhao-Xia Piao, Zan-Hao Xu, Xiao-Juan Yu, Jun-Hong Zhang, Ying-Hua RSC Adv Chemistry In order to make a preliminary prediction of flavor and retention index (RI) for compounds in beer, this work applied the machine learning method to modeling depending on molecular structure. Towards this goal, the flavor compounds in beer from existing literature were collected. The database was classified into four groups as aromatic, bitter, sulfury, and others. The RI values on a non-polar SE-30 column and a polar Carbowax 20M column from the National Institute of Standards Technology (NIST) were investigated. The structures were converted to molecular descriptors calculated by molecular operating environment (MOE), ChemoPy and Mordred, respectively. By combining the pretreatment of the descriptors, machine learning models, including support vector machine (SVM), random forest (RF) and k-nearest neighbour (kNN) were utilized for beer flavor models. Principal component regression (PCR), random forest regression (RFR) and partial least squares (PLS) regression were employed to predict the RI. The accuracy of the test set was obtained by SVM, RF, and kNN. Among them, the combination of descriptors calculated by Mordred and RF model afforded the highest accuracy of 0.686. R(2) of the optimal regression model achieved 0.96. The results indicated that the models can be used to predict the flavor of a specific compound in beer and its RI value. The Royal Society of Chemistry 2021-11-17 /pmc/articles/PMC9044825/ /pubmed/35494377 http://dx.doi.org/10.1039/d1ra06551c Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/
spellingShingle Chemistry
Wang, Yu-Tang
Yang, Zhao-Xia
Piao, Zan-Hao
Xu, Xiao-Juan
Yu, Jun-Hong
Zhang, Ying-Hua
Prediction of flavor and retention index for compounds in beer depending on molecular structure using a machine learning method
title Prediction of flavor and retention index for compounds in beer depending on molecular structure using a machine learning method
title_full Prediction of flavor and retention index for compounds in beer depending on molecular structure using a machine learning method
title_fullStr Prediction of flavor and retention index for compounds in beer depending on molecular structure using a machine learning method
title_full_unstemmed Prediction of flavor and retention index for compounds in beer depending on molecular structure using a machine learning method
title_short Prediction of flavor and retention index for compounds in beer depending on molecular structure using a machine learning method
title_sort prediction of flavor and retention index for compounds in beer depending on molecular structure using a machine learning method
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044825/
https://www.ncbi.nlm.nih.gov/pubmed/35494377
http://dx.doi.org/10.1039/d1ra06551c
work_keys_str_mv AT wangyutang predictionofflavorandretentionindexforcompoundsinbeerdependingonmolecularstructureusingamachinelearningmethod
AT yangzhaoxia predictionofflavorandretentionindexforcompoundsinbeerdependingonmolecularstructureusingamachinelearningmethod
AT piaozanhao predictionofflavorandretentionindexforcompoundsinbeerdependingonmolecularstructureusingamachinelearningmethod
AT xuxiaojuan predictionofflavorandretentionindexforcompoundsinbeerdependingonmolecularstructureusingamachinelearningmethod
AT yujunhong predictionofflavorandretentionindexforcompoundsinbeerdependingonmolecularstructureusingamachinelearningmethod
AT zhangyinghua predictionofflavorandretentionindexforcompoundsinbeerdependingonmolecularstructureusingamachinelearningmethod