Cargando…

e-Sweet: A Machine-Learning Based Platform for the Prediction of Sweetener and Its Relative Sweetness

Artificial sweeteners (AS) can elicit the strong sweet sensation with the low or zero calorie, and are widely used to replace the nutritive sugar in the food and beverage industry. However, the safety issue of current AS is still controversial. Thus, it is imperative to develop more safe and potent...

Descripción completa

Detalles Bibliográficos
Autores principales: Zheng, Suqing, Chang, Wenping, Xu, Wenxin, Xu, Yong, Lin, Fu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6363693/
https://www.ncbi.nlm.nih.gov/pubmed/30761295
http://dx.doi.org/10.3389/fchem.2019.00035
_version_ 1783393149145579520
author Zheng, Suqing
Chang, Wenping
Xu, Wenxin
Xu, Yong
Lin, Fu
author_facet Zheng, Suqing
Chang, Wenping
Xu, Wenxin
Xu, Yong
Lin, Fu
author_sort Zheng, Suqing
collection PubMed
description Artificial sweeteners (AS) can elicit the strong sweet sensation with the low or zero calorie, and are widely used to replace the nutritive sugar in the food and beverage industry. However, the safety issue of current AS is still controversial. Thus, it is imperative to develop more safe and potent AS. Due to the costly and laborious experimental-screening of AS, in-silico sweetener/sweetness prediction could provide a good avenue to identify the potential sweetener candidates before experiment. In this work, we curate the largest dataset of 530 sweeteners and 850 non-sweeteners, and collect the second largest dataset of 352 sweeteners with the relative sweetness (RS) from the literature. In light of these experimental datasets, we adopt five machine-learning methods and conformational-independent molecular fingerprints to derive the classification and regression models for the prediction of sweetener and its RS, respectively via the consensus strategy. Our best classification model achieves the 95% confidence intervals for the accuracy (0.91 ± 0.01), precision (0.90 ± 0.01), specificity (0.94 ± 0.01), sensitivity (0.86 ± 0.01), F1-score (0.88 ± 0.01), and NER (Non-error Rate: 0.90 ± 0.01) on the test set, which outperforms the model (NER = 0.85) of Rojas et al. in terms of NER, and our best regression model gives the 95% confidence intervals for the R(2)(test set) and ΔR(2) [referring to |R(2)(test set)- R(2)(cross-validation)|] of 0.77 ± 0.01 and 0.03 ± 0.01, respectively, which is also better than the other works based on the conformation-independent 2D descriptors (e.g., 2D Dragon) according to R(2)(test set) and ΔR(2). Our models are obtained by averaging over nineteen data-splitting schemes, and fully comply with the guidelines of Organization for Economic Cooperation and Development (OECD), which are not completely followed by the previous relevant works that are all on the basis of only one random data-splitting scheme for the cross-validation set and test set. Finally, we develop a user-friendly platform “e-Sweet” for the automatic prediction of sweetener and its corresponding RS. To our best knowledge, it is a first and free platform that can enable the experimental food scientists to exploit the current machine-learning methods to boost the discovery of more AS with the low or zero calorie content.
format Online
Article
Text
id pubmed-6363693
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-63636932019-02-13 e-Sweet: A Machine-Learning Based Platform for the Prediction of Sweetener and Its Relative Sweetness Zheng, Suqing Chang, Wenping Xu, Wenxin Xu, Yong Lin, Fu Front Chem Chemistry Artificial sweeteners (AS) can elicit the strong sweet sensation with the low or zero calorie, and are widely used to replace the nutritive sugar in the food and beverage industry. However, the safety issue of current AS is still controversial. Thus, it is imperative to develop more safe and potent AS. Due to the costly and laborious experimental-screening of AS, in-silico sweetener/sweetness prediction could provide a good avenue to identify the potential sweetener candidates before experiment. In this work, we curate the largest dataset of 530 sweeteners and 850 non-sweeteners, and collect the second largest dataset of 352 sweeteners with the relative sweetness (RS) from the literature. In light of these experimental datasets, we adopt five machine-learning methods and conformational-independent molecular fingerprints to derive the classification and regression models for the prediction of sweetener and its RS, respectively via the consensus strategy. Our best classification model achieves the 95% confidence intervals for the accuracy (0.91 ± 0.01), precision (0.90 ± 0.01), specificity (0.94 ± 0.01), sensitivity (0.86 ± 0.01), F1-score (0.88 ± 0.01), and NER (Non-error Rate: 0.90 ± 0.01) on the test set, which outperforms the model (NER = 0.85) of Rojas et al. in terms of NER, and our best regression model gives the 95% confidence intervals for the R(2)(test set) and ΔR(2) [referring to |R(2)(test set)- R(2)(cross-validation)|] of 0.77 ± 0.01 and 0.03 ± 0.01, respectively, which is also better than the other works based on the conformation-independent 2D descriptors (e.g., 2D Dragon) according to R(2)(test set) and ΔR(2). Our models are obtained by averaging over nineteen data-splitting schemes, and fully comply with the guidelines of Organization for Economic Cooperation and Development (OECD), which are not completely followed by the previous relevant works that are all on the basis of only one random data-splitting scheme for the cross-validation set and test set. Finally, we develop a user-friendly platform “e-Sweet” for the automatic prediction of sweetener and its corresponding RS. To our best knowledge, it is a first and free platform that can enable the experimental food scientists to exploit the current machine-learning methods to boost the discovery of more AS with the low or zero calorie content. Frontiers Media S.A. 2019-01-30 /pmc/articles/PMC6363693/ /pubmed/30761295 http://dx.doi.org/10.3389/fchem.2019.00035 Text en Copyright © 2019 Zheng, Chang, Xu, Xu and Lin. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Chemistry
Zheng, Suqing
Chang, Wenping
Xu, Wenxin
Xu, Yong
Lin, Fu
e-Sweet: A Machine-Learning Based Platform for the Prediction of Sweetener and Its Relative Sweetness
title e-Sweet: A Machine-Learning Based Platform for the Prediction of Sweetener and Its Relative Sweetness
title_full e-Sweet: A Machine-Learning Based Platform for the Prediction of Sweetener and Its Relative Sweetness
title_fullStr e-Sweet: A Machine-Learning Based Platform for the Prediction of Sweetener and Its Relative Sweetness
title_full_unstemmed e-Sweet: A Machine-Learning Based Platform for the Prediction of Sweetener and Its Relative Sweetness
title_short e-Sweet: A Machine-Learning Based Platform for the Prediction of Sweetener and Its Relative Sweetness
title_sort e-sweet: a machine-learning based platform for the prediction of sweetener and its relative sweetness
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6363693/
https://www.ncbi.nlm.nih.gov/pubmed/30761295
http://dx.doi.org/10.3389/fchem.2019.00035
work_keys_str_mv AT zhengsuqing esweetamachinelearningbasedplatformforthepredictionofsweeteneranditsrelativesweetness
AT changwenping esweetamachinelearningbasedplatformforthepredictionofsweeteneranditsrelativesweetness
AT xuwenxin esweetamachinelearningbasedplatformforthepredictionofsweeteneranditsrelativesweetness
AT xuyong esweetamachinelearningbasedplatformforthepredictionofsweeteneranditsrelativesweetness
AT linfu esweetamachinelearningbasedplatformforthepredictionofsweeteneranditsrelativesweetness