Predicting human odor perception represented by continuous values from mass spectra of essential oils resembling chemical mixtures

Recent advances allow odor characteristics to be predicted from the molecular structure parameters of chemicals. Although molecular structure parameters are available for individual chemicals, they cannot be used for chemical mixtures. This study presents a computational method for predicting...

Bibliographic Details
Main authors: Debnath, Tanoy; Nakamoto, Takamichi
Format: Online Article Text
Language: English
Published: Public Library of Science 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304616/
https://www.ncbi.nlm.nih.gov/pubmed/32559255
http://dx.doi.org/10.1371/journal.pone.0234688
_version_ 1783548291029401600
author Debnath, Tanoy
Nakamoto, Takamichi
author_facet Debnath, Tanoy
Nakamoto, Takamichi
author_sort Debnath, Tanoy
collection PubMed
description Recent advances allow odor characteristics to be predicted from the molecular structure parameters of chemicals. Although molecular structure parameters are available for individual chemicals, they cannot be used for chemical mixtures. This study presents a computational method for predicting human odor perception from the mass spectra of chemical mixtures such as essential oils. It also proposes a method for obtaining the similarity among odor descriptors even though the dataset contains only binary values. When a database lists a set of odor descriptors for a sample, only binary data are available, and the correlation between similar descriptors is lost. Prediction performance therefore degrades when the similarity among odor descriptors is not considered. Because the mass spectra dataset is high-dimensional, we use an autoencoder to learn a compressed representation of the mass spectra of essential oils in its bottleneck hidden layer, and then perform hierarchical clustering to create groups of odor descriptors with similar odor impressions, using a matrix of continuous value-based correlation coefficients as well as natural language processing. This work explains how to overcome the binary-value problem and how to find the similarity among odor descriptors using machine learning together with natural-language semantic representations of words. To address the imbalance between the positive and negative classes in both the continuous value-based correlation coefficient model and the word-similarity-based model, we use the Synthetic Minority Oversampling Technique (SMOTE). The model allows us to predict human odor perception through computer simulation by forming odor descriptor groups. Accordingly, this study demonstrates the feasibility of combining machine learning with natural language processing and SMOTE to predict odor descriptor groups from the mass spectra of essential oils.
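Two steps named in the description, grouping odor descriptors by hierarchical clustering of a correlation matrix and oversampling the minority class with SMOTE, can be sketched as below. This is a minimal illustration, not the authors' implementation: the function names `group_descriptors` and `smote`, the use of 1 minus the correlation as a distance, average linkage, and the neighbor count `k` are all assumptions made for the example, and the autoencoder and natural language processing steps are left out.

```python
import numpy as np
from scipy.cluster.hierarchy import fcluster, linkage
from scipy.spatial.distance import squareform


def group_descriptors(scores, n_groups):
    """Group odor descriptors (columns of `scores`) by correlation.

    scores: (n_samples, n_descriptors) array of continuous descriptor values.
    Returns one integer group label per descriptor (at most n_groups groups).
    Distance and linkage choices here are assumptions, not taken from the paper.
    """
    corr = np.corrcoef(scores, rowvar=False)   # descriptor-by-descriptor matrix
    dist = np.clip(1.0 - corr, 0.0, None)      # high correlation -> small distance
    dist = (dist + dist.T) / 2.0               # remove floating-point asymmetry
    np.fill_diagonal(dist, 0.0)
    tree = linkage(squareform(dist, checks=False), method="average")
    return fcluster(tree, t=n_groups, criterion="maxclust")


def smote(X_min, n_new, k=5, seed=0):
    """Create n_new synthetic minority samples by neighbor interpolation.

    Each synthetic point lies on the segment between a minority sample and
    one of its k nearest minority neighbors (the core idea of SMOTE).
    Needs at least two minority samples.
    """
    rng = np.random.default_rng(seed)
    X_min = np.asarray(X_min, dtype=float)
    k = min(k, len(X_min) - 1)
    # Pairwise Euclidean distances inside the minority class.
    d = np.linalg.norm(X_min[:, None, :] - X_min[None, :, :], axis=2)
    np.fill_diagonal(d, np.inf)                # a sample is not its own neighbor
    neighbors = np.argsort(d, axis=1)[:, :k]
    base = rng.integers(0, len(X_min), size=n_new)
    picked = neighbors[base, rng.integers(0, k, size=n_new)]
    gap = rng.random((n_new, 1))               # interpolation factor in [0, 1)
    return X_min[base] + gap * (X_min[picked] - X_min[base])


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    demo_scores = rng.random((50, 6))          # made-up descriptor scores
    print(group_descriptors(demo_scores, n_groups=3))
    print(smote(rng.random((8, 4)), n_new=5, k=3).shape)
```

In a pipeline like the one described, the group labels would serve as prediction targets, and `smote` would be applied only to the training split so that synthetic samples do not leak into evaluation.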
format Online
Article
Text
id pubmed-7304616
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-7304616 2020-06-22 Predicting human odor perception represented by continuous values from mass spectra of essential oils resembling chemical mixtures Debnath, Tanoy Nakamoto, Takamichi PLoS One Research Article Public Library of Science 2020-06-19 /pmc/articles/PMC7304616/ /pubmed/32559255 http://dx.doi.org/10.1371/journal.pone.0234688 Text en © 2020 Debnath, Nakamoto http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Debnath, Tanoy
Nakamoto, Takamichi
Predicting human odor perception represented by continuous values from mass spectra of essential oils resembling chemical mixtures
title Predicting human odor perception represented by continuous values from mass spectra of essential oils resembling chemical mixtures
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304616/
https://www.ncbi.nlm.nih.gov/pubmed/32559255
http://dx.doi.org/10.1371/journal.pone.0234688