Cargando…
Predicting human odor perception represented by continuous values from mass spectra of essential oils resembling chemical mixtures
There have been recent advances in predicting odor characteristics using molecular structure parameters of chemicals. Although the molecular structure parameters are available for each chemical, they cannot be used for chemical mixtures. This study will elucidate a computational method of predicting...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304616/ https://www.ncbi.nlm.nih.gov/pubmed/32559255 http://dx.doi.org/10.1371/journal.pone.0234688 |
_version_ | 1783548291029401600 |
---|---|
author | Debnath, Tanoy Nakamoto, Takamichi |
author_facet | Debnath, Tanoy Nakamoto, Takamichi |
author_sort | Debnath, Tanoy |
collection | PubMed |
description | There have been recent advances in predicting odor characteristics using molecular structure parameters of chemicals. Although the molecular structure parameters are available for each chemical, they cannot be used for chemical mixtures. This study will elucidate a computational method of predicting human odor perception from the mass spectra of chemical mixtures such as essential oils. Furthermore, a method for obtaining similarity among odor descriptors has been proposed although the dataset contains binary values only. When the database indicates a set of odor descriptors for one sample, only binary data are available and the correlation between the similar descriptors disappears. Thus, the prediction performance degrades for not considering the similarity among the odor descriptors. Since mass spectra dataset is highly dimensional, we use auto-encoder to learn the compressed representation from the mass spectra of essential oils in its bottleneck hidden layer and then accomplishes the hierarchical clustering to create odor descriptor groups with similar odor impressions using a matrix of continuous value-based correlation coefficient as well as natural language processing. This work will help to expatiate the process of overcoming binary value problem and find out the similarity among odor descriptors using machine learning with natural language semantic representation of words. To overcome the problem of disproportionate ratio of positive and negative class for both the continuous value-based correlation coefficient and word similarity based models, we use Synthetic Minority Oversampling Technique (SMOTE). This model allows us to predict human odor perception through computer simulations by forming odor descriptors group. Accordingly, this study demonstrates the feasibility of ensembling machine learning with natural language processing and SMOTE approach for predicting odor descriptor group from mass spectra of essential oils. |
format | Online Article Text |
id | pubmed-7304616 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-73046162020-06-22 Predicting human odor perception represented by continuous values from mass spectra of essential oils resembling chemical mixtures Debnath, Tanoy Nakamoto, Takamichi PLoS One Research Article There have been recent advances in predicting odor characteristics using molecular structure parameters of chemicals. Although the molecular structure parameters are available for each chemical, they cannot be used for chemical mixtures. This study will elucidate a computational method of predicting human odor perception from the mass spectra of chemical mixtures such as essential oils. Furthermore, a method for obtaining similarity among odor descriptors has been proposed although the dataset contains binary values only. When the database indicates a set of odor descriptors for one sample, only binary data are available and the correlation between the similar descriptors disappears. Thus, the prediction performance degrades for not considering the similarity among the odor descriptors. Since mass spectra dataset is highly dimensional, we use auto-encoder to learn the compressed representation from the mass spectra of essential oils in its bottleneck hidden layer and then accomplishes the hierarchical clustering to create odor descriptor groups with similar odor impressions using a matrix of continuous value-based correlation coefficient as well as natural language processing. This work will help to expatiate the process of overcoming binary value problem and find out the similarity among odor descriptors using machine learning with natural language semantic representation of words. To overcome the problem of disproportionate ratio of positive and negative class for both the continuous value-based correlation coefficient and word similarity based models, we use Synthetic Minority Oversampling Technique (SMOTE). This model allows us to predict human odor perception through computer simulations by forming odor descriptors group. Accordingly, this study demonstrates the feasibility of ensembling machine learning with natural language processing and SMOTE approach for predicting odor descriptor group from mass spectra of essential oils. Public Library of Science 2020-06-19 /pmc/articles/PMC7304616/ /pubmed/32559255 http://dx.doi.org/10.1371/journal.pone.0234688 Text en © 2020 Debnath, Nakamoto http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Debnath, Tanoy Nakamoto, Takamichi Predicting human odor perception represented by continuous values from mass spectra of essential oils resembling chemical mixtures |
title | Predicting human odor perception represented by continuous values from mass spectra of essential oils resembling chemical mixtures |
title_full | Predicting human odor perception represented by continuous values from mass spectra of essential oils resembling chemical mixtures |
title_fullStr | Predicting human odor perception represented by continuous values from mass spectra of essential oils resembling chemical mixtures |
title_full_unstemmed | Predicting human odor perception represented by continuous values from mass spectra of essential oils resembling chemical mixtures |
title_short | Predicting human odor perception represented by continuous values from mass spectra of essential oils resembling chemical mixtures |
title_sort | predicting human odor perception represented by continuous values from mass spectra of essential oils resembling chemical mixtures |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304616/ https://www.ncbi.nlm.nih.gov/pubmed/32559255 http://dx.doi.org/10.1371/journal.pone.0234688 |
work_keys_str_mv | AT debnathtanoy predictinghumanodorperceptionrepresentedbycontinuousvaluesfrommassspectraofessentialoilsresemblingchemicalmixtures AT nakamototakamichi predictinghumanodorperceptionrepresentedbycontinuousvaluesfrommassspectraofessentialoilsresemblingchemicalmixtures |