Cargando…

Smell compounds classification using UMAP to increase knowledge of odors and molecular structures linkages

This study aims to highlight the relationships between the structure of smell compounds and their odors. For this purpose, heterogeneous data sources were screened, and 6038 odorant compounds and their known associated odors (162 odor notes) were compiled, each individual molecule being represented...

Descripción completa

Detalles Bibliográficos
Autores principales: Rugard, Marylène, Jaylet, Thomas, Taboureau, Olivier, Tromelin, Anne, Audouze, Karine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8162648/
https://www.ncbi.nlm.nih.gov/pubmed/34048487
http://dx.doi.org/10.1371/journal.pone.0252486
_version_ 1783700760790302720
author Rugard, Marylène
Jaylet, Thomas
Taboureau, Olivier
Tromelin, Anne
Audouze, Karine
author_facet Rugard, Marylène
Jaylet, Thomas
Taboureau, Olivier
Tromelin, Anne
Audouze, Karine
author_sort Rugard, Marylène
collection PubMed
description This study aims to highlight the relationships between the structure of smell compounds and their odors. For this purpose, heterogeneous data sources were screened, and 6038 odorant compounds and their known associated odors (162 odor notes) were compiled, each individual molecule being represented with a set of 1024 structural fingerprint. Several dimensional reduction techniques (PCA, MDS, t-SNE and UMAP) with two clustering methods (k-means and agglomerative hierarchical clustering AHC) were assessed based on the calculated fingerprints. The combination of UMAP with k-means and AHC methods allowed to obtain a good representativeness of odors by clusters, as well as the best visualization of the proximity of odorants on the basis of their molecular structures. The presence or absence of molecular substructures has been calculated on odorant in order to link chemical groups to odors. The results of this analysis bring out some associations for both the odor notes and the chemical structures of the molecules such as “woody” and “spicy” notes with allylic and bicyclic structures, “balsamic” notes with unsaturated rings, both “sulfurous” and “citrus” with aldehydes, alcohols, carboxylic acids, amines and sulfur compounds, and “oily”, “fatty” and “fruity” characterized by esters and with long carbon chains. Overall, the use of UMAP associated to clustering is a promising method to suggest hypotheses on the odorant structure-odor relationships.
format Online
Article
Text
id pubmed-8162648
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-81626482021-06-10 Smell compounds classification using UMAP to increase knowledge of odors and molecular structures linkages Rugard, Marylène Jaylet, Thomas Taboureau, Olivier Tromelin, Anne Audouze, Karine PLoS One Research Article This study aims to highlight the relationships between the structure of smell compounds and their odors. For this purpose, heterogeneous data sources were screened, and 6038 odorant compounds and their known associated odors (162 odor notes) were compiled, each individual molecule being represented with a set of 1024 structural fingerprint. Several dimensional reduction techniques (PCA, MDS, t-SNE and UMAP) with two clustering methods (k-means and agglomerative hierarchical clustering AHC) were assessed based on the calculated fingerprints. The combination of UMAP with k-means and AHC methods allowed to obtain a good representativeness of odors by clusters, as well as the best visualization of the proximity of odorants on the basis of their molecular structures. The presence or absence of molecular substructures has been calculated on odorant in order to link chemical groups to odors. The results of this analysis bring out some associations for both the odor notes and the chemical structures of the molecules such as “woody” and “spicy” notes with allylic and bicyclic structures, “balsamic” notes with unsaturated rings, both “sulfurous” and “citrus” with aldehydes, alcohols, carboxylic acids, amines and sulfur compounds, and “oily”, “fatty” and “fruity” characterized by esters and with long carbon chains. Overall, the use of UMAP associated to clustering is a promising method to suggest hypotheses on the odorant structure-odor relationships. Public Library of Science 2021-05-28 /pmc/articles/PMC8162648/ /pubmed/34048487 http://dx.doi.org/10.1371/journal.pone.0252486 Text en © 2021 Rugard et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Rugard, Marylène
Jaylet, Thomas
Taboureau, Olivier
Tromelin, Anne
Audouze, Karine
Smell compounds classification using UMAP to increase knowledge of odors and molecular structures linkages
title Smell compounds classification using UMAP to increase knowledge of odors and molecular structures linkages
title_full Smell compounds classification using UMAP to increase knowledge of odors and molecular structures linkages
title_fullStr Smell compounds classification using UMAP to increase knowledge of odors and molecular structures linkages
title_full_unstemmed Smell compounds classification using UMAP to increase knowledge of odors and molecular structures linkages
title_short Smell compounds classification using UMAP to increase knowledge of odors and molecular structures linkages
title_sort smell compounds classification using umap to increase knowledge of odors and molecular structures linkages
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8162648/
https://www.ncbi.nlm.nih.gov/pubmed/34048487
http://dx.doi.org/10.1371/journal.pone.0252486
work_keys_str_mv AT rugardmarylene smellcompoundsclassificationusingumaptoincreaseknowledgeofodorsandmolecularstructureslinkages
AT jayletthomas smellcompoundsclassificationusingumaptoincreaseknowledgeofodorsandmolecularstructureslinkages
AT taboureauolivier smellcompoundsclassificationusingumaptoincreaseknowledgeofodorsandmolecularstructureslinkages
AT tromelinanne smellcompoundsclassificationusingumaptoincreaseknowledgeofodorsandmolecularstructureslinkages
AT audouzekarine smellcompoundsclassificationusingumaptoincreaseknowledgeofodorsandmolecularstructureslinkages