Cargando…

Investigation of the structure-odor relationship using a Transformer model

The relationships between molecular structures and their properties are subtle and complex, and the properties of odor are no exception. Molecules with similar structures, such as a molecule and its optical isomer, may have completely different odors, whereas molecules with completely distinct struc...

Descripción completa

Detalles Bibliográficos
Autores principales: Zheng, Xiaofan, Tomiura, Yoichi, Hayashi, Kenshi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9798546/
https://www.ncbi.nlm.nih.gov/pubmed/36581889
http://dx.doi.org/10.1186/s13321-022-00671-y
_version_ 1784860927346081792
author Zheng, Xiaofan
Tomiura, Yoichi
Hayashi, Kenshi
author_facet Zheng, Xiaofan
Tomiura, Yoichi
Hayashi, Kenshi
author_sort Zheng, Xiaofan
collection PubMed
description The relationships between molecular structures and their properties are subtle and complex, and the properties of odor are no exception. Molecules with similar structures, such as a molecule and its optical isomer, may have completely different odors, whereas molecules with completely distinct structures may have similar odors. Many works have attempted to explain the molecular structure-odor relationship from chemical and data-driven perspectives. The Transformer model is widely used in natural language processing and computer vision, and the attention mechanism included in the Transformer model can identify relationships between inputs and outputs. In this paper, we describe the construction of a Transformer model for predicting molecular properties and interpreting the prediction results. The SMILES data of 100,000 molecules are collected and used to predict the existence of molecular substructures, and our proposed model achieves an F1 value of 0.98. The attention matrix is visualized to investigate the substructure annotation performance of the attention mechanism, and we find that certain atoms in the target substructures are accurately annotated. Finally, we collect 4462 molecules and their odor descriptors and use the proposed model to infer 98 odor descriptors, obtaining an average F1 value of 0.33. For the 19 odor descriptors that achieved F1 values greater than 0.45, we also attempt to summarize the relationship between the molecular substructures and odor quality through the attention matrix.
format Online
Article
Text
id pubmed-9798546
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-97985462022-12-30 Investigation of the structure-odor relationship using a Transformer model Zheng, Xiaofan Tomiura, Yoichi Hayashi, Kenshi J Cheminform Research The relationships between molecular structures and their properties are subtle and complex, and the properties of odor are no exception. Molecules with similar structures, such as a molecule and its optical isomer, may have completely different odors, whereas molecules with completely distinct structures may have similar odors. Many works have attempted to explain the molecular structure-odor relationship from chemical and data-driven perspectives. The Transformer model is widely used in natural language processing and computer vision, and the attention mechanism included in the Transformer model can identify relationships between inputs and outputs. In this paper, we describe the construction of a Transformer model for predicting molecular properties and interpreting the prediction results. The SMILES data of 100,000 molecules are collected and used to predict the existence of molecular substructures, and our proposed model achieves an F1 value of 0.98. The attention matrix is visualized to investigate the substructure annotation performance of the attention mechanism, and we find that certain atoms in the target substructures are accurately annotated. Finally, we collect 4462 molecules and their odor descriptors and use the proposed model to infer 98 odor descriptors, obtaining an average F1 value of 0.33. For the 19 odor descriptors that achieved F1 values greater than 0.45, we also attempt to summarize the relationship between the molecular substructures and odor quality through the attention matrix. Springer International Publishing 2022-12-29 /pmc/articles/PMC9798546/ /pubmed/36581889 http://dx.doi.org/10.1186/s13321-022-00671-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Zheng, Xiaofan
Tomiura, Yoichi
Hayashi, Kenshi
Investigation of the structure-odor relationship using a Transformer model
title Investigation of the structure-odor relationship using a Transformer model
title_full Investigation of the structure-odor relationship using a Transformer model
title_fullStr Investigation of the structure-odor relationship using a Transformer model
title_full_unstemmed Investigation of the structure-odor relationship using a Transformer model
title_short Investigation of the structure-odor relationship using a Transformer model
title_sort investigation of the structure-odor relationship using a transformer model
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9798546/
https://www.ncbi.nlm.nih.gov/pubmed/36581889
http://dx.doi.org/10.1186/s13321-022-00671-y
work_keys_str_mv AT zhengxiaofan investigationofthestructureodorrelationshipusingatransformermodel
AT tomiurayoichi investigationofthestructureodorrelationshipusingatransformermodel
AT hayashikenshi investigationofthestructureodorrelationshipusingatransformermodel