Cargando…

Fusing 2D and 3D molecular graphs as unambiguous molecular descriptors for conformational and chiral stereoisomers

The rapid progress of machine learning (ML) in predicting molecular properties enables high-precision predictions being routinely achieved. However, many ML models, such as conventional molecular graph, cannot differentiate stereoisomers of certain types, particularly conformational and chiral ones...

Descripción completa

Detalles Bibliográficos
Autores principales: Du, Wenjie, Yang, Xiaoting, Wu, Di, Ma, FenFen, Zhang, Baicheng, Bao, Chaochao, Huo, Yaoyuan, Jiang, Jun, Chen, Xin, Wang, Yang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9851338/
https://www.ncbi.nlm.nih.gov/pubmed/36528804
http://dx.doi.org/10.1093/bib/bbac560
_version_ 1784872373796732928
author Du, Wenjie
Yang, Xiaoting
Wu, Di
Ma, FenFen
Zhang, Baicheng
Bao, Chaochao
Huo, Yaoyuan
Jiang, Jun
Chen, Xin
Wang, Yang
author_facet Du, Wenjie
Yang, Xiaoting
Wu, Di
Ma, FenFen
Zhang, Baicheng
Bao, Chaochao
Huo, Yaoyuan
Jiang, Jun
Chen, Xin
Wang, Yang
author_sort Du, Wenjie
collection PubMed
description The rapid progress of machine learning (ML) in predicting molecular properties enables high-precision predictions being routinely achieved. However, many ML models, such as conventional molecular graph, cannot differentiate stereoisomers of certain types, particularly conformational and chiral ones that share the same bonding connectivity but differ in spatial arrangement. Here, we designed a hybrid molecular graph network, Chemical Feature Fusion Network (CFFN), to address the issue by integrating planar and stereo information of molecules in an interweaved fashion. The three-dimensional (3D, i.e., stereo) modality guarantees precision and completeness by providing unabridged information, while the two-dimensional (2D, i.e., planar) modality brings in chemical intuitions as prior knowledge for guidance. The zipper-like arrangement of 2D and 3D information processing promotes cooperativity between them, and their synergy is the key to our model’s success. Experiments on various molecules or conformational datasets including a special newly created chiral molecule dataset comprised of various configurations and conformations demonstrate the superior performance of CFFN. The advantage of CFFN is even more significant in datasets made of small samples. Ablation experiments confirm that fusing 2D and 3D molecular graphs as unambiguous molecular descriptors can not only effectively distinguish molecules and their conformations, but also achieve more accurate and robust prediction of quantum chemical properties.
format Online
Article
Text
id pubmed-9851338
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98513382023-01-20 Fusing 2D and 3D molecular graphs as unambiguous molecular descriptors for conformational and chiral stereoisomers Du, Wenjie Yang, Xiaoting Wu, Di Ma, FenFen Zhang, Baicheng Bao, Chaochao Huo, Yaoyuan Jiang, Jun Chen, Xin Wang, Yang Brief Bioinform Problem Solving Protocol The rapid progress of machine learning (ML) in predicting molecular properties enables high-precision predictions being routinely achieved. However, many ML models, such as conventional molecular graph, cannot differentiate stereoisomers of certain types, particularly conformational and chiral ones that share the same bonding connectivity but differ in spatial arrangement. Here, we designed a hybrid molecular graph network, Chemical Feature Fusion Network (CFFN), to address the issue by integrating planar and stereo information of molecules in an interweaved fashion. The three-dimensional (3D, i.e., stereo) modality guarantees precision and completeness by providing unabridged information, while the two-dimensional (2D, i.e., planar) modality brings in chemical intuitions as prior knowledge for guidance. The zipper-like arrangement of 2D and 3D information processing promotes cooperativity between them, and their synergy is the key to our model’s success. Experiments on various molecules or conformational datasets including a special newly created chiral molecule dataset comprised of various configurations and conformations demonstrate the superior performance of CFFN. The advantage of CFFN is even more significant in datasets made of small samples. Ablation experiments confirm that fusing 2D and 3D molecular graphs as unambiguous molecular descriptors can not only effectively distinguish molecules and their conformations, but also achieve more accurate and robust prediction of quantum chemical properties. Oxford University Press 2022-12-18 /pmc/articles/PMC9851338/ /pubmed/36528804 http://dx.doi.org/10.1093/bib/bbac560 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Problem Solving Protocol
Du, Wenjie
Yang, Xiaoting
Wu, Di
Ma, FenFen
Zhang, Baicheng
Bao, Chaochao
Huo, Yaoyuan
Jiang, Jun
Chen, Xin
Wang, Yang
Fusing 2D and 3D molecular graphs as unambiguous molecular descriptors for conformational and chiral stereoisomers
title Fusing 2D and 3D molecular graphs as unambiguous molecular descriptors for conformational and chiral stereoisomers
title_full Fusing 2D and 3D molecular graphs as unambiguous molecular descriptors for conformational and chiral stereoisomers
title_fullStr Fusing 2D and 3D molecular graphs as unambiguous molecular descriptors for conformational and chiral stereoisomers
title_full_unstemmed Fusing 2D and 3D molecular graphs as unambiguous molecular descriptors for conformational and chiral stereoisomers
title_short Fusing 2D and 3D molecular graphs as unambiguous molecular descriptors for conformational and chiral stereoisomers
title_sort fusing 2d and 3d molecular graphs as unambiguous molecular descriptors for conformational and chiral stereoisomers
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9851338/
https://www.ncbi.nlm.nih.gov/pubmed/36528804
http://dx.doi.org/10.1093/bib/bbac560
work_keys_str_mv AT duwenjie fusing2dand3dmoleculargraphsasunambiguousmoleculardescriptorsforconformationalandchiralstereoisomers
AT yangxiaoting fusing2dand3dmoleculargraphsasunambiguousmoleculardescriptorsforconformationalandchiralstereoisomers
AT wudi fusing2dand3dmoleculargraphsasunambiguousmoleculardescriptorsforconformationalandchiralstereoisomers
AT mafenfen fusing2dand3dmoleculargraphsasunambiguousmoleculardescriptorsforconformationalandchiralstereoisomers
AT zhangbaicheng fusing2dand3dmoleculargraphsasunambiguousmoleculardescriptorsforconformationalandchiralstereoisomers
AT baochaochao fusing2dand3dmoleculargraphsasunambiguousmoleculardescriptorsforconformationalandchiralstereoisomers
AT huoyaoyuan fusing2dand3dmoleculargraphsasunambiguousmoleculardescriptorsforconformationalandchiralstereoisomers
AT jiangjun fusing2dand3dmoleculargraphsasunambiguousmoleculardescriptorsforconformationalandchiralstereoisomers
AT chenxin fusing2dand3dmoleculargraphsasunambiguousmoleculardescriptorsforconformationalandchiralstereoisomers
AT wangyang fusing2dand3dmoleculargraphsasunambiguousmoleculardescriptorsforconformationalandchiralstereoisomers