Cargando…
Fusing 2D and 3D molecular graphs as unambiguous molecular descriptors for conformational and chiral stereoisomers
The rapid progress of machine learning (ML) in predicting molecular properties enables high-precision predictions being routinely achieved. However, many ML models, such as conventional molecular graph, cannot differentiate stereoisomers of certain types, particularly conformational and chiral ones...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9851338/ https://www.ncbi.nlm.nih.gov/pubmed/36528804 http://dx.doi.org/10.1093/bib/bbac560 |
_version_ | 1784872373796732928 |
---|---|
author | Du, Wenjie Yang, Xiaoting Wu, Di Ma, FenFen Zhang, Baicheng Bao, Chaochao Huo, Yaoyuan Jiang, Jun Chen, Xin Wang, Yang |
author_facet | Du, Wenjie Yang, Xiaoting Wu, Di Ma, FenFen Zhang, Baicheng Bao, Chaochao Huo, Yaoyuan Jiang, Jun Chen, Xin Wang, Yang |
author_sort | Du, Wenjie |
collection | PubMed |
description | The rapid progress of machine learning (ML) in predicting molecular properties enables high-precision predictions being routinely achieved. However, many ML models, such as conventional molecular graph, cannot differentiate stereoisomers of certain types, particularly conformational and chiral ones that share the same bonding connectivity but differ in spatial arrangement. Here, we designed a hybrid molecular graph network, Chemical Feature Fusion Network (CFFN), to address the issue by integrating planar and stereo information of molecules in an interweaved fashion. The three-dimensional (3D, i.e., stereo) modality guarantees precision and completeness by providing unabridged information, while the two-dimensional (2D, i.e., planar) modality brings in chemical intuitions as prior knowledge for guidance. The zipper-like arrangement of 2D and 3D information processing promotes cooperativity between them, and their synergy is the key to our model’s success. Experiments on various molecules or conformational datasets including a special newly created chiral molecule dataset comprised of various configurations and conformations demonstrate the superior performance of CFFN. The advantage of CFFN is even more significant in datasets made of small samples. Ablation experiments confirm that fusing 2D and 3D molecular graphs as unambiguous molecular descriptors can not only effectively distinguish molecules and their conformations, but also achieve more accurate and robust prediction of quantum chemical properties. |
format | Online Article Text |
id | pubmed-9851338 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-98513382023-01-20 Fusing 2D and 3D molecular graphs as unambiguous molecular descriptors for conformational and chiral stereoisomers Du, Wenjie Yang, Xiaoting Wu, Di Ma, FenFen Zhang, Baicheng Bao, Chaochao Huo, Yaoyuan Jiang, Jun Chen, Xin Wang, Yang Brief Bioinform Problem Solving Protocol The rapid progress of machine learning (ML) in predicting molecular properties enables high-precision predictions being routinely achieved. However, many ML models, such as conventional molecular graph, cannot differentiate stereoisomers of certain types, particularly conformational and chiral ones that share the same bonding connectivity but differ in spatial arrangement. Here, we designed a hybrid molecular graph network, Chemical Feature Fusion Network (CFFN), to address the issue by integrating planar and stereo information of molecules in an interweaved fashion. The three-dimensional (3D, i.e., stereo) modality guarantees precision and completeness by providing unabridged information, while the two-dimensional (2D, i.e., planar) modality brings in chemical intuitions as prior knowledge for guidance. The zipper-like arrangement of 2D and 3D information processing promotes cooperativity between them, and their synergy is the key to our model’s success. Experiments on various molecules or conformational datasets including a special newly created chiral molecule dataset comprised of various configurations and conformations demonstrate the superior performance of CFFN. The advantage of CFFN is even more significant in datasets made of small samples. Ablation experiments confirm that fusing 2D and 3D molecular graphs as unambiguous molecular descriptors can not only effectively distinguish molecules and their conformations, but also achieve more accurate and robust prediction of quantum chemical properties. Oxford University Press 2022-12-18 /pmc/articles/PMC9851338/ /pubmed/36528804 http://dx.doi.org/10.1093/bib/bbac560 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Problem Solving Protocol Du, Wenjie Yang, Xiaoting Wu, Di Ma, FenFen Zhang, Baicheng Bao, Chaochao Huo, Yaoyuan Jiang, Jun Chen, Xin Wang, Yang Fusing 2D and 3D molecular graphs as unambiguous molecular descriptors for conformational and chiral stereoisomers |
title | Fusing 2D and 3D molecular graphs as unambiguous molecular descriptors for conformational and chiral stereoisomers |
title_full | Fusing 2D and 3D molecular graphs as unambiguous molecular descriptors for conformational and chiral stereoisomers |
title_fullStr | Fusing 2D and 3D molecular graphs as unambiguous molecular descriptors for conformational and chiral stereoisomers |
title_full_unstemmed | Fusing 2D and 3D molecular graphs as unambiguous molecular descriptors for conformational and chiral stereoisomers |
title_short | Fusing 2D and 3D molecular graphs as unambiguous molecular descriptors for conformational and chiral stereoisomers |
title_sort | fusing 2d and 3d molecular graphs as unambiguous molecular descriptors for conformational and chiral stereoisomers |
topic | Problem Solving Protocol |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9851338/ https://www.ncbi.nlm.nih.gov/pubmed/36528804 http://dx.doi.org/10.1093/bib/bbac560 |
work_keys_str_mv | AT duwenjie fusing2dand3dmoleculargraphsasunambiguousmoleculardescriptorsforconformationalandchiralstereoisomers AT yangxiaoting fusing2dand3dmoleculargraphsasunambiguousmoleculardescriptorsforconformationalandchiralstereoisomers AT wudi fusing2dand3dmoleculargraphsasunambiguousmoleculardescriptorsforconformationalandchiralstereoisomers AT mafenfen fusing2dand3dmoleculargraphsasunambiguousmoleculardescriptorsforconformationalandchiralstereoisomers AT zhangbaicheng fusing2dand3dmoleculargraphsasunambiguousmoleculardescriptorsforconformationalandchiralstereoisomers AT baochaochao fusing2dand3dmoleculargraphsasunambiguousmoleculardescriptorsforconformationalandchiralstereoisomers AT huoyaoyuan fusing2dand3dmoleculargraphsasunambiguousmoleculardescriptorsforconformationalandchiralstereoisomers AT jiangjun fusing2dand3dmoleculargraphsasunambiguousmoleculardescriptorsforconformationalandchiralstereoisomers AT chenxin fusing2dand3dmoleculargraphsasunambiguousmoleculardescriptorsforconformationalandchiralstereoisomers AT wangyang fusing2dand3dmoleculargraphsasunambiguousmoleculardescriptorsforconformationalandchiralstereoisomers |