Cargando…

Algebraic graph-assisted bidirectional transformers for molecular property prediction

The ability of molecular property prediction is of great significance to drug discovery, human health, and environmental protection. Despite considerable efforts, quantitative prediction of various molecular properties remains a challenge. Although some machine learning models, such as bidirectional...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Dong, Gao, Kaifu, Nguyen, Duc Duy, Chen, Xin, Jiang, Yi, Wei, Guo-Wei, Pan, Feng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8192505/
https://www.ncbi.nlm.nih.gov/pubmed/34112777
http://dx.doi.org/10.1038/s41467-021-23720-w
_version_ 1783706060304941056
author Chen, Dong
Gao, Kaifu
Nguyen, Duc Duy
Chen, Xin
Jiang, Yi
Wei, Guo-Wei
Pan, Feng
author_facet Chen, Dong
Gao, Kaifu
Nguyen, Duc Duy
Chen, Xin
Jiang, Yi
Wei, Guo-Wei
Pan, Feng
author_sort Chen, Dong
collection PubMed
description The ability of molecular property prediction is of great significance to drug discovery, human health, and environmental protection. Despite considerable efforts, quantitative prediction of various molecular properties remains a challenge. Although some machine learning models, such as bidirectional encoder from transformer, can incorporate massive unlabeled molecular data into molecular representations via a self-supervised learning strategy, it neglects three-dimensional (3D) stereochemical information. Algebraic graph, specifically, element-specific multiscale weighted colored algebraic graph, embeds complementary 3D molecular information into graph invariants. We propose an algebraic graph-assisted bidirectional transformer (AGBT) framework by fusing representations generated by algebraic graph and bidirectional transformer, as well as a variety of machine learning algorithms, including decision trees, multitask learning, and deep neural networks. We validate the proposed AGBT framework on eight molecular datasets, involving quantitative toxicity, physical chemistry, and physiology datasets. Extensive numerical experiments have shown that AGBT is a state-of-the-art framework for molecular property prediction.
format Online
Article
Text
id pubmed-8192505
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-81925052021-07-01 Algebraic graph-assisted bidirectional transformers for molecular property prediction Chen, Dong Gao, Kaifu Nguyen, Duc Duy Chen, Xin Jiang, Yi Wei, Guo-Wei Pan, Feng Nat Commun Article The ability of molecular property prediction is of great significance to drug discovery, human health, and environmental protection. Despite considerable efforts, quantitative prediction of various molecular properties remains a challenge. Although some machine learning models, such as bidirectional encoder from transformer, can incorporate massive unlabeled molecular data into molecular representations via a self-supervised learning strategy, it neglects three-dimensional (3D) stereochemical information. Algebraic graph, specifically, element-specific multiscale weighted colored algebraic graph, embeds complementary 3D molecular information into graph invariants. We propose an algebraic graph-assisted bidirectional transformer (AGBT) framework by fusing representations generated by algebraic graph and bidirectional transformer, as well as a variety of machine learning algorithms, including decision trees, multitask learning, and deep neural networks. We validate the proposed AGBT framework on eight molecular datasets, involving quantitative toxicity, physical chemistry, and physiology datasets. Extensive numerical experiments have shown that AGBT is a state-of-the-art framework for molecular property prediction. Nature Publishing Group UK 2021-06-10 /pmc/articles/PMC8192505/ /pubmed/34112777 http://dx.doi.org/10.1038/s41467-021-23720-w Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Chen, Dong
Gao, Kaifu
Nguyen, Duc Duy
Chen, Xin
Jiang, Yi
Wei, Guo-Wei
Pan, Feng
Algebraic graph-assisted bidirectional transformers for molecular property prediction
title Algebraic graph-assisted bidirectional transformers for molecular property prediction
title_full Algebraic graph-assisted bidirectional transformers for molecular property prediction
title_fullStr Algebraic graph-assisted bidirectional transformers for molecular property prediction
title_full_unstemmed Algebraic graph-assisted bidirectional transformers for molecular property prediction
title_short Algebraic graph-assisted bidirectional transformers for molecular property prediction
title_sort algebraic graph-assisted bidirectional transformers for molecular property prediction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8192505/
https://www.ncbi.nlm.nih.gov/pubmed/34112777
http://dx.doi.org/10.1038/s41467-021-23720-w
work_keys_str_mv AT chendong algebraicgraphassistedbidirectionaltransformersformolecularpropertyprediction
AT gaokaifu algebraicgraphassistedbidirectionaltransformersformolecularpropertyprediction
AT nguyenducduy algebraicgraphassistedbidirectionaltransformersformolecularpropertyprediction
AT chenxin algebraicgraphassistedbidirectionaltransformersformolecularpropertyprediction
AT jiangyi algebraicgraphassistedbidirectionaltransformersformolecularpropertyprediction
AT weiguowei algebraicgraphassistedbidirectionaltransformersformolecularpropertyprediction
AT panfeng algebraicgraphassistedbidirectionaltransformersformolecularpropertyprediction