Cargando…

Molecular Property Prediction by Combining LSTM and GAT

Molecular property prediction is an important direction in computer-aided drug design. In this paper, to fully explore the information from SMILE stings and graph data of molecules, we combined the SALSTM and GAT methods in order to mine the feature information of molecules from sequences and graphs...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Lei, Pan, Shourun, Xia, Leiming, Li, Zhen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10046625/
https://www.ncbi.nlm.nih.gov/pubmed/36979438
http://dx.doi.org/10.3390/biom13030503
_version_ 1785013720546541568
author Xu, Lei
Pan, Shourun
Xia, Leiming
Li, Zhen
author_facet Xu, Lei
Pan, Shourun
Xia, Leiming
Li, Zhen
author_sort Xu, Lei
collection PubMed
description Molecular property prediction is an important direction in computer-aided drug design. In this paper, to fully explore the information from SMILE stings and graph data of molecules, we combined the SALSTM and GAT methods in order to mine the feature information of molecules from sequences and graphs. The embedding atoms are obtained through SALSTM, firstly using SMILES strings, and they are combined with graph node features and fed into the GAT to extract the global molecular representation. At the same time, data augmentation is added to enlarge the training dataset and improve the performance of the model. Finally, to enhance the interpretability of the model, the attention layers of both models are fused together to highlight the key atoms. Comparison with other graph-based and sequence-based methods, for multiple datasets, shows that our method can achieve high prediction accuracy with good generalizability.
format Online
Article
Text
id pubmed-10046625
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100466252023-03-29 Molecular Property Prediction by Combining LSTM and GAT Xu, Lei Pan, Shourun Xia, Leiming Li, Zhen Biomolecules Article Molecular property prediction is an important direction in computer-aided drug design. In this paper, to fully explore the information from SMILE stings and graph data of molecules, we combined the SALSTM and GAT methods in order to mine the feature information of molecules from sequences and graphs. The embedding atoms are obtained through SALSTM, firstly using SMILES strings, and they are combined with graph node features and fed into the GAT to extract the global molecular representation. At the same time, data augmentation is added to enlarge the training dataset and improve the performance of the model. Finally, to enhance the interpretability of the model, the attention layers of both models are fused together to highlight the key atoms. Comparison with other graph-based and sequence-based methods, for multiple datasets, shows that our method can achieve high prediction accuracy with good generalizability. MDPI 2023-03-09 /pmc/articles/PMC10046625/ /pubmed/36979438 http://dx.doi.org/10.3390/biom13030503 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Xu, Lei
Pan, Shourun
Xia, Leiming
Li, Zhen
Molecular Property Prediction by Combining LSTM and GAT
title Molecular Property Prediction by Combining LSTM and GAT
title_full Molecular Property Prediction by Combining LSTM and GAT
title_fullStr Molecular Property Prediction by Combining LSTM and GAT
title_full_unstemmed Molecular Property Prediction by Combining LSTM and GAT
title_short Molecular Property Prediction by Combining LSTM and GAT
title_sort molecular property prediction by combining lstm and gat
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10046625/
https://www.ncbi.nlm.nih.gov/pubmed/36979438
http://dx.doi.org/10.3390/biom13030503
work_keys_str_mv AT xulei molecularpropertypredictionbycombininglstmandgat
AT panshourun molecularpropertypredictionbycombininglstmandgat
AT xialeiming molecularpropertypredictionbycombininglstmandgat
AT lizhen molecularpropertypredictionbycombininglstmandgat