Cargando…
Molecular Property Prediction by Combining LSTM and GAT
Molecular property prediction is an important direction in computer-aided drug design. In this paper, to fully explore the information from SMILE stings and graph data of molecules, we combined the SALSTM and GAT methods in order to mine the feature information of molecules from sequences and graphs...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10046625/ https://www.ncbi.nlm.nih.gov/pubmed/36979438 http://dx.doi.org/10.3390/biom13030503 |
_version_ | 1785013720546541568 |
---|---|
author | Xu, Lei Pan, Shourun Xia, Leiming Li, Zhen |
author_facet | Xu, Lei Pan, Shourun Xia, Leiming Li, Zhen |
author_sort | Xu, Lei |
collection | PubMed |
description | Molecular property prediction is an important direction in computer-aided drug design. In this paper, to fully explore the information from SMILE stings and graph data of molecules, we combined the SALSTM and GAT methods in order to mine the feature information of molecules from sequences and graphs. The embedding atoms are obtained through SALSTM, firstly using SMILES strings, and they are combined with graph node features and fed into the GAT to extract the global molecular representation. At the same time, data augmentation is added to enlarge the training dataset and improve the performance of the model. Finally, to enhance the interpretability of the model, the attention layers of both models are fused together to highlight the key atoms. Comparison with other graph-based and sequence-based methods, for multiple datasets, shows that our method can achieve high prediction accuracy with good generalizability. |
format | Online Article Text |
id | pubmed-10046625 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-100466252023-03-29 Molecular Property Prediction by Combining LSTM and GAT Xu, Lei Pan, Shourun Xia, Leiming Li, Zhen Biomolecules Article Molecular property prediction is an important direction in computer-aided drug design. In this paper, to fully explore the information from SMILE stings and graph data of molecules, we combined the SALSTM and GAT methods in order to mine the feature information of molecules from sequences and graphs. The embedding atoms are obtained through SALSTM, firstly using SMILES strings, and they are combined with graph node features and fed into the GAT to extract the global molecular representation. At the same time, data augmentation is added to enlarge the training dataset and improve the performance of the model. Finally, to enhance the interpretability of the model, the attention layers of both models are fused together to highlight the key atoms. Comparison with other graph-based and sequence-based methods, for multiple datasets, shows that our method can achieve high prediction accuracy with good generalizability. MDPI 2023-03-09 /pmc/articles/PMC10046625/ /pubmed/36979438 http://dx.doi.org/10.3390/biom13030503 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Xu, Lei Pan, Shourun Xia, Leiming Li, Zhen Molecular Property Prediction by Combining LSTM and GAT |
title | Molecular Property Prediction by Combining LSTM and GAT |
title_full | Molecular Property Prediction by Combining LSTM and GAT |
title_fullStr | Molecular Property Prediction by Combining LSTM and GAT |
title_full_unstemmed | Molecular Property Prediction by Combining LSTM and GAT |
title_short | Molecular Property Prediction by Combining LSTM and GAT |
title_sort | molecular property prediction by combining lstm and gat |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10046625/ https://www.ncbi.nlm.nih.gov/pubmed/36979438 http://dx.doi.org/10.3390/biom13030503 |
work_keys_str_mv | AT xulei molecularpropertypredictionbycombininglstmandgat AT panshourun molecularpropertypredictionbycombininglstmandgat AT xialeiming molecularpropertypredictionbycombininglstmandgat AT lizhen molecularpropertypredictionbycombininglstmandgat |