Cargando…

CRNNTL: Convolutional Recurrent Neural Network and Transfer Learning for QSAR Modeling in Organic Drug and Material Discovery

Molecular latent representations, derived from autoencoders (AEs), have been widely used for drug or material discovery over the past couple of years. In particular, a variety of machine learning methods based on latent representations have shown excellent performance on quantitative structure–activ...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Yaqin, Xu, Yongjin, Yu, Yi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8658888/
https://www.ncbi.nlm.nih.gov/pubmed/34885843
http://dx.doi.org/10.3390/molecules26237257
_version_ 1784612834795061248
author Li, Yaqin
Xu, Yongjin
Yu, Yi
author_facet Li, Yaqin
Xu, Yongjin
Yu, Yi
author_sort Li, Yaqin
collection PubMed
description Molecular latent representations, derived from autoencoders (AEs), have been widely used for drug or material discovery over the past couple of years. In particular, a variety of machine learning methods based on latent representations have shown excellent performance on quantitative structure–activity relationship (QSAR) modeling. However, the sequence feature of them has not been considered in most cases. In addition, data scarcity is still the main obstacle for deep learning strategies, especially for bioactivity datasets. In this study, we propose the convolutional recurrent neural network and transfer learning (CRNNTL) method inspired by the applications of polyphonic sound detection and electrocardiogram classification. Our model takes advantage of both convolutional and recurrent neural networks for feature extraction, as well as the data augmentation method. According to QSAR modeling on 27 datasets, CRNNTL can outperform or compete with state-of-art methods in both drug and material properties. In addition, the performances on one isomers-based dataset indicate that its excellent performance results from the improved ability in global feature extraction when the ability of the local one is maintained. Then, the transfer learning results show that CRNNTL can overcome data scarcity when choosing relative source datasets. Finally, the high versatility of our model is shown by using different latent representations as inputs from other types of AEs.
format Online
Article
Text
id pubmed-8658888
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-86588882021-12-10 CRNNTL: Convolutional Recurrent Neural Network and Transfer Learning for QSAR Modeling in Organic Drug and Material Discovery Li, Yaqin Xu, Yongjin Yu, Yi Molecules Article Molecular latent representations, derived from autoencoders (AEs), have been widely used for drug or material discovery over the past couple of years. In particular, a variety of machine learning methods based on latent representations have shown excellent performance on quantitative structure–activity relationship (QSAR) modeling. However, the sequence feature of them has not been considered in most cases. In addition, data scarcity is still the main obstacle for deep learning strategies, especially for bioactivity datasets. In this study, we propose the convolutional recurrent neural network and transfer learning (CRNNTL) method inspired by the applications of polyphonic sound detection and electrocardiogram classification. Our model takes advantage of both convolutional and recurrent neural networks for feature extraction, as well as the data augmentation method. According to QSAR modeling on 27 datasets, CRNNTL can outperform or compete with state-of-art methods in both drug and material properties. In addition, the performances on one isomers-based dataset indicate that its excellent performance results from the improved ability in global feature extraction when the ability of the local one is maintained. Then, the transfer learning results show that CRNNTL can overcome data scarcity when choosing relative source datasets. Finally, the high versatility of our model is shown by using different latent representations as inputs from other types of AEs. MDPI 2021-11-30 /pmc/articles/PMC8658888/ /pubmed/34885843 http://dx.doi.org/10.3390/molecules26237257 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Li, Yaqin
Xu, Yongjin
Yu, Yi
CRNNTL: Convolutional Recurrent Neural Network and Transfer Learning for QSAR Modeling in Organic Drug and Material Discovery
title CRNNTL: Convolutional Recurrent Neural Network and Transfer Learning for QSAR Modeling in Organic Drug and Material Discovery
title_full CRNNTL: Convolutional Recurrent Neural Network and Transfer Learning for QSAR Modeling in Organic Drug and Material Discovery
title_fullStr CRNNTL: Convolutional Recurrent Neural Network and Transfer Learning for QSAR Modeling in Organic Drug and Material Discovery
title_full_unstemmed CRNNTL: Convolutional Recurrent Neural Network and Transfer Learning for QSAR Modeling in Organic Drug and Material Discovery
title_short CRNNTL: Convolutional Recurrent Neural Network and Transfer Learning for QSAR Modeling in Organic Drug and Material Discovery
title_sort crnntl: convolutional recurrent neural network and transfer learning for qsar modeling in organic drug and material discovery
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8658888/
https://www.ncbi.nlm.nih.gov/pubmed/34885843
http://dx.doi.org/10.3390/molecules26237257
work_keys_str_mv AT liyaqin crnntlconvolutionalrecurrentneuralnetworkandtransferlearningforqsarmodelinginorganicdrugandmaterialdiscovery
AT xuyongjin crnntlconvolutionalrecurrentneuralnetworkandtransferlearningforqsarmodelinginorganicdrugandmaterialdiscovery
AT yuyi crnntlconvolutionalrecurrentneuralnetworkandtransferlearningforqsarmodelinginorganicdrugandmaterialdiscovery