Cargando…
Deep Convolutional Neural Networks for the Prediction of Molecular Properties: Challenges and Opportunities Connected to the Data
We present a flexible deep convolutional neural network method for the analysis of arbitrary sized graph structures representing molecules. This method, which makes use of the Lipinski RDKit module, an open-source cheminformatics software, enables the incorporation of any global molecular (such as m...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
De Gruyter
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6798861/ https://www.ncbi.nlm.nih.gov/pubmed/30517077 http://dx.doi.org/10.1515/jib-2018-0065 |
_version_ | 1783460151001350144 |
---|---|
author | Ståhl, Niclas Falkman, Göran Karlsson, Alexander Mathiason, Gunnar Boström, Jonas |
author_facet | Ståhl, Niclas Falkman, Göran Karlsson, Alexander Mathiason, Gunnar Boström, Jonas |
author_sort | Ståhl, Niclas |
collection | PubMed |
description | We present a flexible deep convolutional neural network method for the analysis of arbitrary sized graph structures representing molecules. This method, which makes use of the Lipinski RDKit module, an open-source cheminformatics software, enables the incorporation of any global molecular (such as molecular charge and molecular weight) and local (such as atom hybridization and bond orders) information. In this paper, we show that this method significantly outperforms another recently proposed method based on deep convolutional neural networks on several datasets that are studied. Several best practices for training deep convolutional neural networks on chemical datasets are also highlighted within the article, such as how to select the information to be included in the model, how to prevent overfitting and how unbalanced classes in the data can be handled. |
format | Online Article Text |
id | pubmed-6798861 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | De Gruyter |
record_format | MEDLINE/PubMed |
spelling | pubmed-67988612019-10-28 Deep Convolutional Neural Networks for the Prediction of Molecular Properties: Challenges and Opportunities Connected to the Data Ståhl, Niclas Falkman, Göran Karlsson, Alexander Mathiason, Gunnar Boström, Jonas J Integr Bioinform Workshops We present a flexible deep convolutional neural network method for the analysis of arbitrary sized graph structures representing molecules. This method, which makes use of the Lipinski RDKit module, an open-source cheminformatics software, enables the incorporation of any global molecular (such as molecular charge and molecular weight) and local (such as atom hybridization and bond orders) information. In this paper, we show that this method significantly outperforms another recently proposed method based on deep convolutional neural networks on several datasets that are studied. Several best practices for training deep convolutional neural networks on chemical datasets are also highlighted within the article, such as how to select the information to be included in the model, how to prevent overfitting and how unbalanced classes in the data can be handled. De Gruyter 2018-12-05 /pmc/articles/PMC6798861/ /pubmed/30517077 http://dx.doi.org/10.1515/jib-2018-0065 Text en ©2019, Niclas Ståhl et al., published by Walter de Gruyter GmbH, Berlin/Boston http://creativecommons.org/licenses/by/4.0 This work is licensed under the Creative Commons Attribution 4.0 Public License. |
spellingShingle | Workshops Ståhl, Niclas Falkman, Göran Karlsson, Alexander Mathiason, Gunnar Boström, Jonas Deep Convolutional Neural Networks for the Prediction of Molecular Properties: Challenges and Opportunities Connected to the Data |
title | Deep Convolutional Neural Networks for the Prediction of Molecular Properties: Challenges and Opportunities Connected to the Data |
title_full | Deep Convolutional Neural Networks for the Prediction of Molecular Properties: Challenges and Opportunities Connected to the Data |
title_fullStr | Deep Convolutional Neural Networks for the Prediction of Molecular Properties: Challenges and Opportunities Connected to the Data |
title_full_unstemmed | Deep Convolutional Neural Networks for the Prediction of Molecular Properties: Challenges and Opportunities Connected to the Data |
title_short | Deep Convolutional Neural Networks for the Prediction of Molecular Properties: Challenges and Opportunities Connected to the Data |
title_sort | deep convolutional neural networks for the prediction of molecular properties: challenges and opportunities connected to the data |
topic | Workshops |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6798861/ https://www.ncbi.nlm.nih.gov/pubmed/30517077 http://dx.doi.org/10.1515/jib-2018-0065 |
work_keys_str_mv | AT stahlniclas deepconvolutionalneuralnetworksforthepredictionofmolecularpropertieschallengesandopportunitiesconnectedtothedata AT falkmangoran deepconvolutionalneuralnetworksforthepredictionofmolecularpropertieschallengesandopportunitiesconnectedtothedata AT karlssonalexander deepconvolutionalneuralnetworksforthepredictionofmolecularpropertieschallengesandopportunitiesconnectedtothedata AT mathiasongunnar deepconvolutionalneuralnetworksforthepredictionofmolecularpropertieschallengesandopportunitiesconnectedtothedata AT bostromjonas deepconvolutionalneuralnetworksforthepredictionofmolecularpropertieschallengesandopportunitiesconnectedtothedata |