Cargando…

Deep Convolutional Neural Networks for the Prediction of Molecular Properties: Challenges and Opportunities Connected to the Data

We present a flexible deep convolutional neural network method for the analysis of arbitrary sized graph structures representing molecules. This method, which makes use of the Lipinski RDKit module, an open-source cheminformatics software, enables the incorporation of any global molecular (such as m...

Descripción completa

Detalles Bibliográficos
Autores principales: Ståhl, Niclas, Falkman, Göran, Karlsson, Alexander, Mathiason, Gunnar, Boström, Jonas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: De Gruyter 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6798861/
https://www.ncbi.nlm.nih.gov/pubmed/30517077
http://dx.doi.org/10.1515/jib-2018-0065
_version_ 1783460151001350144
author Ståhl, Niclas
Falkman, Göran
Karlsson, Alexander
Mathiason, Gunnar
Boström, Jonas
author_facet Ståhl, Niclas
Falkman, Göran
Karlsson, Alexander
Mathiason, Gunnar
Boström, Jonas
author_sort Ståhl, Niclas
collection PubMed
description We present a flexible deep convolutional neural network method for the analysis of arbitrary sized graph structures representing molecules. This method, which makes use of the Lipinski RDKit module, an open-source cheminformatics software, enables the incorporation of any global molecular (such as molecular charge and molecular weight) and local (such as atom hybridization and bond orders) information. In this paper, we show that this method significantly outperforms another recently proposed method based on deep convolutional neural networks on several datasets that are studied. Several best practices for training deep convolutional neural networks on chemical datasets are also highlighted within the article, such as how to select the information to be included in the model, how to prevent overfitting and how unbalanced classes in the data can be handled.
format Online
Article
Text
id pubmed-6798861
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher De Gruyter
record_format MEDLINE/PubMed
spelling pubmed-67988612019-10-28 Deep Convolutional Neural Networks for the Prediction of Molecular Properties: Challenges and Opportunities Connected to the Data Ståhl, Niclas Falkman, Göran Karlsson, Alexander Mathiason, Gunnar Boström, Jonas J Integr Bioinform Workshops We present a flexible deep convolutional neural network method for the analysis of arbitrary sized graph structures representing molecules. This method, which makes use of the Lipinski RDKit module, an open-source cheminformatics software, enables the incorporation of any global molecular (such as molecular charge and molecular weight) and local (such as atom hybridization and bond orders) information. In this paper, we show that this method significantly outperforms another recently proposed method based on deep convolutional neural networks on several datasets that are studied. Several best practices for training deep convolutional neural networks on chemical datasets are also highlighted within the article, such as how to select the information to be included in the model, how to prevent overfitting and how unbalanced classes in the data can be handled. De Gruyter 2018-12-05 /pmc/articles/PMC6798861/ /pubmed/30517077 http://dx.doi.org/10.1515/jib-2018-0065 Text en ©2019, Niclas Ståhl et al., published by Walter de Gruyter GmbH, Berlin/Boston http://creativecommons.org/licenses/by/4.0 This work is licensed under the Creative Commons Attribution 4.0 Public License.
spellingShingle Workshops
Ståhl, Niclas
Falkman, Göran
Karlsson, Alexander
Mathiason, Gunnar
Boström, Jonas
Deep Convolutional Neural Networks for the Prediction of Molecular Properties: Challenges and Opportunities Connected to the Data
title Deep Convolutional Neural Networks for the Prediction of Molecular Properties: Challenges and Opportunities Connected to the Data
title_full Deep Convolutional Neural Networks for the Prediction of Molecular Properties: Challenges and Opportunities Connected to the Data
title_fullStr Deep Convolutional Neural Networks for the Prediction of Molecular Properties: Challenges and Opportunities Connected to the Data
title_full_unstemmed Deep Convolutional Neural Networks for the Prediction of Molecular Properties: Challenges and Opportunities Connected to the Data
title_short Deep Convolutional Neural Networks for the Prediction of Molecular Properties: Challenges and Opportunities Connected to the Data
title_sort deep convolutional neural networks for the prediction of molecular properties: challenges and opportunities connected to the data
topic Workshops
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6798861/
https://www.ncbi.nlm.nih.gov/pubmed/30517077
http://dx.doi.org/10.1515/jib-2018-0065
work_keys_str_mv AT stahlniclas deepconvolutionalneuralnetworksforthepredictionofmolecularpropertieschallengesandopportunitiesconnectedtothedata
AT falkmangoran deepconvolutionalneuralnetworksforthepredictionofmolecularpropertieschallengesandopportunitiesconnectedtothedata
AT karlssonalexander deepconvolutionalneuralnetworksforthepredictionofmolecularpropertieschallengesandopportunitiesconnectedtothedata
AT mathiasongunnar deepconvolutionalneuralnetworksforthepredictionofmolecularpropertieschallengesandopportunitiesconnectedtothedata
AT bostromjonas deepconvolutionalneuralnetworksforthepredictionofmolecularpropertieschallengesandopportunitiesconnectedtothedata