Cargando…

Validation Study of QSAR/DNN Models Using the Competition Datasets

Since the QSAR/DNN model showed predominant predictive performance over other conventional methods in the Kaggle QSAR competition, many artificial neural network (ANN) methods have been applied to drug and material discovery. Appearance of artificial intelligence (AI), which is combined various gene...

Descripción completa

Detalles Bibliográficos
Autores principales: Kato, Yoshiki, Hamada, Shinji, Goto, Hitoshi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7050538/
https://www.ncbi.nlm.nih.gov/pubmed/31802634
http://dx.doi.org/10.1002/minf.201900154
_version_ 1783502629477810176
author Kato, Yoshiki
Hamada, Shinji
Goto, Hitoshi
author_facet Kato, Yoshiki
Hamada, Shinji
Goto, Hitoshi
author_sort Kato, Yoshiki
collection PubMed
description Since the QSAR/DNN model showed predominant predictive performance over other conventional methods in the Kaggle QSAR competition, many artificial neural network (ANN) methods have been applied to drug and material discovery. Appearance of artificial intelligence (AI), which is combined various general purpose ANN platforms with large‐scale open access chemical databases, has attracting great interest and expectation in a wide range of molecular sciences. In this study, we investigate various DNN settings in order to reach a high‐level of predictive performance comparable to the champion team of the competition, even with a general purpose ANN platform, and introduce the Meister setting for constructing a good QSAR/DNNs model. Here, we have used the most commonly available DNN model and constructed many QSAR/DNN models trained with various DNN settings by using the 15 datasets employed in the competition. As a result, it was confirmed that we can constructed the QSAR/DNN model that shows the same level of R2 performance as the champion team. The difference from the DNN setting recommended by the champion team was to reduce the mini‐batch size. We have also explained that the R2 performance of each target depends on the molecular activity type, which is related to the complexity of biological mechanisms and chemical processes observed in molecular activity measurements.
format Online
Article
Text
id pubmed-7050538
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-70505382020-03-09 Validation Study of QSAR/DNN Models Using the Competition Datasets Kato, Yoshiki Hamada, Shinji Goto, Hitoshi Mol Inform Full Papers Since the QSAR/DNN model showed predominant predictive performance over other conventional methods in the Kaggle QSAR competition, many artificial neural network (ANN) methods have been applied to drug and material discovery. Appearance of artificial intelligence (AI), which is combined various general purpose ANN platforms with large‐scale open access chemical databases, has attracting great interest and expectation in a wide range of molecular sciences. In this study, we investigate various DNN settings in order to reach a high‐level of predictive performance comparable to the champion team of the competition, even with a general purpose ANN platform, and introduce the Meister setting for constructing a good QSAR/DNNs model. Here, we have used the most commonly available DNN model and constructed many QSAR/DNN models trained with various DNN settings by using the 15 datasets employed in the competition. As a result, it was confirmed that we can constructed the QSAR/DNN model that shows the same level of R2 performance as the champion team. The difference from the DNN setting recommended by the champion team was to reduce the mini‐batch size. We have also explained that the R2 performance of each target depends on the molecular activity type, which is related to the complexity of biological mechanisms and chemical processes observed in molecular activity measurements. John Wiley and Sons Inc. 2019-12-18 2020-01 /pmc/articles/PMC7050538/ /pubmed/31802634 http://dx.doi.org/10.1002/minf.201900154 Text en © 2019 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Full Papers
Kato, Yoshiki
Hamada, Shinji
Goto, Hitoshi
Validation Study of QSAR/DNN Models Using the Competition Datasets
title Validation Study of QSAR/DNN Models Using the Competition Datasets
title_full Validation Study of QSAR/DNN Models Using the Competition Datasets
title_fullStr Validation Study of QSAR/DNN Models Using the Competition Datasets
title_full_unstemmed Validation Study of QSAR/DNN Models Using the Competition Datasets
title_short Validation Study of QSAR/DNN Models Using the Competition Datasets
title_sort validation study of qsar/dnn models using the competition datasets
topic Full Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7050538/
https://www.ncbi.nlm.nih.gov/pubmed/31802634
http://dx.doi.org/10.1002/minf.201900154
work_keys_str_mv AT katoyoshiki validationstudyofqsardnnmodelsusingthecompetitiondatasets
AT hamadashinji validationstudyofqsardnnmodelsusingthecompetitiondatasets
AT gotohitoshi validationstudyofqsardnnmodelsusingthecompetitiondatasets