Cargando…

How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques

A great variety of computational approaches support drug design processes, helping in selection of new potentially active compounds, and optimization of their physicochemical and ADMET properties. Machine learning is a group of methods that are able to evaluate in relatively short time enormous amou...

Descripción completa

Detalles Bibliográficos
Autores principales: Sieradzki, Igor, Leśniak, Damian, Podlewska, Sabina
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7144469/
https://www.ncbi.nlm.nih.gov/pubmed/32210186
http://dx.doi.org/10.3390/molecules25061452
_version_ 1783519838075879424
author Sieradzki, Igor
Leśniak, Damian
Podlewska, Sabina
author_facet Sieradzki, Igor
Leśniak, Damian
Podlewska, Sabina
author_sort Sieradzki, Igor
collection PubMed
description A great variety of computational approaches support drug design processes, helping in selection of new potentially active compounds, and optimization of their physicochemical and ADMET properties. Machine learning is a group of methods that are able to evaluate in relatively short time enormous amounts of data. However, the quality of machine-learning-based prediction depends on the data supplied for model training. In this study, we used deep neural networks for the task of compound activity prediction and developed dropout-based approaches for estimating prediction uncertainty. Several types of analyses were performed: the relationships between the prediction error, similarity to the training set, prediction uncertainty, number and standard deviation of activity values were examined. It was tested whether incorporation of information about prediction uncertainty influences compounds ranking based on predicted activity and prediction uncertainty was used to search for the potential errors in the ChEMBL database. The obtained outcome indicates that incorporation of information about uncertainty of compound activity prediction can be of great help during virtual screening experiments.
format Online
Article
Text
id pubmed-7144469
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-71444692020-04-15 How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques Sieradzki, Igor Leśniak, Damian Podlewska, Sabina Molecules Article A great variety of computational approaches support drug design processes, helping in selection of new potentially active compounds, and optimization of their physicochemical and ADMET properties. Machine learning is a group of methods that are able to evaluate in relatively short time enormous amounts of data. However, the quality of machine-learning-based prediction depends on the data supplied for model training. In this study, we used deep neural networks for the task of compound activity prediction and developed dropout-based approaches for estimating prediction uncertainty. Several types of analyses were performed: the relationships between the prediction error, similarity to the training set, prediction uncertainty, number and standard deviation of activity values were examined. It was tested whether incorporation of information about prediction uncertainty influences compounds ranking based on predicted activity and prediction uncertainty was used to search for the potential errors in the ChEMBL database. The obtained outcome indicates that incorporation of information about uncertainty of compound activity prediction can be of great help during virtual screening experiments. MDPI 2020-03-23 /pmc/articles/PMC7144469/ /pubmed/32210186 http://dx.doi.org/10.3390/molecules25061452 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Sieradzki, Igor
Leśniak, Damian
Podlewska, Sabina
How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques
title How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques
title_full How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques
title_fullStr How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques
title_full_unstemmed How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques
title_short How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques
title_sort how sure can we be about ml methods-based evaluation of compound activity: incorporation of information about prediction uncertainty using deep learning techniques
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7144469/
https://www.ncbi.nlm.nih.gov/pubmed/32210186
http://dx.doi.org/10.3390/molecules25061452
work_keys_str_mv AT sieradzkiigor howsurecanwebeaboutmlmethodsbasedevaluationofcompoundactivityincorporationofinformationaboutpredictionuncertaintyusingdeeplearningtechniques
AT lesniakdamian howsurecanwebeaboutmlmethodsbasedevaluationofcompoundactivityincorporationofinformationaboutpredictionuncertaintyusingdeeplearningtechniques
AT podlewskasabina howsurecanwebeaboutmlmethodsbasedevaluationofcompoundactivityincorporationofinformationaboutpredictionuncertaintyusingdeeplearningtechniques