Cargando…

Brainstorming: weighted voting prediction of inhibitors for protein targets

The “Brainstorming” approach presented in this paper is a weighted voting method that can improve the quality of predictions generated by several machine learning (ML) methods. First, an ensemble of heterogeneous ML algorithms is trained on available experimental data, then all solutions are gathere...

Descripción completa

Detalles Bibliográficos
Autor principal:	Plewczynski, Dariusz
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer-Verlag 2010
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3168748/ https://www.ncbi.nlm.nih.gov/pubmed/20857153 http://dx.doi.org/10.1007/s00894-010-0854-x

_version_	1782211418126286848
author	Plewczynski, Dariusz
author_facet	Plewczynski, Dariusz
author_sort	Plewczynski, Dariusz
collection	PubMed
description	The “Brainstorming” approach presented in this paper is a weighted voting method that can improve the quality of predictions generated by several machine learning (ML) methods. First, an ensemble of heterogeneous ML algorithms is trained on available experimental data, then all solutions are gathered and a consensus is built between them. The final prediction is performed using a voting procedure, whereby the vote of each method is weighted according to a quality coefficient calculated using multivariable linear regression (MLR). The MLR optimization procedure is very fast, therefore no additional computational cost is introduced by using this jury approach. Here, brainstorming is applied to selecting actives from large collections of compounds relating to five diverse biological targets of medicinal interest, namely HIV-reverse transcriptase, cyclooxygenase-2, dihydrofolate reductase, estrogen receptor, and thrombin. The MDL Drug Data Report (MDDR) database was used for selecting known inhibitors for these protein targets, and experimental data was then used to train a set of machine learning methods. The benchmark dataset (available at http://bio.icm.edu.pl/∼darman/chemoinfo/benchmark.tar.gz) can be used for further testing of various clustering and machine learning methods when predicting the biological activity of compounds. Depending on the protein target, the overall recall value is raised by at least 20% in comparison to any single machine learning method (including ensemble methods like random forest) and unweighted simple majority voting procedures.
format	Online Article Text
id	pubmed-3168748
institution	National Center for Biotechnology Information
language	English
publishDate	2010
publisher	Springer-Verlag
record_format	MEDLINE/PubMed
spelling	pubmed-31687482011-09-26 Brainstorming: weighted voting prediction of inhibitors for protein targets Plewczynski, Dariusz J Mol Model Original Paper The “Brainstorming” approach presented in this paper is a weighted voting method that can improve the quality of predictions generated by several machine learning (ML) methods. First, an ensemble of heterogeneous ML algorithms is trained on available experimental data, then all solutions are gathered and a consensus is built between them. The final prediction is performed using a voting procedure, whereby the vote of each method is weighted according to a quality coefficient calculated using multivariable linear regression (MLR). The MLR optimization procedure is very fast, therefore no additional computational cost is introduced by using this jury approach. Here, brainstorming is applied to selecting actives from large collections of compounds relating to five diverse biological targets of medicinal interest, namely HIV-reverse transcriptase, cyclooxygenase-2, dihydrofolate reductase, estrogen receptor, and thrombin. The MDL Drug Data Report (MDDR) database was used for selecting known inhibitors for these protein targets, and experimental data was then used to train a set of machine learning methods. The benchmark dataset (available at http://bio.icm.edu.pl/∼darman/chemoinfo/benchmark.tar.gz) can be used for further testing of various clustering and machine learning methods when predicting the biological activity of compounds. Depending on the protein target, the overall recall value is raised by at least 20% in comparison to any single machine learning method (including ensemble methods like random forest) and unweighted simple majority voting procedures. Springer-Verlag 2010-09-21 2011 /pmc/articles/PMC3168748/ /pubmed/20857153 http://dx.doi.org/10.1007/s00894-010-0854-x Text en © The Author(s) 2010 https://creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.
spellingShingle	Original Paper Plewczynski, Dariusz Brainstorming: weighted voting prediction of inhibitors for protein targets
title	Brainstorming: weighted voting prediction of inhibitors for protein targets
title_full	Brainstorming: weighted voting prediction of inhibitors for protein targets
title_fullStr	Brainstorming: weighted voting prediction of inhibitors for protein targets
title_full_unstemmed	Brainstorming: weighted voting prediction of inhibitors for protein targets
title_short	Brainstorming: weighted voting prediction of inhibitors for protein targets
title_sort	brainstorming: weighted voting prediction of inhibitors for protein targets
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3168748/ https://www.ncbi.nlm.nih.gov/pubmed/20857153 http://dx.doi.org/10.1007/s00894-010-0854-x
work_keys_str_mv	AT plewczynskidariusz brainstormingweightedvotingpredictionofinhibitorsforproteintargets

Brainstorming: weighted voting prediction of inhibitors for protein targets

Ejemplares similares