Cargando…

Efficient discovery of responses of proteins to compounds using active learning

BACKGROUND: Drug discovery and development has been aided by high throughput screening methods that detect compound effects on a single target. However, when using focused initial screening, undesirable secondary effects are often detected late in the development process after significant investment...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kangas, Joshua D, Naik, Armaghan W, Murphy, Robert F
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2014
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4030446/ https://www.ncbi.nlm.nih.gov/pubmed/24884564 http://dx.doi.org/10.1186/1471-2105-15-143

_version_	1782317387077386240
author	Kangas, Joshua D Naik, Armaghan W Murphy, Robert F
author_facet	Kangas, Joshua D Naik, Armaghan W Murphy, Robert F
author_sort	Kangas, Joshua D
collection	PubMed
description	BACKGROUND: Drug discovery and development has been aided by high throughput screening methods that detect compound effects on a single target. However, when using focused initial screening, undesirable secondary effects are often detected late in the development process after significant investment has been made. An alternative approach would be to screen against undesired effects early in the process, but the number of possible secondary targets makes this prohibitively expensive. RESULTS: This paper describes methods for making this global approach practical by constructing predictive models for many target responses to many compounds and using them to guide experimentation. We demonstrate for the first time that by jointly modeling targets and compounds using descriptive features and using active machine learning methods, accurate models can be built by doing only a small fraction of possible experiments. The methods were evaluated by computational experiments using a dataset of 177 assays and 20,000 compounds constructed from the PubChem database. CONCLUSIONS: An average of nearly 60% of all hits in the dataset were found after exploring only 3% of the experimental space which suggests that active learning can be used to enable more complete characterization of compound effects than otherwise affordable. The methods described are also likely to find widespread application outside drug discovery, such as for characterizing the effects of a large number of compounds or inhibitory RNAs on a large number of cell or tissue phenotypes.
format	Online Article Text
id	pubmed-4030446
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-40304462014-06-06 Efficient discovery of responses of proteins to compounds using active learning Kangas, Joshua D Naik, Armaghan W Murphy, Robert F BMC Bioinformatics Research Article BACKGROUND: Drug discovery and development has been aided by high throughput screening methods that detect compound effects on a single target. However, when using focused initial screening, undesirable secondary effects are often detected late in the development process after significant investment has been made. An alternative approach would be to screen against undesired effects early in the process, but the number of possible secondary targets makes this prohibitively expensive. RESULTS: This paper describes methods for making this global approach practical by constructing predictive models for many target responses to many compounds and using them to guide experimentation. We demonstrate for the first time that by jointly modeling targets and compounds using descriptive features and using active machine learning methods, accurate models can be built by doing only a small fraction of possible experiments. The methods were evaluated by computational experiments using a dataset of 177 assays and 20,000 compounds constructed from the PubChem database. CONCLUSIONS: An average of nearly 60% of all hits in the dataset were found after exploring only 3% of the experimental space which suggests that active learning can be used to enable more complete characterization of compound effects than otherwise affordable. The methods described are also likely to find widespread application outside drug discovery, such as for characterizing the effects of a large number of compounds or inhibitory RNAs on a large number of cell or tissue phenotypes. BioMed Central 2014-05-16 /pmc/articles/PMC4030446/ /pubmed/24884564 http://dx.doi.org/10.1186/1471-2105-15-143 Text en Copyright © 2014 Kangas et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Article Kangas, Joshua D Naik, Armaghan W Murphy, Robert F Efficient discovery of responses of proteins to compounds using active learning
title	Efficient discovery of responses of proteins to compounds using active learning
title_full	Efficient discovery of responses of proteins to compounds using active learning
title_fullStr	Efficient discovery of responses of proteins to compounds using active learning
title_full_unstemmed	Efficient discovery of responses of proteins to compounds using active learning
title_short	Efficient discovery of responses of proteins to compounds using active learning
title_sort	efficient discovery of responses of proteins to compounds using active learning
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4030446/ https://www.ncbi.nlm.nih.gov/pubmed/24884564 http://dx.doi.org/10.1186/1471-2105-15-143
work_keys_str_mv	AT kangasjoshuad efficientdiscoveryofresponsesofproteinstocompoundsusingactivelearning AT naikarmaghanw efficientdiscoveryofresponsesofproteinstocompoundsusingactivelearning AT murphyrobertf efficientdiscoveryofresponsesofproteinstocompoundsusingactivelearning

Efficient discovery of responses of proteins to compounds using active learning

Ejemplares similares