Cargando…

Active machine learning-driven experimentation to determine compound effects on protein patterns

High throughput screening determines the effects of many conditions on a given biological target. Currently, to estimate the effects of those conditions on other targets requires either strong modeling assumptions (e.g. similarities among targets) or separate screens. Ideally, data-driven experiment...

Descripción completa

Detalles Bibliográficos
Autores principales: Naik, Armaghan W, Kangas, Joshua D, Sullivan, Devin P, Murphy, Robert F
Formato: Online Artículo Texto
Lenguaje:English
Publicado: eLife Sciences Publications, Ltd 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4798950/
https://www.ncbi.nlm.nih.gov/pubmed/26840049
http://dx.doi.org/10.7554/eLife.10047
_version_ 1782422242552971264
author Naik, Armaghan W
Kangas, Joshua D
Sullivan, Devin P
Murphy, Robert F
author_facet Naik, Armaghan W
Kangas, Joshua D
Sullivan, Devin P
Murphy, Robert F
author_sort Naik, Armaghan W
collection PubMed
description High throughput screening determines the effects of many conditions on a given biological target. Currently, to estimate the effects of those conditions on other targets requires either strong modeling assumptions (e.g. similarities among targets) or separate screens. Ideally, data-driven experimentation could be used to learn accurate models for many conditions and targets without doing all possible experiments. We have previously described an active machine learning algorithm that can iteratively choose small sets of experiments to learn models of multiple effects. We now show that, with no prior knowledge and with liquid handling robotics and automated microscopy under its control, this learner accurately learned the effects of 48 chemical compounds on the subcellular localization of 48 proteins while performing only 29% of all possible experiments. The results represent the first practical demonstration of the utility of active learning-driven biological experimentation in which the set of possible phenotypes is unknown in advance. DOI: http://dx.doi.org/10.7554/eLife.10047.001
format Online
Article
Text
id pubmed-4798950
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher eLife Sciences Publications, Ltd
record_format MEDLINE/PubMed
spelling pubmed-47989502016-03-21 Active machine learning-driven experimentation to determine compound effects on protein patterns Naik, Armaghan W Kangas, Joshua D Sullivan, Devin P Murphy, Robert F eLife Cell Biology High throughput screening determines the effects of many conditions on a given biological target. Currently, to estimate the effects of those conditions on other targets requires either strong modeling assumptions (e.g. similarities among targets) or separate screens. Ideally, data-driven experimentation could be used to learn accurate models for many conditions and targets without doing all possible experiments. We have previously described an active machine learning algorithm that can iteratively choose small sets of experiments to learn models of multiple effects. We now show that, with no prior knowledge and with liquid handling robotics and automated microscopy under its control, this learner accurately learned the effects of 48 chemical compounds on the subcellular localization of 48 proteins while performing only 29% of all possible experiments. The results represent the first practical demonstration of the utility of active learning-driven biological experimentation in which the set of possible phenotypes is unknown in advance. DOI: http://dx.doi.org/10.7554/eLife.10047.001 eLife Sciences Publications, Ltd 2016-02-03 /pmc/articles/PMC4798950/ /pubmed/26840049 http://dx.doi.org/10.7554/eLife.10047 Text en © 2016, Naik et al http://creativecommons.org/licenses/by/4.0/ This article is distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use and redistribution provided that the original author and source are credited.
spellingShingle Cell Biology
Naik, Armaghan W
Kangas, Joshua D
Sullivan, Devin P
Murphy, Robert F
Active machine learning-driven experimentation to determine compound effects on protein patterns
title Active machine learning-driven experimentation to determine compound effects on protein patterns
title_full Active machine learning-driven experimentation to determine compound effects on protein patterns
title_fullStr Active machine learning-driven experimentation to determine compound effects on protein patterns
title_full_unstemmed Active machine learning-driven experimentation to determine compound effects on protein patterns
title_short Active machine learning-driven experimentation to determine compound effects on protein patterns
title_sort active machine learning-driven experimentation to determine compound effects on protein patterns
topic Cell Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4798950/
https://www.ncbi.nlm.nih.gov/pubmed/26840049
http://dx.doi.org/10.7554/eLife.10047
work_keys_str_mv AT naikarmaghanw activemachinelearningdrivenexperimentationtodeterminecompoundeffectsonproteinpatterns
AT kangasjoshuad activemachinelearningdrivenexperimentationtodeterminecompoundeffectsonproteinpatterns
AT sullivandevinp activemachinelearningdrivenexperimentationtodeterminecompoundeffectsonproteinpatterns
AT murphyrobertf activemachinelearningdrivenexperimentationtodeterminecompoundeffectsonproteinpatterns