Cargando…

Predicting Positive p53 Cancer Rescue Regions Using Most Informative Positive (MIP) Active Learning

Many protein engineering problems involve finding mutations that produce proteins with a particular function. Computational active learning is an attractive approach to discover desired biological activities. Traditional active learning techniques have been optimized to iteratively improve classifie...

Descripción completa

Detalles Bibliográficos
Autores principales: Danziger, Samuel A., Baronio, Roberta, Ho, Lydia, Hall, Linda, Salmon, Kirsty, Hatfield, G. Wesley, Kaiser, Peter, Lathrop, Richard H.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2742196/
https://www.ncbi.nlm.nih.gov/pubmed/19756158
http://dx.doi.org/10.1371/journal.pcbi.1000498
_version_ 1782171817243312128
author Danziger, Samuel A.
Baronio, Roberta
Ho, Lydia
Hall, Linda
Salmon, Kirsty
Hatfield, G. Wesley
Kaiser, Peter
Lathrop, Richard H.
author_facet Danziger, Samuel A.
Baronio, Roberta
Ho, Lydia
Hall, Linda
Salmon, Kirsty
Hatfield, G. Wesley
Kaiser, Peter
Lathrop, Richard H.
author_sort Danziger, Samuel A.
collection PubMed
description Many protein engineering problems involve finding mutations that produce proteins with a particular function. Computational active learning is an attractive approach to discover desired biological activities. Traditional active learning techniques have been optimized to iteratively improve classifier accuracy, not to quickly discover biologically significant results. We report here a novel active learning technique, Most Informative Positive (MIP), which is tailored to biological problems because it seeks novel and informative positive results. MIP active learning differs from traditional active learning methods in two ways: (1) it preferentially seeks Positive (functionally active) examples; and (2) it may be effectively extended to select gene regions suitable for high throughput combinatorial mutagenesis. We applied MIP to discover mutations in the tumor suppressor protein p53 that reactivate mutated p53 found in human cancers. This is an important biomedical goal because p53 mutants have been implicated in half of all human cancers, and restoring active p53 in tumors leads to tumor regression. MIP found Positive (cancer rescue) p53 mutants in silico using 33% fewer experiments than traditional non-MIP active learning, with only a minor decrease in classifier accuracy. Applying MIP to in vivo experimentation yielded immediate Positive results. Ten different p53 mutations found in human cancers were paired in silico with all possible single amino acid rescue mutations, from which MIP was used to select a Positive Region predicted to be enriched for p53 cancer rescue mutants. In vivo assays showed that the predicted Positive Region: (1) had significantly more (p<0.01) new strong cancer rescue mutants than control regions (Negative, and non-MIP active learning); (2) had slightly more new strong cancer rescue mutants than an Expert region selected for purely biological considerations; and (3) rescued for the first time the previously unrescuable p53 cancer mutant P152L.
format Text
id pubmed-2742196
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-27421962009-09-15 Predicting Positive p53 Cancer Rescue Regions Using Most Informative Positive (MIP) Active Learning Danziger, Samuel A. Baronio, Roberta Ho, Lydia Hall, Linda Salmon, Kirsty Hatfield, G. Wesley Kaiser, Peter Lathrop, Richard H. PLoS Comput Biol Research Article Many protein engineering problems involve finding mutations that produce proteins with a particular function. Computational active learning is an attractive approach to discover desired biological activities. Traditional active learning techniques have been optimized to iteratively improve classifier accuracy, not to quickly discover biologically significant results. We report here a novel active learning technique, Most Informative Positive (MIP), which is tailored to biological problems because it seeks novel and informative positive results. MIP active learning differs from traditional active learning methods in two ways: (1) it preferentially seeks Positive (functionally active) examples; and (2) it may be effectively extended to select gene regions suitable for high throughput combinatorial mutagenesis. We applied MIP to discover mutations in the tumor suppressor protein p53 that reactivate mutated p53 found in human cancers. This is an important biomedical goal because p53 mutants have been implicated in half of all human cancers, and restoring active p53 in tumors leads to tumor regression. MIP found Positive (cancer rescue) p53 mutants in silico using 33% fewer experiments than traditional non-MIP active learning, with only a minor decrease in classifier accuracy. Applying MIP to in vivo experimentation yielded immediate Positive results. Ten different p53 mutations found in human cancers were paired in silico with all possible single amino acid rescue mutations, from which MIP was used to select a Positive Region predicted to be enriched for p53 cancer rescue mutants. In vivo assays showed that the predicted Positive Region: (1) had significantly more (p<0.01) new strong cancer rescue mutants than control regions (Negative, and non-MIP active learning); (2) had slightly more new strong cancer rescue mutants than an Expert region selected for purely biological considerations; and (3) rescued for the first time the previously unrescuable p53 cancer mutant P152L. Public Library of Science 2009-09-04 /pmc/articles/PMC2742196/ /pubmed/19756158 http://dx.doi.org/10.1371/journal.pcbi.1000498 Text en Danziger et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Danziger, Samuel A.
Baronio, Roberta
Ho, Lydia
Hall, Linda
Salmon, Kirsty
Hatfield, G. Wesley
Kaiser, Peter
Lathrop, Richard H.
Predicting Positive p53 Cancer Rescue Regions Using Most Informative Positive (MIP) Active Learning
title Predicting Positive p53 Cancer Rescue Regions Using Most Informative Positive (MIP) Active Learning
title_full Predicting Positive p53 Cancer Rescue Regions Using Most Informative Positive (MIP) Active Learning
title_fullStr Predicting Positive p53 Cancer Rescue Regions Using Most Informative Positive (MIP) Active Learning
title_full_unstemmed Predicting Positive p53 Cancer Rescue Regions Using Most Informative Positive (MIP) Active Learning
title_short Predicting Positive p53 Cancer Rescue Regions Using Most Informative Positive (MIP) Active Learning
title_sort predicting positive p53 cancer rescue regions using most informative positive (mip) active learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2742196/
https://www.ncbi.nlm.nih.gov/pubmed/19756158
http://dx.doi.org/10.1371/journal.pcbi.1000498
work_keys_str_mv AT danzigersamuela predictingpositivep53cancerrescueregionsusingmostinformativepositivemipactivelearning
AT baronioroberta predictingpositivep53cancerrescueregionsusingmostinformativepositivemipactivelearning
AT holydia predictingpositivep53cancerrescueregionsusingmostinformativepositivemipactivelearning
AT halllinda predictingpositivep53cancerrescueregionsusingmostinformativepositivemipactivelearning
AT salmonkirsty predictingpositivep53cancerrescueregionsusingmostinformativepositivemipactivelearning
AT hatfieldgwesley predictingpositivep53cancerrescueregionsusingmostinformativepositivemipactivelearning
AT kaiserpeter predictingpositivep53cancerrescueregionsusingmostinformativepositivemipactivelearning
AT lathroprichardh predictingpositivep53cancerrescueregionsusingmostinformativepositivemipactivelearning