Cargando…
Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation
Targeted protein degradation (TPD) has rapidly emerged as a therapeutic modality to eliminate previously undruggable proteins by repurposing the cell’s endogenous protein degradation machinery. However, the susceptibility of proteins for targeting by TPD approaches, termed “degradability”, is largel...
Autores principales: | , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10025769/ https://www.ncbi.nlm.nih.gov/pubmed/36494034 http://dx.doi.org/10.1016/j.gpb.2022.11.008 |
_version_ | 1784909408383270912 |
---|---|
author | Zhang, Wubing Roy Burman, Shourya S. Chen, Jiaye Donovan, Katherine A. Cao, Yang Shu, Chelsea Zhang, Boning Zeng, Zexian Gu, Shengqing Zhang, Yi Li, Dian Fischer, Eric S. Tokheim, Collin Shirley Liu, X. |
author_facet | Zhang, Wubing Roy Burman, Shourya S. Chen, Jiaye Donovan, Katherine A. Cao, Yang Shu, Chelsea Zhang, Boning Zeng, Zexian Gu, Shengqing Zhang, Yi Li, Dian Fischer, Eric S. Tokheim, Collin Shirley Liu, X. |
author_sort | Zhang, Wubing |
collection | PubMed |
description | Targeted protein degradation (TPD) has rapidly emerged as a therapeutic modality to eliminate previously undruggable proteins by repurposing the cell’s endogenous protein degradation machinery. However, the susceptibility of proteins for targeting by TPD approaches, termed “degradability”, is largely unknown. Here, we developed a machine learning model, model-free analysis of protein degradability (MAPD), to predict degradability from features intrinsic to protein targets. MAPD shows accurate performance in predicting kinases that are degradable by TPD compounds [with an area under the precision–recall curve (AUPRC) of 0.759 and an area under the receiver operating characteristic curve (AUROC) of 0.775] and is likely generalizable to independent non-kinase proteins. We found five features with statistical significance to achieve optimal prediction, with ubiquitination potential being the most predictive. By structural modeling, we found that E2-accessible ubiquitination sites, but not lysine residues in general, are particularly associated with kinase degradability. Finally, we extended MAPD predictions to the entire proteome to find 964 disease-causing proteins (including proteins encoded by 278 cancer genes) that may be tractable to TPD drug development. |
format | Online Article Text |
id | pubmed-10025769 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-100257692023-03-21 Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation Zhang, Wubing Roy Burman, Shourya S. Chen, Jiaye Donovan, Katherine A. Cao, Yang Shu, Chelsea Zhang, Boning Zeng, Zexian Gu, Shengqing Zhang, Yi Li, Dian Fischer, Eric S. Tokheim, Collin Shirley Liu, X. Genomics Proteomics Bioinformatics Original Research Targeted protein degradation (TPD) has rapidly emerged as a therapeutic modality to eliminate previously undruggable proteins by repurposing the cell’s endogenous protein degradation machinery. However, the susceptibility of proteins for targeting by TPD approaches, termed “degradability”, is largely unknown. Here, we developed a machine learning model, model-free analysis of protein degradability (MAPD), to predict degradability from features intrinsic to protein targets. MAPD shows accurate performance in predicting kinases that are degradable by TPD compounds [with an area under the precision–recall curve (AUPRC) of 0.759 and an area under the receiver operating characteristic curve (AUROC) of 0.775] and is likely generalizable to independent non-kinase proteins. We found five features with statistical significance to achieve optimal prediction, with ubiquitination potential being the most predictive. By structural modeling, we found that E2-accessible ubiquitination sites, but not lysine residues in general, are particularly associated with kinase degradability. Finally, we extended MAPD predictions to the entire proteome to find 964 disease-causing proteins (including proteins encoded by 278 cancer genes) that may be tractable to TPD drug development. Elsevier 2022-10 2022-12-06 /pmc/articles/PMC10025769/ /pubmed/36494034 http://dx.doi.org/10.1016/j.gpb.2022.11.008 Text en © 2022 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Original Research Zhang, Wubing Roy Burman, Shourya S. Chen, Jiaye Donovan, Katherine A. Cao, Yang Shu, Chelsea Zhang, Boning Zeng, Zexian Gu, Shengqing Zhang, Yi Li, Dian Fischer, Eric S. Tokheim, Collin Shirley Liu, X. Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation |
title | Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation |
title_full | Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation |
title_fullStr | Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation |
title_full_unstemmed | Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation |
title_short | Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation |
title_sort | machine learning modeling of protein-intrinsic features predicts tractability of targeted protein degradation |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10025769/ https://www.ncbi.nlm.nih.gov/pubmed/36494034 http://dx.doi.org/10.1016/j.gpb.2022.11.008 |
work_keys_str_mv | AT zhangwubing machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation AT royburmanshouryas machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation AT chenjiaye machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation AT donovankatherinea machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation AT caoyang machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation AT shuchelsea machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation AT zhangboning machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation AT zengzexian machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation AT gushengqing machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation AT zhangyi machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation AT lidian machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation AT fischererics machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation AT tokheimcollin machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation AT shirleyliux machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation |