Cargando…

Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation

Targeted protein degradation (TPD) has rapidly emerged as a therapeutic modality to eliminate previously undruggable proteins by repurposing the cell’s endogenous protein degradation machinery. However, the susceptibility of proteins for targeting by TPD approaches, termed “degradability”, is largel...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Wubing, Roy Burman, Shourya S., Chen, Jiaye, Donovan, Katherine A., Cao, Yang, Shu, Chelsea, Zhang, Boning, Zeng, Zexian, Gu, Shengqing, Zhang, Yi, Li, Dian, Fischer, Eric S., Tokheim, Collin, Shirley Liu, X.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10025769/
https://www.ncbi.nlm.nih.gov/pubmed/36494034
http://dx.doi.org/10.1016/j.gpb.2022.11.008
_version_ 1784909408383270912
author Zhang, Wubing
Roy Burman, Shourya S.
Chen, Jiaye
Donovan, Katherine A.
Cao, Yang
Shu, Chelsea
Zhang, Boning
Zeng, Zexian
Gu, Shengqing
Zhang, Yi
Li, Dian
Fischer, Eric S.
Tokheim, Collin
Shirley Liu, X.
author_facet Zhang, Wubing
Roy Burman, Shourya S.
Chen, Jiaye
Donovan, Katherine A.
Cao, Yang
Shu, Chelsea
Zhang, Boning
Zeng, Zexian
Gu, Shengqing
Zhang, Yi
Li, Dian
Fischer, Eric S.
Tokheim, Collin
Shirley Liu, X.
author_sort Zhang, Wubing
collection PubMed
description Targeted protein degradation (TPD) has rapidly emerged as a therapeutic modality to eliminate previously undruggable proteins by repurposing the cell’s endogenous protein degradation machinery. However, the susceptibility of proteins for targeting by TPD approaches, termed “degradability”, is largely unknown. Here, we developed a machine learning model, model-free analysis of protein degradability (MAPD), to predict degradability from features intrinsic to protein targets. MAPD shows accurate performance in predicting kinases that are degradable by TPD compounds [with an area under the precision–recall curve (AUPRC) of 0.759 and an area under the receiver operating characteristic curve (AUROC) of 0.775] and is likely generalizable to independent non-kinase proteins. We found five features with statistical significance to achieve optimal prediction, with ubiquitination potential being the most predictive. By structural modeling, we found that E2-accessible ubiquitination sites, but not lysine residues in general, are particularly associated with kinase degradability. Finally, we extended MAPD predictions to the entire proteome to find 964 disease-causing proteins (including proteins encoded by 278 cancer genes) that may be tractable to TPD drug development.
format Online
Article
Text
id pubmed-10025769
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-100257692023-03-21 Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation Zhang, Wubing Roy Burman, Shourya S. Chen, Jiaye Donovan, Katherine A. Cao, Yang Shu, Chelsea Zhang, Boning Zeng, Zexian Gu, Shengqing Zhang, Yi Li, Dian Fischer, Eric S. Tokheim, Collin Shirley Liu, X. Genomics Proteomics Bioinformatics Original Research Targeted protein degradation (TPD) has rapidly emerged as a therapeutic modality to eliminate previously undruggable proteins by repurposing the cell’s endogenous protein degradation machinery. However, the susceptibility of proteins for targeting by TPD approaches, termed “degradability”, is largely unknown. Here, we developed a machine learning model, model-free analysis of protein degradability (MAPD), to predict degradability from features intrinsic to protein targets. MAPD shows accurate performance in predicting kinases that are degradable by TPD compounds [with an area under the precision–recall curve (AUPRC) of 0.759 and an area under the receiver operating characteristic curve (AUROC) of 0.775] and is likely generalizable to independent non-kinase proteins. We found five features with statistical significance to achieve optimal prediction, with ubiquitination potential being the most predictive. By structural modeling, we found that E2-accessible ubiquitination sites, but not lysine residues in general, are particularly associated with kinase degradability. Finally, we extended MAPD predictions to the entire proteome to find 964 disease-causing proteins (including proteins encoded by 278 cancer genes) that may be tractable to TPD drug development. Elsevier 2022-10 2022-12-06 /pmc/articles/PMC10025769/ /pubmed/36494034 http://dx.doi.org/10.1016/j.gpb.2022.11.008 Text en © 2022 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Original Research
Zhang, Wubing
Roy Burman, Shourya S.
Chen, Jiaye
Donovan, Katherine A.
Cao, Yang
Shu, Chelsea
Zhang, Boning
Zeng, Zexian
Gu, Shengqing
Zhang, Yi
Li, Dian
Fischer, Eric S.
Tokheim, Collin
Shirley Liu, X.
Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation
title Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation
title_full Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation
title_fullStr Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation
title_full_unstemmed Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation
title_short Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation
title_sort machine learning modeling of protein-intrinsic features predicts tractability of targeted protein degradation
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10025769/
https://www.ncbi.nlm.nih.gov/pubmed/36494034
http://dx.doi.org/10.1016/j.gpb.2022.11.008
work_keys_str_mv AT zhangwubing machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation
AT royburmanshouryas machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation
AT chenjiaye machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation
AT donovankatherinea machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation
AT caoyang machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation
AT shuchelsea machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation
AT zhangboning machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation
AT zengzexian machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation
AT gushengqing machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation
AT zhangyi machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation
AT lidian machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation
AT fischererics machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation
AT tokheimcollin machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation
AT shirleyliux machinelearningmodelingofproteinintrinsicfeaturespredictstractabilityoftargetedproteindegradation