Cargando…

Exploration of Dark Chemical Genomics Space via Portal Learning: Applied to Targeting the Undruggable Genome and COVID-19 Anti-Infective Polypharmacology

Advances in biomedicine are largely fueled by exploring uncharted territories of human biology. Machine learning can both enable and accelerate discovery, but faces a fundamental hurdle when applied to unseen data with distributions that differ from previously observed ones—a common dilemma in scien...

Descripción completa

Detalles Bibliográficos
Autores principales: Cai, Tian, Xie, Li, Chen, Muge, Liu, Yang, He, Di, Zhang, Shuo, Mura, Cameron, Bourne, Philip E., Xie, Lei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Journal Experts 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8647653/
https://www.ncbi.nlm.nih.gov/pubmed/34873596
http://dx.doi.org/10.21203/rs.3.rs-1109318/v1
_version_ 1784610645283438592
author Cai, Tian
Xie, Li
Chen, Muge
Liu, Yang
He, Di
Zhang, Shuo
Mura, Cameron
Bourne, Philip E.
Xie, Lei
author_facet Cai, Tian
Xie, Li
Chen, Muge
Liu, Yang
He, Di
Zhang, Shuo
Mura, Cameron
Bourne, Philip E.
Xie, Lei
author_sort Cai, Tian
collection PubMed
description Advances in biomedicine are largely fueled by exploring uncharted territories of human biology. Machine learning can both enable and accelerate discovery, but faces a fundamental hurdle when applied to unseen data with distributions that differ from previously observed ones—a common dilemma in scientific inquiry. We have developed a new deep learning framework, called Portal Learning, to explore dark chemical and biological space. Three key, novel components of our approach include: (i) end-to-end, step-wise transfer learning, in recognition of biology’s sequence-structure-function paradigm, (ii) out-of-cluster meta-learning, and (iii) stress model selection. Portal Learning provides a practical solution to the out-of-distribution (OOD) problem in statistical machine learning. Here, we have implemented Portal Learning to predict chemical-protein interactions on a genome-wide scale. Systematic studies demonstrate that Portal Learning can effectively assign ligands to unexplored gene families (unknown functions), versus existing state-of-the-art methods. Compared with AlphaFold2-based protein-ligand docking, Portal Learning significantly improved the performance by 79% in PR-AUC and 27% in ROC-AUC, respectively. The superior performance of Portal Learning allowed us to target previously “undruggable” proteins and design novel polypharmacological agents for disrupting interactions between SARS-CoV-2 and human proteins. Portal Learning is general-purpose and can be further applied to other areas of scientific inquiry.
format Online
Article
Text
id pubmed-8647653
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Journal Experts
record_format MEDLINE/PubMed
spelling pubmed-86476532021-12-07 Exploration of Dark Chemical Genomics Space via Portal Learning: Applied to Targeting the Undruggable Genome and COVID-19 Anti-Infective Polypharmacology Cai, Tian Xie, Li Chen, Muge Liu, Yang He, Di Zhang, Shuo Mura, Cameron Bourne, Philip E. Xie, Lei Res Sq Article Advances in biomedicine are largely fueled by exploring uncharted territories of human biology. Machine learning can both enable and accelerate discovery, but faces a fundamental hurdle when applied to unseen data with distributions that differ from previously observed ones—a common dilemma in scientific inquiry. We have developed a new deep learning framework, called Portal Learning, to explore dark chemical and biological space. Three key, novel components of our approach include: (i) end-to-end, step-wise transfer learning, in recognition of biology’s sequence-structure-function paradigm, (ii) out-of-cluster meta-learning, and (iii) stress model selection. Portal Learning provides a practical solution to the out-of-distribution (OOD) problem in statistical machine learning. Here, we have implemented Portal Learning to predict chemical-protein interactions on a genome-wide scale. Systematic studies demonstrate that Portal Learning can effectively assign ligands to unexplored gene families (unknown functions), versus existing state-of-the-art methods. Compared with AlphaFold2-based protein-ligand docking, Portal Learning significantly improved the performance by 79% in PR-AUC and 27% in ROC-AUC, respectively. The superior performance of Portal Learning allowed us to target previously “undruggable” proteins and design novel polypharmacological agents for disrupting interactions between SARS-CoV-2 and human proteins. Portal Learning is general-purpose and can be further applied to other areas of scientific inquiry. American Journal Experts 2021-12-01 /pmc/articles/PMC8647653/ /pubmed/34873596 http://dx.doi.org/10.21203/rs.3.rs-1109318/v1 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use. https://creativecommons.org/licenses/by/4.0/License: This work is licensed under a Creative Commons Attribution 4.0 International License. Read Full License (https://creativecommons.org/licenses/by/4.0/)
spellingShingle Article
Cai, Tian
Xie, Li
Chen, Muge
Liu, Yang
He, Di
Zhang, Shuo
Mura, Cameron
Bourne, Philip E.
Xie, Lei
Exploration of Dark Chemical Genomics Space via Portal Learning: Applied to Targeting the Undruggable Genome and COVID-19 Anti-Infective Polypharmacology
title Exploration of Dark Chemical Genomics Space via Portal Learning: Applied to Targeting the Undruggable Genome and COVID-19 Anti-Infective Polypharmacology
title_full Exploration of Dark Chemical Genomics Space via Portal Learning: Applied to Targeting the Undruggable Genome and COVID-19 Anti-Infective Polypharmacology
title_fullStr Exploration of Dark Chemical Genomics Space via Portal Learning: Applied to Targeting the Undruggable Genome and COVID-19 Anti-Infective Polypharmacology
title_full_unstemmed Exploration of Dark Chemical Genomics Space via Portal Learning: Applied to Targeting the Undruggable Genome and COVID-19 Anti-Infective Polypharmacology
title_short Exploration of Dark Chemical Genomics Space via Portal Learning: Applied to Targeting the Undruggable Genome and COVID-19 Anti-Infective Polypharmacology
title_sort exploration of dark chemical genomics space via portal learning: applied to targeting the undruggable genome and covid-19 anti-infective polypharmacology
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8647653/
https://www.ncbi.nlm.nih.gov/pubmed/34873596
http://dx.doi.org/10.21203/rs.3.rs-1109318/v1
work_keys_str_mv AT caitian explorationofdarkchemicalgenomicsspaceviaportallearningappliedtotargetingtheundruggablegenomeandcovid19antiinfectivepolypharmacology
AT xieli explorationofdarkchemicalgenomicsspaceviaportallearningappliedtotargetingtheundruggablegenomeandcovid19antiinfectivepolypharmacology
AT chenmuge explorationofdarkchemicalgenomicsspaceviaportallearningappliedtotargetingtheundruggablegenomeandcovid19antiinfectivepolypharmacology
AT liuyang explorationofdarkchemicalgenomicsspaceviaportallearningappliedtotargetingtheundruggablegenomeandcovid19antiinfectivepolypharmacology
AT hedi explorationofdarkchemicalgenomicsspaceviaportallearningappliedtotargetingtheundruggablegenomeandcovid19antiinfectivepolypharmacology
AT zhangshuo explorationofdarkchemicalgenomicsspaceviaportallearningappliedtotargetingtheundruggablegenomeandcovid19antiinfectivepolypharmacology
AT muracameron explorationofdarkchemicalgenomicsspaceviaportallearningappliedtotargetingtheundruggablegenomeandcovid19antiinfectivepolypharmacology
AT bournephilipe explorationofdarkchemicalgenomicsspaceviaportallearningappliedtotargetingtheundruggablegenomeandcovid19antiinfectivepolypharmacology
AT xielei explorationofdarkchemicalgenomicsspaceviaportallearningappliedtotargetingtheundruggablegenomeandcovid19antiinfectivepolypharmacology