Cargando…
A deep learning framework for high-throughput mechanism-driven phenotype compound screening
Target-based high-throughput compound screening dominates conventional one-drug-one-gene drug discovery process. However, the readout from the chemical modulation of a single protein is poorly correlated with phenotypic response of organism, leading to high failure rate in drug development. Chemical...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7386506/ https://www.ncbi.nlm.nih.gov/pubmed/32743586 http://dx.doi.org/10.1101/2020.07.19.211235 |
_version_ | 1783563960611504128 |
---|---|
author | Pham, Thai-Hoang Qiu, Yue Zeng, Jucheng Xie, Lei Zhang, Ping |
author_facet | Pham, Thai-Hoang Qiu, Yue Zeng, Jucheng Xie, Lei Zhang, Ping |
author_sort | Pham, Thai-Hoang |
collection | PubMed |
description | Target-based high-throughput compound screening dominates conventional one-drug-one-gene drug discovery process. However, the readout from the chemical modulation of a single protein is poorly correlated with phenotypic response of organism, leading to high failure rate in drug development. Chemical-induced gene expression profile provides an attractive solution to phenotype-based screening. However, the use of such data is currently limited by their sparseness, unreliability, and relatively low throughput. Several methods have been proposed to impute missing values for gene expression datasets. However, few existing methods can perform de novo chemical compound screening. In this study, we propose a mechanism-driven neural network-based method named DeepCE (Deep Chemical Expression) which utilizes graph convolutional neural network to learn chemical representation and multi-head attention mechanism to model chemical substructure-gene and gene-gene feature associations. In addition, we propose a novel data augmentation method which extracts useful information from unreliable experiments in L1000 dataset. The experimental results show that DeepCE achieves the superior performances not only in de novo chemical setting but also in traditional imputation setting compared to state-of-the-art baselines for the prediction of chemical-induced gene expression. We further verify the effectiveness of gene expression profiles generated from DeepCE by comparing them with gene expression profiles in L1000 dataset for downstream classification tasks including drug-target and disease predictions. To demonstrate the value of DeepCE, we apply it to patient-specific drug repurposing of COVID-19 for the first time, and generate novel lead compounds consistent with clinical evidences. Thus, DeepCE provides a potentially powerful framework for robust predictive modeling by utilizing noisy omics data as well as screening novel chemicals for the modulation of systemic response to disease. |
format | Online Article Text |
id | pubmed-7386506 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-73865062020-07-31 A deep learning framework for high-throughput mechanism-driven phenotype compound screening Pham, Thai-Hoang Qiu, Yue Zeng, Jucheng Xie, Lei Zhang, Ping bioRxiv Article Target-based high-throughput compound screening dominates conventional one-drug-one-gene drug discovery process. However, the readout from the chemical modulation of a single protein is poorly correlated with phenotypic response of organism, leading to high failure rate in drug development. Chemical-induced gene expression profile provides an attractive solution to phenotype-based screening. However, the use of such data is currently limited by their sparseness, unreliability, and relatively low throughput. Several methods have been proposed to impute missing values for gene expression datasets. However, few existing methods can perform de novo chemical compound screening. In this study, we propose a mechanism-driven neural network-based method named DeepCE (Deep Chemical Expression) which utilizes graph convolutional neural network to learn chemical representation and multi-head attention mechanism to model chemical substructure-gene and gene-gene feature associations. In addition, we propose a novel data augmentation method which extracts useful information from unreliable experiments in L1000 dataset. The experimental results show that DeepCE achieves the superior performances not only in de novo chemical setting but also in traditional imputation setting compared to state-of-the-art baselines for the prediction of chemical-induced gene expression. We further verify the effectiveness of gene expression profiles generated from DeepCE by comparing them with gene expression profiles in L1000 dataset for downstream classification tasks including drug-target and disease predictions. To demonstrate the value of DeepCE, we apply it to patient-specific drug repurposing of COVID-19 for the first time, and generate novel lead compounds consistent with clinical evidences. Thus, DeepCE provides a potentially powerful framework for robust predictive modeling by utilizing noisy omics data as well as screening novel chemicals for the modulation of systemic response to disease. Cold Spring Harbor Laboratory 2020-07-20 /pmc/articles/PMC7386506/ /pubmed/32743586 http://dx.doi.org/10.1101/2020.07.19.211235 Text en http://creativecommons.org/licenses/by-nd/4.0/It is made available under a CC-BY-ND 4.0 International license (http://creativecommons.org/licenses/by-nd/4.0/) . |
spellingShingle | Article Pham, Thai-Hoang Qiu, Yue Zeng, Jucheng Xie, Lei Zhang, Ping A deep learning framework for high-throughput mechanism-driven phenotype compound screening |
title | A deep learning framework for high-throughput mechanism-driven phenotype compound screening |
title_full | A deep learning framework for high-throughput mechanism-driven phenotype compound screening |
title_fullStr | A deep learning framework for high-throughput mechanism-driven phenotype compound screening |
title_full_unstemmed | A deep learning framework for high-throughput mechanism-driven phenotype compound screening |
title_short | A deep learning framework for high-throughput mechanism-driven phenotype compound screening |
title_sort | deep learning framework for high-throughput mechanism-driven phenotype compound screening |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7386506/ https://www.ncbi.nlm.nih.gov/pubmed/32743586 http://dx.doi.org/10.1101/2020.07.19.211235 |
work_keys_str_mv | AT phamthaihoang adeeplearningframeworkforhighthroughputmechanismdrivenphenotypecompoundscreening AT qiuyue adeeplearningframeworkforhighthroughputmechanismdrivenphenotypecompoundscreening AT zengjucheng adeeplearningframeworkforhighthroughputmechanismdrivenphenotypecompoundscreening AT xielei adeeplearningframeworkforhighthroughputmechanismdrivenphenotypecompoundscreening AT zhangping adeeplearningframeworkforhighthroughputmechanismdrivenphenotypecompoundscreening AT phamthaihoang deeplearningframeworkforhighthroughputmechanismdrivenphenotypecompoundscreening AT qiuyue deeplearningframeworkforhighthroughputmechanismdrivenphenotypecompoundscreening AT zengjucheng deeplearningframeworkforhighthroughputmechanismdrivenphenotypecompoundscreening AT xielei deeplearningframeworkforhighthroughputmechanismdrivenphenotypecompoundscreening AT zhangping deeplearningframeworkforhighthroughputmechanismdrivenphenotypecompoundscreening |