Cargando…

A deep learning framework for high-throughput mechanism-driven phenotype compound screening

Target-based high-throughput compound screening dominates conventional one-drug-one-gene drug discovery process. However, the readout from the chemical modulation of a single protein is poorly correlated with phenotypic response of organism, leading to high failure rate in drug development. Chemical...

Descripción completa

Detalles Bibliográficos
Autores principales: Pham, Thai-Hoang, Qiu, Yue, Zeng, Jucheng, Xie, Lei, Zhang, Ping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7386506/
https://www.ncbi.nlm.nih.gov/pubmed/32743586
http://dx.doi.org/10.1101/2020.07.19.211235
_version_ 1783563960611504128
author Pham, Thai-Hoang
Qiu, Yue
Zeng, Jucheng
Xie, Lei
Zhang, Ping
author_facet Pham, Thai-Hoang
Qiu, Yue
Zeng, Jucheng
Xie, Lei
Zhang, Ping
author_sort Pham, Thai-Hoang
collection PubMed
description Target-based high-throughput compound screening dominates conventional one-drug-one-gene drug discovery process. However, the readout from the chemical modulation of a single protein is poorly correlated with phenotypic response of organism, leading to high failure rate in drug development. Chemical-induced gene expression profile provides an attractive solution to phenotype-based screening. However, the use of such data is currently limited by their sparseness, unreliability, and relatively low throughput. Several methods have been proposed to impute missing values for gene expression datasets. However, few existing methods can perform de novo chemical compound screening. In this study, we propose a mechanism-driven neural network-based method named DeepCE (Deep Chemical Expression) which utilizes graph convolutional neural network to learn chemical representation and multi-head attention mechanism to model chemical substructure-gene and gene-gene feature associations. In addition, we propose a novel data augmentation method which extracts useful information from unreliable experiments in L1000 dataset. The experimental results show that DeepCE achieves the superior performances not only in de novo chemical setting but also in traditional imputation setting compared to state-of-the-art baselines for the prediction of chemical-induced gene expression. We further verify the effectiveness of gene expression profiles generated from DeepCE by comparing them with gene expression profiles in L1000 dataset for downstream classification tasks including drug-target and disease predictions. To demonstrate the value of DeepCE, we apply it to patient-specific drug repurposing of COVID-19 for the first time, and generate novel lead compounds consistent with clinical evidences. Thus, DeepCE provides a potentially powerful framework for robust predictive modeling by utilizing noisy omics data as well as screening novel chemicals for the modulation of systemic response to disease.
format Online
Article
Text
id pubmed-7386506
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-73865062020-07-31 A deep learning framework for high-throughput mechanism-driven phenotype compound screening Pham, Thai-Hoang Qiu, Yue Zeng, Jucheng Xie, Lei Zhang, Ping bioRxiv Article Target-based high-throughput compound screening dominates conventional one-drug-one-gene drug discovery process. However, the readout from the chemical modulation of a single protein is poorly correlated with phenotypic response of organism, leading to high failure rate in drug development. Chemical-induced gene expression profile provides an attractive solution to phenotype-based screening. However, the use of such data is currently limited by their sparseness, unreliability, and relatively low throughput. Several methods have been proposed to impute missing values for gene expression datasets. However, few existing methods can perform de novo chemical compound screening. In this study, we propose a mechanism-driven neural network-based method named DeepCE (Deep Chemical Expression) which utilizes graph convolutional neural network to learn chemical representation and multi-head attention mechanism to model chemical substructure-gene and gene-gene feature associations. In addition, we propose a novel data augmentation method which extracts useful information from unreliable experiments in L1000 dataset. The experimental results show that DeepCE achieves the superior performances not only in de novo chemical setting but also in traditional imputation setting compared to state-of-the-art baselines for the prediction of chemical-induced gene expression. We further verify the effectiveness of gene expression profiles generated from DeepCE by comparing them with gene expression profiles in L1000 dataset for downstream classification tasks including drug-target and disease predictions. To demonstrate the value of DeepCE, we apply it to patient-specific drug repurposing of COVID-19 for the first time, and generate novel lead compounds consistent with clinical evidences. Thus, DeepCE provides a potentially powerful framework for robust predictive modeling by utilizing noisy omics data as well as screening novel chemicals for the modulation of systemic response to disease. Cold Spring Harbor Laboratory 2020-07-20 /pmc/articles/PMC7386506/ /pubmed/32743586 http://dx.doi.org/10.1101/2020.07.19.211235 Text en http://creativecommons.org/licenses/by-nd/4.0/It is made available under a CC-BY-ND 4.0 International license (http://creativecommons.org/licenses/by-nd/4.0/) .
spellingShingle Article
Pham, Thai-Hoang
Qiu, Yue
Zeng, Jucheng
Xie, Lei
Zhang, Ping
A deep learning framework for high-throughput mechanism-driven phenotype compound screening
title A deep learning framework for high-throughput mechanism-driven phenotype compound screening
title_full A deep learning framework for high-throughput mechanism-driven phenotype compound screening
title_fullStr A deep learning framework for high-throughput mechanism-driven phenotype compound screening
title_full_unstemmed A deep learning framework for high-throughput mechanism-driven phenotype compound screening
title_short A deep learning framework for high-throughput mechanism-driven phenotype compound screening
title_sort deep learning framework for high-throughput mechanism-driven phenotype compound screening
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7386506/
https://www.ncbi.nlm.nih.gov/pubmed/32743586
http://dx.doi.org/10.1101/2020.07.19.211235
work_keys_str_mv AT phamthaihoang adeeplearningframeworkforhighthroughputmechanismdrivenphenotypecompoundscreening
AT qiuyue adeeplearningframeworkforhighthroughputmechanismdrivenphenotypecompoundscreening
AT zengjucheng adeeplearningframeworkforhighthroughputmechanismdrivenphenotypecompoundscreening
AT xielei adeeplearningframeworkforhighthroughputmechanismdrivenphenotypecompoundscreening
AT zhangping adeeplearningframeworkforhighthroughputmechanismdrivenphenotypecompoundscreening
AT phamthaihoang deeplearningframeworkforhighthroughputmechanismdrivenphenotypecompoundscreening
AT qiuyue deeplearningframeworkforhighthroughputmechanismdrivenphenotypecompoundscreening
AT zengjucheng deeplearningframeworkforhighthroughputmechanismdrivenphenotypecompoundscreening
AT xielei deeplearningframeworkforhighthroughputmechanismdrivenphenotypecompoundscreening
AT zhangping deeplearningframeworkforhighthroughputmechanismdrivenphenotypecompoundscreening