Cargando…

Precise modelling and interpretation of bioactivities of ligands targeting G protein-coupled receptors

MOTIVATION: Accurate prediction and interpretation of ligand bioactivities are essential for virtual screening and drug discovery. Unfortunately, many important drug targets lack experimental data about the ligand bioactivities; this is particularly true for G protein-coupled receptors (GPCRs), whic...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Jiansheng, Liu, Ben, Chan, Wallace K B, Wu, Weijian, Pang, Tao, Hu, Haifeng, Yan, Shancheng, Ke, Xiaoyan, Zhang, Yang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612825/
https://www.ncbi.nlm.nih.gov/pubmed/31510691
http://dx.doi.org/10.1093/bioinformatics/btz336
_version_ 1783432945402380288
author Wu, Jiansheng
Liu, Ben
Chan, Wallace K B
Wu, Weijian
Pang, Tao
Hu, Haifeng
Yan, Shancheng
Ke, Xiaoyan
Zhang, Yang
author_facet Wu, Jiansheng
Liu, Ben
Chan, Wallace K B
Wu, Weijian
Pang, Tao
Hu, Haifeng
Yan, Shancheng
Ke, Xiaoyan
Zhang, Yang
author_sort Wu, Jiansheng
collection PubMed
description MOTIVATION: Accurate prediction and interpretation of ligand bioactivities are essential for virtual screening and drug discovery. Unfortunately, many important drug targets lack experimental data about the ligand bioactivities; this is particularly true for G protein-coupled receptors (GPCRs), which account for the targets of about a third of drugs currently on the market. Computational approaches with the potential of precise assessment of ligand bioactivities and determination of key substructural features which determine ligand bioactivities are needed to address this issue. RESULTS: A new method, SED, was proposed to predict ligand bioactivities and to recognize key substructures associated with GPCRs through the coupling of screening for Lasso of long extended-connectivity fingerprints (ECFPs) with deep neural network training. The SED pipeline contains three successive steps: (i) representation of long ECFPs for ligand molecules, (ii) feature selection by screening for Lasso of ECFPs and (iii) bioactivity prediction through a deep neural network regression model. The method was examined on a set of 16 representative GPCRs that cover most subfamilies of human GPCRs, where each has 300–5000 ligand associations. The results show that SED achieves excellent performance in modelling ligand bioactivities, especially for those in the GPCR datasets without sufficient ligand associations, where SED improved the baseline predictors by 12% in correlation coefficient (r(2)) and 19% in root mean square error. Detail data analyses suggest that the major advantage of SED lies on its ability to detect substructures from long ECFPs which significantly improves the predictive performance. AVAILABILITY AND IMPLEMENTATION: The source code and datasets of SED are freely available at https://zhanglab.ccmb.med.umich.edu/SED/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6612825
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-66128252019-07-12 Precise modelling and interpretation of bioactivities of ligands targeting G protein-coupled receptors Wu, Jiansheng Liu, Ben Chan, Wallace K B Wu, Weijian Pang, Tao Hu, Haifeng Yan, Shancheng Ke, Xiaoyan Zhang, Yang Bioinformatics Ismb/Eccb 2019 Conference Proceedings MOTIVATION: Accurate prediction and interpretation of ligand bioactivities are essential for virtual screening and drug discovery. Unfortunately, many important drug targets lack experimental data about the ligand bioactivities; this is particularly true for G protein-coupled receptors (GPCRs), which account for the targets of about a third of drugs currently on the market. Computational approaches with the potential of precise assessment of ligand bioactivities and determination of key substructural features which determine ligand bioactivities are needed to address this issue. RESULTS: A new method, SED, was proposed to predict ligand bioactivities and to recognize key substructures associated with GPCRs through the coupling of screening for Lasso of long extended-connectivity fingerprints (ECFPs) with deep neural network training. The SED pipeline contains three successive steps: (i) representation of long ECFPs for ligand molecules, (ii) feature selection by screening for Lasso of ECFPs and (iii) bioactivity prediction through a deep neural network regression model. The method was examined on a set of 16 representative GPCRs that cover most subfamilies of human GPCRs, where each has 300–5000 ligand associations. The results show that SED achieves excellent performance in modelling ligand bioactivities, especially for those in the GPCR datasets without sufficient ligand associations, where SED improved the baseline predictors by 12% in correlation coefficient (r(2)) and 19% in root mean square error. Detail data analyses suggest that the major advantage of SED lies on its ability to detect substructures from long ECFPs which significantly improves the predictive performance. AVAILABILITY AND IMPLEMENTATION: The source code and datasets of SED are freely available at https://zhanglab.ccmb.med.umich.edu/SED/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-07 2019-07-05 /pmc/articles/PMC6612825/ /pubmed/31510691 http://dx.doi.org/10.1093/bioinformatics/btz336 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb/Eccb 2019 Conference Proceedings
Wu, Jiansheng
Liu, Ben
Chan, Wallace K B
Wu, Weijian
Pang, Tao
Hu, Haifeng
Yan, Shancheng
Ke, Xiaoyan
Zhang, Yang
Precise modelling and interpretation of bioactivities of ligands targeting G protein-coupled receptors
title Precise modelling and interpretation of bioactivities of ligands targeting G protein-coupled receptors
title_full Precise modelling and interpretation of bioactivities of ligands targeting G protein-coupled receptors
title_fullStr Precise modelling and interpretation of bioactivities of ligands targeting G protein-coupled receptors
title_full_unstemmed Precise modelling and interpretation of bioactivities of ligands targeting G protein-coupled receptors
title_short Precise modelling and interpretation of bioactivities of ligands targeting G protein-coupled receptors
title_sort precise modelling and interpretation of bioactivities of ligands targeting g protein-coupled receptors
topic Ismb/Eccb 2019 Conference Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612825/
https://www.ncbi.nlm.nih.gov/pubmed/31510691
http://dx.doi.org/10.1093/bioinformatics/btz336
work_keys_str_mv AT wujiansheng precisemodellingandinterpretationofbioactivitiesofligandstargetinggproteincoupledreceptors
AT liuben precisemodellingandinterpretationofbioactivitiesofligandstargetinggproteincoupledreceptors
AT chanwallacekb precisemodellingandinterpretationofbioactivitiesofligandstargetinggproteincoupledreceptors
AT wuweijian precisemodellingandinterpretationofbioactivitiesofligandstargetinggproteincoupledreceptors
AT pangtao precisemodellingandinterpretationofbioactivitiesofligandstargetinggproteincoupledreceptors
AT huhaifeng precisemodellingandinterpretationofbioactivitiesofligandstargetinggproteincoupledreceptors
AT yanshancheng precisemodellingandinterpretationofbioactivitiesofligandstargetinggproteincoupledreceptors
AT kexiaoyan precisemodellingandinterpretationofbioactivitiesofligandstargetinggproteincoupledreceptors
AT zhangyang precisemodellingandinterpretationofbioactivitiesofligandstargetinggproteincoupledreceptors