Precise modelling and interpretation of bioactivities of ligands targeting G protein-coupled receptors
MOTIVATION: Accurate prediction and interpretation of ligand bioactivities are essential for virtual screening and drug discovery. Unfortunately, many important drug targets lack experimental data about the ligand bioactivities; this is particularly true for G protein-coupled receptors (GPCRs), whic...
Main authors: | Wu, Jiansheng; Liu, Ben; Chan, Wallace K B; Wu, Weijian; Pang, Tao; Hu, Haifeng; Yan, Shancheng; Ke, Xiaoyan; Zhang, Yang |
---|---|
Format: | Online Article Text |
Language: | English |
Published: |
Oxford University Press
2019
|
Subjects: | Ismb/Eccb 2019 Conference Proceedings |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612825/ https://www.ncbi.nlm.nih.gov/pubmed/31510691 http://dx.doi.org/10.1093/bioinformatics/btz336 |
_version_ | 1783432945402380288 |
---|---|
author | Wu, Jiansheng Liu, Ben Chan, Wallace K B Wu, Weijian Pang, Tao Hu, Haifeng Yan, Shancheng Ke, Xiaoyan Zhang, Yang |
author_facet | Wu, Jiansheng Liu, Ben Chan, Wallace K B Wu, Weijian Pang, Tao Hu, Haifeng Yan, Shancheng Ke, Xiaoyan Zhang, Yang |
author_sort | Wu, Jiansheng |
collection | PubMed |
description | MOTIVATION: Accurate prediction and interpretation of ligand bioactivities are essential for virtual screening and drug discovery. Unfortunately, many important drug targets lack experimental data about the ligand bioactivities; this is particularly true for G protein-coupled receptors (GPCRs), which account for the targets of about a third of drugs currently on the market. Computational approaches with the potential of precise assessment of ligand bioactivities and determination of key substructural features which determine ligand bioactivities are needed to address this issue. RESULTS: A new method, SED, was proposed to predict ligand bioactivities and to recognize key substructures associated with GPCRs through the coupling of screening for Lasso of long extended-connectivity fingerprints (ECFPs) with deep neural network training. The SED pipeline contains three successive steps: (i) representation of long ECFPs for ligand molecules, (ii) feature selection by screening for Lasso of ECFPs and (iii) bioactivity prediction through a deep neural network regression model. The method was examined on a set of 16 representative GPCRs that cover most subfamilies of human GPCRs, where each has 300–5000 ligand associations. The results show that SED achieves excellent performance in modelling ligand bioactivities, especially for those in the GPCR datasets without sufficient ligand associations, where SED improved the baseline predictors by 12% in correlation coefficient (r²) and 19% in root mean square error. Detailed data analyses suggest that the major advantage of SED lies in its ability to detect substructures from long ECFPs, which significantly improves the predictive performance. AVAILABILITY AND IMPLEMENTATION: The source code and datasets of SED are freely available at https://zhanglab.ccmb.med.umich.edu/SED/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-6612825 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-66128252019-07-12 Precise modelling and interpretation of bioactivities of ligands targeting G protein-coupled receptors Wu, Jiansheng Liu, Ben Chan, Wallace K B Wu, Weijian Pang, Tao Hu, Haifeng Yan, Shancheng Ke, Xiaoyan Zhang, Yang Bioinformatics Ismb/Eccb 2019 Conference Proceedings MOTIVATION: Accurate prediction and interpretation of ligand bioactivities are essential for virtual screening and drug discovery. Unfortunately, many important drug targets lack experimental data about the ligand bioactivities; this is particularly true for G protein-coupled receptors (GPCRs), which account for the targets of about a third of drugs currently on the market. Computational approaches with the potential of precise assessment of ligand bioactivities and determination of key substructural features which determine ligand bioactivities are needed to address this issue. RESULTS: A new method, SED, was proposed to predict ligand bioactivities and to recognize key substructures associated with GPCRs through the coupling of screening for Lasso of long extended-connectivity fingerprints (ECFPs) with deep neural network training. The SED pipeline contains three successive steps: (i) representation of long ECFPs for ligand molecules, (ii) feature selection by screening for Lasso of ECFPs and (iii) bioactivity prediction through a deep neural network regression model. The method was examined on a set of 16 representative GPCRs that cover most subfamilies of human GPCRs, where each has 300–5000 ligand associations. The results show that SED achieves excellent performance in modelling ligand bioactivities, especially for those in the GPCR datasets without sufficient ligand associations, where SED improved the baseline predictors by 12% in correlation coefficient (r²) and 19% in root mean square error. Detailed data analyses suggest that the major advantage of SED lies in its ability to detect substructures from long ECFPs, which significantly improves the predictive performance. AVAILABILITY AND IMPLEMENTATION: The source code and datasets of SED are freely available at https://zhanglab.ccmb.med.umich.edu/SED/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-07 2019-07-05 /pmc/articles/PMC6612825/ /pubmed/31510691 http://dx.doi.org/10.1093/bioinformatics/btz336 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Ismb/Eccb 2019 Conference Proceedings Wu, Jiansheng Liu, Ben Chan, Wallace K B Wu, Weijian Pang, Tao Hu, Haifeng Yan, Shancheng Ke, Xiaoyan Zhang, Yang Precise modelling and interpretation of bioactivities of ligands targeting G protein-coupled receptors |
title | Precise modelling and interpretation of bioactivities of ligands targeting G protein-coupled receptors |
title_full | Precise modelling and interpretation of bioactivities of ligands targeting G protein-coupled receptors |
title_fullStr | Precise modelling and interpretation of bioactivities of ligands targeting G protein-coupled receptors |
title_full_unstemmed | Precise modelling and interpretation of bioactivities of ligands targeting G protein-coupled receptors |
title_short | Precise modelling and interpretation of bioactivities of ligands targeting G protein-coupled receptors |
title_sort | precise modelling and interpretation of bioactivities of ligands targeting g protein-coupled receptors |
topic | Ismb/Eccb 2019 Conference Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612825/ https://www.ncbi.nlm.nih.gov/pubmed/31510691 http://dx.doi.org/10.1093/bioinformatics/btz336 |
work_keys_str_mv | AT wujiansheng precisemodellingandinterpretationofbioactivitiesofligandstargetinggproteincoupledreceptors AT liuben precisemodellingandinterpretationofbioactivitiesofligandstargetinggproteincoupledreceptors AT chanwallacekb precisemodellingandinterpretationofbioactivitiesofligandstargetinggproteincoupledreceptors AT wuweijian precisemodellingandinterpretationofbioactivitiesofligandstargetinggproteincoupledreceptors AT pangtao precisemodellingandinterpretationofbioactivitiesofligandstargetinggproteincoupledreceptors AT huhaifeng precisemodellingandinterpretationofbioactivitiesofligandstargetinggproteincoupledreceptors AT yanshancheng precisemodellingandinterpretationofbioactivitiesofligandstargetinggproteincoupledreceptors AT kexiaoyan precisemodellingandinterpretationofbioactivitiesofligandstargetinggproteincoupledreceptors AT zhangyang precisemodellingandinterpretationofbioactivitiesofligandstargetinggproteincoupledreceptors |
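
Illustrative note on step (ii) of the pipeline described in the abstract (feature selection by screening for Lasso): the sketch below is not the SED code, which is distributed at the URL given in the record. It assumes synthetic sparse binary columns in place of real ECFP bits, uses the basic strong rule as a stand-in screening rule (the abstract does not say which rule SED uses), and solves the Lasso with a plain proximal-gradient loop. All function names are hypothetical. Steps (i) and (iii), fingerprint generation and the deep neural network, are not shown.

```python
import numpy as np

def strong_rule_keep(X, y, lam):
    """Stand-in screening rule (basic strong rule, an assumption here):
    keep feature j when |x_j^T y| / n >= 2*lam - lam_max."""
    n = X.shape[0]
    score = np.abs(X.T @ y) / n
    lam_max = score.max()          # smallest lam with an all-zero solution
    return score >= 2.0 * lam - lam_max

def lasso_objective(X, y, w, lam):
    """(1/2n) * ||y - Xw||^2 + lam * ||w||_1"""
    n = X.shape[0]
    r = y - X @ w
    return 0.5 * (r @ r) / n + lam * np.abs(w).sum()

def lasso_ista(X, y, lam, n_iter=500):
    """Minimise the Lasso objective by proximal gradient descent (ISTA)."""
    n, p = X.shape
    step = n / (np.linalg.norm(X, 2) ** 2)   # 1/L with L = sigma_max(X)^2 / n
    w = np.zeros(p)
    for _ in range(n_iter):
        grad = X.T @ (X @ w - y) / n
        z = w - step * grad
        # Soft-thresholding: proximal operator of the L1 penalty.
        w = np.sign(z) * np.maximum(np.abs(z) - step * lam, 0.0)
    return w

def make_toy_data(n=200, p=2000, n_informative=5, seed=0):
    """Synthetic data: sparse binary columns standing in for fingerprint bits."""
    rng = np.random.default_rng(seed)
    X = (rng.random((n, p)) < 0.1).astype(float)
    w_true = np.zeros(p)
    w_true[:n_informative] = rng.uniform(1.0, 2.0, n_informative)
    y = X @ w_true + 0.1 * rng.standard_normal(n)
    return X, y

X, y = make_toy_data()
Xc = X - X.mean(axis=0)            # centre so that no intercept is needed
yc = y - y.mean()

lam_max = np.abs(Xc.T @ yc).max() / len(yc)
lam = 0.7 * lam_max                # arbitrary illustrative penalty level

keep = strong_rule_keep(Xc, yc, lam)       # screening before the solve
w_sub = lasso_ista(Xc[:, keep], yc, lam)   # Lasso on the surviving columns
selected = np.flatnonzero(keep)[w_sub != 0]

print(f"{keep.sum()} of {X.shape[1]} features survive screening")
print(f"{selected.size} features have non-zero Lasso coefficients")
```

In a pipeline of the kind the abstract describes, the columns listed in `selected` would be the reduced input passed to the regression model. The strong rule is a heuristic and can discard a feature that belongs in the solution, so a careful implementation would check the optimality conditions on the discarded columns afterwards.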