Cargando…

Annotation of Allosteric Compounds to Enhance Bioactivity Modeling for Class A GPCRs

[Image: see text] Proteins often have both orthosteric and allosteric binding sites. Endogenous ligands, such as hormones and neurotransmitters, bind to the orthosteric site, while synthetic ligands may bind to orthosteric or allosteric sites, which has become a focal point in drug discovery. Usuall...

Descripción completa

Detalles Bibliográficos
Autores principales: Burggraaff, Lindsey, van Veen, Amber, Lam, Chi Chung, van Vlijmen, Herman W. T., IJzerman, Adriaan P., van Westen, Gerard J. P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2020
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7592116/
https://www.ncbi.nlm.nih.gov/pubmed/32931270
http://dx.doi.org/10.1021/acs.jcim.0c00695
_version_ 1783601129634922496
author Burggraaff, Lindsey
van Veen, Amber
Lam, Chi Chung
van Vlijmen, Herman W. T.
IJzerman, Adriaan P.
van Westen, Gerard J. P.
author_facet Burggraaff, Lindsey
van Veen, Amber
Lam, Chi Chung
van Vlijmen, Herman W. T.
IJzerman, Adriaan P.
van Westen, Gerard J. P.
author_sort Burggraaff, Lindsey
collection PubMed
description [Image: see text] Proteins often have both orthosteric and allosteric binding sites. Endogenous ligands, such as hormones and neurotransmitters, bind to the orthosteric site, while synthetic ligands may bind to orthosteric or allosteric sites, which has become a focal point in drug discovery. Usually, such allosteric modulators bind to a protein noncompetitively with its endogenous ligand or substrate. The growing interest in allosteric modulators has resulted in a substantial increase of these entities and their features such as binding data in chemical libraries and databases. Although this data surge fuels research focused on allosteric modulators, binding data is unfortunately not always clearly indicated as being allosteric or orthosteric. Therefore, allosteric binding data is difficult to retrieve from databases that contain a mixture of allosteric and orthosteric compounds. This decreases model performance when statistical methods, such as machine learning models, are applied. In previous work we generated an allosteric data subset of ChEMBL release 14. In the current study an improved text mining approach is used to retrieve the allosteric and orthosteric binding types from the literature in ChEMBL release 22. Moreover, convolutional deep neural networks were constructed to predict the binding types of compounds for class A G protein-coupled receptors (GPCRs). Temporal split validation showed the model predictiveness with Matthews correlation coefficient (MCC) = 0.54, sensitivity allosteric = 0.54, and sensitivity orthosteric = 0.94. Finally, this study shows that the inclusion of accurate binding types increases binding predictions by including them as descriptor (MCC = 0.27 improved to MCC = 0.34; validated for class A GPCRs, trained on all GPCRs). Although the focus of this study is mainly on class A GPCRs, binding types for all protein classes in ChEMBL were obtained and explored. The data set is included as a supplement to this study, allowing the reader to select the compounds and binding types of interest.
format Online
Article
Text
id pubmed-7592116
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-75921162020-10-28 Annotation of Allosteric Compounds to Enhance Bioactivity Modeling for Class A GPCRs Burggraaff, Lindsey van Veen, Amber Lam, Chi Chung van Vlijmen, Herman W. T. IJzerman, Adriaan P. van Westen, Gerard J. P. J Chem Inf Model [Image: see text] Proteins often have both orthosteric and allosteric binding sites. Endogenous ligands, such as hormones and neurotransmitters, bind to the orthosteric site, while synthetic ligands may bind to orthosteric or allosteric sites, which has become a focal point in drug discovery. Usually, such allosteric modulators bind to a protein noncompetitively with its endogenous ligand or substrate. The growing interest in allosteric modulators has resulted in a substantial increase of these entities and their features such as binding data in chemical libraries and databases. Although this data surge fuels research focused on allosteric modulators, binding data is unfortunately not always clearly indicated as being allosteric or orthosteric. Therefore, allosteric binding data is difficult to retrieve from databases that contain a mixture of allosteric and orthosteric compounds. This decreases model performance when statistical methods, such as machine learning models, are applied. In previous work we generated an allosteric data subset of ChEMBL release 14. In the current study an improved text mining approach is used to retrieve the allosteric and orthosteric binding types from the literature in ChEMBL release 22. Moreover, convolutional deep neural networks were constructed to predict the binding types of compounds for class A G protein-coupled receptors (GPCRs). Temporal split validation showed the model predictiveness with Matthews correlation coefficient (MCC) = 0.54, sensitivity allosteric = 0.54, and sensitivity orthosteric = 0.94. Finally, this study shows that the inclusion of accurate binding types increases binding predictions by including them as descriptor (MCC = 0.27 improved to MCC = 0.34; validated for class A GPCRs, trained on all GPCRs). Although the focus of this study is mainly on class A GPCRs, binding types for all protein classes in ChEMBL were obtained and explored. The data set is included as a supplement to this study, allowing the reader to select the compounds and binding types of interest. American Chemical Society 2020-09-15 2020-10-26 /pmc/articles/PMC7592116/ /pubmed/32931270 http://dx.doi.org/10.1021/acs.jcim.0c00695 Text en This is an open access article published under a Creative Commons Non-Commercial No Derivative Works (CC-BY-NC-ND) Attribution License (http://pubs.acs.org/page/policy/authorchoice_ccbyncnd_termsofuse.html) , which permits copying and redistribution of the article, and creation of adaptations, all for non-commercial purposes.
spellingShingle Burggraaff, Lindsey
van Veen, Amber
Lam, Chi Chung
van Vlijmen, Herman W. T.
IJzerman, Adriaan P.
van Westen, Gerard J. P.
Annotation of Allosteric Compounds to Enhance Bioactivity Modeling for Class A GPCRs
title Annotation of Allosteric Compounds to Enhance Bioactivity Modeling for Class A GPCRs
title_full Annotation of Allosteric Compounds to Enhance Bioactivity Modeling for Class A GPCRs
title_fullStr Annotation of Allosteric Compounds to Enhance Bioactivity Modeling for Class A GPCRs
title_full_unstemmed Annotation of Allosteric Compounds to Enhance Bioactivity Modeling for Class A GPCRs
title_short Annotation of Allosteric Compounds to Enhance Bioactivity Modeling for Class A GPCRs
title_sort annotation of allosteric compounds to enhance bioactivity modeling for class a gpcrs
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7592116/
https://www.ncbi.nlm.nih.gov/pubmed/32931270
http://dx.doi.org/10.1021/acs.jcim.0c00695
work_keys_str_mv AT burggraafflindsey annotationofallostericcompoundstoenhancebioactivitymodelingforclassagpcrs
AT vanveenamber annotationofallostericcompoundstoenhancebioactivitymodelingforclassagpcrs
AT lamchichung annotationofallostericcompoundstoenhancebioactivitymodelingforclassagpcrs
AT vanvlijmenhermanwt annotationofallostericcompoundstoenhancebioactivitymodelingforclassagpcrs
AT ijzermanadriaanp annotationofallostericcompoundstoenhancebioactivitymodelingforclassagpcrs
AT vanwestengerardjp annotationofallostericcompoundstoenhancebioactivitymodelingforclassagpcrs