Cargando…
Identification and analysis of conserved pockets on protein surfaces
BACKGROUND: The interaction between proteins and ligands occurs at pockets that are often lined by conserved amino acids. These pockets can represent the targets for low molecular weight drugs. In order to make the research for new medicines as productive as possible, it is necessary to exploit &quo...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3633052/ https://www.ncbi.nlm.nih.gov/pubmed/23815589 http://dx.doi.org/10.1186/1471-2105-14-S7-S9 |
_version_ | 1782266934657548288 |
---|---|
author | Cammisa, Marco Correra, Antonella Andreotti, Giuseppina Cubellis, Maria Vittoria |
author_facet | Cammisa, Marco Correra, Antonella Andreotti, Giuseppina Cubellis, Maria Vittoria |
author_sort | Cammisa, Marco |
collection | PubMed |
description | BACKGROUND: The interaction between proteins and ligands occurs at pockets that are often lined by conserved amino acids. These pockets can represent the targets for low molecular weight drugs. In order to make the research for new medicines as productive as possible, it is necessary to exploit "in silico" techniques, high throughput and fragment-based screenings that require the identification of druggable pockets on the surface of proteins, which may or may not correspond to active sites. RESULTS: We developed a tool to evaluate the conservation of each pocket detected on the protein surface by CastP. This tool was named DrosteP because it recursively searches for optimal input sequences to be used to calculate conservation. DrosteP uses a descriptor of statistical significance, Poisson p-value, as a target to optimize the choice of input sequences. To benchmark DrosteP we used monomeric or homodimer human proteins with known 3D-structure whose active site had been annotated in UniProt. DrosteP is able to detect the active site with high accuracy because in 81% of the cases it coincides with the most conserved pocket. Comparing DrosteP with analogous programs is difficult because the outputs are different. Nonetheless we could assess the efficacy of the recursive algorithm in the identification of active site pockets by calculating conservation with the same input sequences used by other programs. We analyzed the amino-acid composition of conserved pockets identified by DrosteP and we found that it differs significantly from the amino-acid composition of non conserved pockets. CONCLUSIONS: Several methods for predicting ligand binding sites on protein surfaces, that combine 3D-structure and evolutionary sequence conservation, have been proposed. Any method relying on conservation mainly depends on the choice of the input sequences. DrosteP chooses how deeply distant homologs must be collected to evaluate conservation and thus optimizes the identification of active site pockets. Moreover it recognizes conserved pockets other than those coinciding with the sites annotated in UniProt that might represent useful druggable sites. The distinctive amino-acid composition of conserved pockets provides useful hints on the fundamental principles underlying protein-ligand interaction. AVAILABILITY: http://www.icb.cnr.it/project/drosteppy/ |
format | Online Article Text |
id | pubmed-3633052 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-36330522013-04-25 Identification and analysis of conserved pockets on protein surfaces Cammisa, Marco Correra, Antonella Andreotti, Giuseppina Cubellis, Maria Vittoria BMC Bioinformatics Research BACKGROUND: The interaction between proteins and ligands occurs at pockets that are often lined by conserved amino acids. These pockets can represent the targets for low molecular weight drugs. In order to make the research for new medicines as productive as possible, it is necessary to exploit "in silico" techniques, high throughput and fragment-based screenings that require the identification of druggable pockets on the surface of proteins, which may or may not correspond to active sites. RESULTS: We developed a tool to evaluate the conservation of each pocket detected on the protein surface by CastP. This tool was named DrosteP because it recursively searches for optimal input sequences to be used to calculate conservation. DrosteP uses a descriptor of statistical significance, Poisson p-value, as a target to optimize the choice of input sequences. To benchmark DrosteP we used monomeric or homodimer human proteins with known 3D-structure whose active site had been annotated in UniProt. DrosteP is able to detect the active site with high accuracy because in 81% of the cases it coincides with the most conserved pocket. Comparing DrosteP with analogous programs is difficult because the outputs are different. Nonetheless we could assess the efficacy of the recursive algorithm in the identification of active site pockets by calculating conservation with the same input sequences used by other programs. We analyzed the amino-acid composition of conserved pockets identified by DrosteP and we found that it differs significantly from the amino-acid composition of non conserved pockets. CONCLUSIONS: Several methods for predicting ligand binding sites on protein surfaces, that combine 3D-structure and evolutionary sequence conservation, have been proposed. Any method relying on conservation mainly depends on the choice of the input sequences. DrosteP chooses how deeply distant homologs must be collected to evaluate conservation and thus optimizes the identification of active site pockets. Moreover it recognizes conserved pockets other than those coinciding with the sites annotated in UniProt that might represent useful druggable sites. The distinctive amino-acid composition of conserved pockets provides useful hints on the fundamental principles underlying protein-ligand interaction. AVAILABILITY: http://www.icb.cnr.it/project/drosteppy/ BioMed Central 2013-04-22 /pmc/articles/PMC3633052/ /pubmed/23815589 http://dx.doi.org/10.1186/1471-2105-14-S7-S9 Text en Copyright © 2013 Cubellis et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Cammisa, Marco Correra, Antonella Andreotti, Giuseppina Cubellis, Maria Vittoria Identification and analysis of conserved pockets on protein surfaces |
title | Identification and analysis of conserved pockets on protein surfaces |
title_full | Identification and analysis of conserved pockets on protein surfaces |
title_fullStr | Identification and analysis of conserved pockets on protein surfaces |
title_full_unstemmed | Identification and analysis of conserved pockets on protein surfaces |
title_short | Identification and analysis of conserved pockets on protein surfaces |
title_sort | identification and analysis of conserved pockets on protein surfaces |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3633052/ https://www.ncbi.nlm.nih.gov/pubmed/23815589 http://dx.doi.org/10.1186/1471-2105-14-S7-S9 |
work_keys_str_mv | AT cammisamarco identificationandanalysisofconservedpocketsonproteinsurfaces AT correraantonella identificationandanalysisofconservedpocketsonproteinsurfaces AT andreottigiuseppina identificationandanalysisofconservedpocketsonproteinsurfaces AT cubellismariavittoria identificationandanalysisofconservedpocketsonproteinsurfaces |