Cargando…

LigASite—a database of biologically relevant binding sites in proteins with known apo-structures

Better characterization of binding sites in proteins and the ability to accurately predict their location and energetic properties are major challenges which, if addressed, would have many valuable practical applications. Unfortunately, reliable benchmark datasets of binding sites in proteins are st...

Descripción completa

Detalles Bibliográficos
Autores principales: Dessailly, Benoit H., Lensink, Marc F., Orengo, Christine A., Wodak, Shoshana J.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2238865/
https://www.ncbi.nlm.nih.gov/pubmed/17933762
http://dx.doi.org/10.1093/nar/gkm839
_version_ 1782150478122975232
author Dessailly, Benoit H.
Lensink, Marc F.
Orengo, Christine A.
Wodak, Shoshana J.
author_facet Dessailly, Benoit H.
Lensink, Marc F.
Orengo, Christine A.
Wodak, Shoshana J.
author_sort Dessailly, Benoit H.
collection PubMed
description Better characterization of binding sites in proteins and the ability to accurately predict their location and energetic properties are major challenges which, if addressed, would have many valuable practical applications. Unfortunately, reliable benchmark datasets of binding sites in proteins are still sorely lacking. Here, we present LigASite (‘LIGand Attachment SITE’), a gold-standard dataset of binding sites in 550 proteins of known structures. LigASite consists exclusively of biologically relevant binding sites in proteins for which at least one apo- and one holo-structure are available. In defining the binding sites for each protein, information from all holo-structures is combined, considering in each case the quaternary structure defined by the PQS server. LigASite is built using simple criteria and is automatically updated as new structures become available in the PDB, thereby guaranteeing optimal data coverage over time. Both a redundant and a culled non-redundant version of the dataset is available at http://www.scmbb.ulb.ac.be/Users/benoit/LigASite. The website interface allows users to search the dataset by PDB identifiers, ligand identifiers, protein names or sequence, and to look for structural matches as defined by the CATH homologous superfamilies. The datasets can be downloaded from the website as Schema-validated XML files or comma-separated flat files.
format Text
id pubmed-2238865
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-22388652008-02-12 LigASite—a database of biologically relevant binding sites in proteins with known apo-structures Dessailly, Benoit H. Lensink, Marc F. Orengo, Christine A. Wodak, Shoshana J. Nucleic Acids Res Articles Better characterization of binding sites in proteins and the ability to accurately predict their location and energetic properties are major challenges which, if addressed, would have many valuable practical applications. Unfortunately, reliable benchmark datasets of binding sites in proteins are still sorely lacking. Here, we present LigASite (‘LIGand Attachment SITE’), a gold-standard dataset of binding sites in 550 proteins of known structures. LigASite consists exclusively of biologically relevant binding sites in proteins for which at least one apo- and one holo-structure are available. In defining the binding sites for each protein, information from all holo-structures is combined, considering in each case the quaternary structure defined by the PQS server. LigASite is built using simple criteria and is automatically updated as new structures become available in the PDB, thereby guaranteeing optimal data coverage over time. Both a redundant and a culled non-redundant version of the dataset is available at http://www.scmbb.ulb.ac.be/Users/benoit/LigASite. The website interface allows users to search the dataset by PDB identifiers, ligand identifiers, protein names or sequence, and to look for structural matches as defined by the CATH homologous superfamilies. The datasets can be downloaded from the website as Schema-validated XML files or comma-separated flat files. Oxford University Press 2008-01 2007-10-11 /pmc/articles/PMC2238865/ /pubmed/17933762 http://dx.doi.org/10.1093/nar/gkm839 Text en © 2007 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Dessailly, Benoit H.
Lensink, Marc F.
Orengo, Christine A.
Wodak, Shoshana J.
LigASite—a database of biologically relevant binding sites in proteins with known apo-structures
title LigASite—a database of biologically relevant binding sites in proteins with known apo-structures
title_full LigASite—a database of biologically relevant binding sites in proteins with known apo-structures
title_fullStr LigASite—a database of biologically relevant binding sites in proteins with known apo-structures
title_full_unstemmed LigASite—a database of biologically relevant binding sites in proteins with known apo-structures
title_short LigASite—a database of biologically relevant binding sites in proteins with known apo-structures
title_sort ligasite—a database of biologically relevant binding sites in proteins with known apo-structures
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2238865/
https://www.ncbi.nlm.nih.gov/pubmed/17933762
http://dx.doi.org/10.1093/nar/gkm839
work_keys_str_mv AT dessaillybenoith ligasiteadatabaseofbiologicallyrelevantbindingsitesinproteinswithknownapostructures
AT lensinkmarcf ligasiteadatabaseofbiologicallyrelevantbindingsitesinproteinswithknownapostructures
AT orengochristinea ligasiteadatabaseofbiologicallyrelevantbindingsitesinproteinswithknownapostructures
AT wodakshoshanaj ligasiteadatabaseofbiologicallyrelevantbindingsitesinproteinswithknownapostructures