Cargando…

PDBj Mine: design and implementation of relational database interface for Protein Data Bank Japan

This article is a tutorial for PDBj Mine, a new database and its interface for Protein Data Bank Japan (PDBj). In PDBj Mine, data are loaded from files in the PDBMLplus format (an extension of PDBML, PDB's canonical XML format, enriched with annotations), which are then served for the user of P...

Descripción completa

Detalles Bibliográficos
Autores principales: Kinjo, Akira R., Yamashita, Reiko, Nakamura, Haruki
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2997606/
https://www.ncbi.nlm.nih.gov/pubmed/20798081
http://dx.doi.org/10.1093/database/baq021
_version_ 1782193316854497280
author Kinjo, Akira R.
Yamashita, Reiko
Nakamura, Haruki
author_facet Kinjo, Akira R.
Yamashita, Reiko
Nakamura, Haruki
author_sort Kinjo, Akira R.
collection PubMed
description This article is a tutorial for PDBj Mine, a new database and its interface for Protein Data Bank Japan (PDBj). In PDBj Mine, data are loaded from files in the PDBMLplus format (an extension of PDBML, PDB's canonical XML format, enriched with annotations), which are then served for the user of PDBj via the worldwide web (WWW). We describe the basic design of the relational database (RDB) and web interfaces of PDBj Mine. The contents of PDBMLplus files are first broken into XPath entities, and these paths and data are indexed in the way that reflects the hierarchical structure of the XML files. The data for each XPath type are saved into the corresponding relational table that is named as the XPath itself. The generation of table definitions from the PDBMLplus XML schema is fully automated. For efficient search, frequently queried terms are compiled into a brief summary table. Casual users can perform simple keyword search, and 'Advanced Search' which can specify various conditions on the entries. More experienced users can query the database using SQL statements which can be constructed in a uniform manner. Thus, PDBj Mine achieves a combination of the flexibility of XML documents and the robustness of the RDB. Database URL: http://www.pdbj.org/
format Text
id pubmed-2997606
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-29976062010-12-06 PDBj Mine: design and implementation of relational database interface for Protein Data Bank Japan Kinjo, Akira R. Yamashita, Reiko Nakamura, Haruki Database (Oxford) Tutorial This article is a tutorial for PDBj Mine, a new database and its interface for Protein Data Bank Japan (PDBj). In PDBj Mine, data are loaded from files in the PDBMLplus format (an extension of PDBML, PDB's canonical XML format, enriched with annotations), which are then served for the user of PDBj via the worldwide web (WWW). We describe the basic design of the relational database (RDB) and web interfaces of PDBj Mine. The contents of PDBMLplus files are first broken into XPath entities, and these paths and data are indexed in the way that reflects the hierarchical structure of the XML files. The data for each XPath type are saved into the corresponding relational table that is named as the XPath itself. The generation of table definitions from the PDBMLplus XML schema is fully automated. For efficient search, frequently queried terms are compiled into a brief summary table. Casual users can perform simple keyword search, and 'Advanced Search' which can specify various conditions on the entries. More experienced users can query the database using SQL statements which can be constructed in a uniform manner. Thus, PDBj Mine achieves a combination of the flexibility of XML documents and the robustness of the RDB. Database URL: http://www.pdbj.org/ Oxford University Press 2010-08-25 /pmc/articles/PMC2997606/ /pubmed/20798081 http://dx.doi.org/10.1093/database/baq021 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Tutorial
Kinjo, Akira R.
Yamashita, Reiko
Nakamura, Haruki
PDBj Mine: design and implementation of relational database interface for Protein Data Bank Japan
title PDBj Mine: design and implementation of relational database interface for Protein Data Bank Japan
title_full PDBj Mine: design and implementation of relational database interface for Protein Data Bank Japan
title_fullStr PDBj Mine: design and implementation of relational database interface for Protein Data Bank Japan
title_full_unstemmed PDBj Mine: design and implementation of relational database interface for Protein Data Bank Japan
title_short PDBj Mine: design and implementation of relational database interface for Protein Data Bank Japan
title_sort pdbj mine: design and implementation of relational database interface for protein data bank japan
topic Tutorial
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2997606/
https://www.ncbi.nlm.nih.gov/pubmed/20798081
http://dx.doi.org/10.1093/database/baq021
work_keys_str_mv AT kinjoakirar pdbjminedesignandimplementationofrelationaldatabaseinterfaceforproteindatabankjapan
AT yamashitareiko pdbjminedesignandimplementationofrelationaldatabaseinterfaceforproteindatabankjapan
AT nakamuraharuki pdbjminedesignandimplementationofrelationaldatabaseinterfaceforproteindatabankjapan