Cargando…

Rapid development of entity-based data models for bioinformatics with persistence object-oriented design and structured interfaces

Databases are imperative for research in bioinformatics and computational biology. Current challenges in database design include data heterogeneity and context-dependent interconnections between data entities. These challenges drove the development of unified data interfaces and specialized database...

Descripción completa

Detalles Bibliográficos
Autor principal: Ezra Tsur, Elishai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5346198/
https://www.ncbi.nlm.nih.gov/pubmed/28293298
http://dx.doi.org/10.1186/s13040-017-0130-z
_version_ 1782513842095390720
author Ezra Tsur, Elishai
author_facet Ezra Tsur, Elishai
author_sort Ezra Tsur, Elishai
collection PubMed
description Databases are imperative for research in bioinformatics and computational biology. Current challenges in database design include data heterogeneity and context-dependent interconnections between data entities. These challenges drove the development of unified data interfaces and specialized databases. The curation of specialized databases is an ever-growing challenge due to the introduction of new data sources and the emergence of new relational connections between established datasets. Here, an open-source framework for the curation of specialized databases is proposed. The framework supports user-designed models of data encapsulation, objects persistency and structured interfaces to local and external data sources such as MalaCards, Biomodels and the National Centre for Biotechnology Information (NCBI) databases. The proposed framework was implemented using Java as the development environment, EclipseLink as the data persistency agent and Apache Derby as the database manager. Syntactic analysis was based on J3D, jsoup, Apache Commons and w3c.dom open libraries. Finally, a construction of a specialized database for aneurysms associated vascular diseases is demonstrated. This database contains 3-dimensional geometries of aneurysms, patient’s clinical information, articles, biological models, related diseases and our recently published model of aneurysms’ risk of rapture. Framework is available in: http://nbel-lab.com. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-017-0130-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5346198
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-53461982017-03-14 Rapid development of entity-based data models for bioinformatics with persistence object-oriented design and structured interfaces Ezra Tsur, Elishai BioData Min Software Article Databases are imperative for research in bioinformatics and computational biology. Current challenges in database design include data heterogeneity and context-dependent interconnections between data entities. These challenges drove the development of unified data interfaces and specialized databases. The curation of specialized databases is an ever-growing challenge due to the introduction of new data sources and the emergence of new relational connections between established datasets. Here, an open-source framework for the curation of specialized databases is proposed. The framework supports user-designed models of data encapsulation, objects persistency and structured interfaces to local and external data sources such as MalaCards, Biomodels and the National Centre for Biotechnology Information (NCBI) databases. The proposed framework was implemented using Java as the development environment, EclipseLink as the data persistency agent and Apache Derby as the database manager. Syntactic analysis was based on J3D, jsoup, Apache Commons and w3c.dom open libraries. Finally, a construction of a specialized database for aneurysms associated vascular diseases is demonstrated. This database contains 3-dimensional geometries of aneurysms, patient’s clinical information, articles, biological models, related diseases and our recently published model of aneurysms’ risk of rapture. Framework is available in: http://nbel-lab.com. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-017-0130-z) contains supplementary material, which is available to authorized users. BioMed Central 2017-03-11 /pmc/articles/PMC5346198/ /pubmed/28293298 http://dx.doi.org/10.1186/s13040-017-0130-z Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software Article
Ezra Tsur, Elishai
Rapid development of entity-based data models for bioinformatics with persistence object-oriented design and structured interfaces
title Rapid development of entity-based data models for bioinformatics with persistence object-oriented design and structured interfaces
title_full Rapid development of entity-based data models for bioinformatics with persistence object-oriented design and structured interfaces
title_fullStr Rapid development of entity-based data models for bioinformatics with persistence object-oriented design and structured interfaces
title_full_unstemmed Rapid development of entity-based data models for bioinformatics with persistence object-oriented design and structured interfaces
title_short Rapid development of entity-based data models for bioinformatics with persistence object-oriented design and structured interfaces
title_sort rapid development of entity-based data models for bioinformatics with persistence object-oriented design and structured interfaces
topic Software Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5346198/
https://www.ncbi.nlm.nih.gov/pubmed/28293298
http://dx.doi.org/10.1186/s13040-017-0130-z
work_keys_str_mv AT ezratsurelishai rapiddevelopmentofentitybaseddatamodelsforbioinformaticswithpersistenceobjectorienteddesignandstructuredinterfaces