Cargando…

GENE2D: A NoSQL Integrated Data Repository of Genetic Disorders Data

There are few sources from which to obtain clinical and genetic data for use in research in Saudi Arabia. Numerous obstacles led to the difficulty of integrating these data from silos and scattered sources to provide standardized access to large data sets for patients with common health conditions....

Descripción completa

Detalles Bibliográficos
Autores principales: Samra, Halima, Li, Alice, Soh, Ben
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7551627/
https://www.ncbi.nlm.nih.gov/pubmed/32781728
http://dx.doi.org/10.3390/healthcare8030257
_version_ 1783593223833255936
author Samra, Halima
Li, Alice
Soh, Ben
author_facet Samra, Halima
Li, Alice
Soh, Ben
author_sort Samra, Halima
collection PubMed
description There are few sources from which to obtain clinical and genetic data for use in research in Saudi Arabia. Numerous obstacles led to the difficulty of integrating these data from silos and scattered sources to provide standardized access to large data sets for patients with common health conditions. To this end, we sought to contribute to this area and offer a practical and easy-to-implement solution. In this paper, we aim to design and implement a “not only SQL” (NoSQL) based integration framework to generate an Integrated Data Repository of Genetic Disorders Data (GENE2D) to integrate data from various genetic clinics and research centers in Saudi Arabia and provide an easy-to-use query interface for researchers to conduct their studies on large datasets. The major components involved in the GENE2D architecture consists of the data sources, the integrated data repository (IDR) as a central database, and the application interface. The IDR uses a NoSQL document store via MongoDB (an open source document-oriented database program) as a backend database. The application interface called Query Builder provides multiple services for data retrieval from the database using a custom query to answer simple or complex research questions. The GENE2D system demonstrates its potential to help grow and develop a national genetic disorders database in Saudi Arabia.
format Online
Article
Text
id pubmed-7551627
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75516272020-10-14 GENE2D: A NoSQL Integrated Data Repository of Genetic Disorders Data Samra, Halima Li, Alice Soh, Ben Healthcare (Basel) Article There are few sources from which to obtain clinical and genetic data for use in research in Saudi Arabia. Numerous obstacles led to the difficulty of integrating these data from silos and scattered sources to provide standardized access to large data sets for patients with common health conditions. To this end, we sought to contribute to this area and offer a practical and easy-to-implement solution. In this paper, we aim to design and implement a “not only SQL” (NoSQL) based integration framework to generate an Integrated Data Repository of Genetic Disorders Data (GENE2D) to integrate data from various genetic clinics and research centers in Saudi Arabia and provide an easy-to-use query interface for researchers to conduct their studies on large datasets. The major components involved in the GENE2D architecture consists of the data sources, the integrated data repository (IDR) as a central database, and the application interface. The IDR uses a NoSQL document store via MongoDB (an open source document-oriented database program) as a backend database. The application interface called Query Builder provides multiple services for data retrieval from the database using a custom query to answer simple or complex research questions. The GENE2D system demonstrates its potential to help grow and develop a national genetic disorders database in Saudi Arabia. MDPI 2020-08-06 /pmc/articles/PMC7551627/ /pubmed/32781728 http://dx.doi.org/10.3390/healthcare8030257 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Samra, Halima
Li, Alice
Soh, Ben
GENE2D: A NoSQL Integrated Data Repository of Genetic Disorders Data
title GENE2D: A NoSQL Integrated Data Repository of Genetic Disorders Data
title_full GENE2D: A NoSQL Integrated Data Repository of Genetic Disorders Data
title_fullStr GENE2D: A NoSQL Integrated Data Repository of Genetic Disorders Data
title_full_unstemmed GENE2D: A NoSQL Integrated Data Repository of Genetic Disorders Data
title_short GENE2D: A NoSQL Integrated Data Repository of Genetic Disorders Data
title_sort gene2d: a nosql integrated data repository of genetic disorders data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7551627/
https://www.ncbi.nlm.nih.gov/pubmed/32781728
http://dx.doi.org/10.3390/healthcare8030257
work_keys_str_mv AT samrahalima gene2danosqlintegrateddatarepositoryofgeneticdisordersdata
AT lialice gene2danosqlintegrateddatarepositoryofgeneticdisordersdata
AT sohben gene2danosqlintegrateddatarepositoryofgeneticdisordersdata