Cargando…

A perspective for biomedical data integration: Design of databases for flow cytometry

BACKGROUND: The integration of biomedical information is essential for tackling medical problems. We describe a data model in the domain of flow cytometry (FC) allowing for massive management, analysis and integration with other laboratory and clinical information. The paper is concerned with the pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Drakos, John, Karakantza, Marina, Zoumbos, Nicholas C, Lakoumentas, John, Nikiforidis, George C, Sakellaropoulos, George C
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2267440/
https://www.ncbi.nlm.nih.gov/pubmed/18275602
http://dx.doi.org/10.1186/1471-2105-9-99
_version_ 1782151630592933888
author Drakos, John
Karakantza, Marina
Zoumbos, Nicholas C
Lakoumentas, John
Nikiforidis, George C
Sakellaropoulos, George C
author_facet Drakos, John
Karakantza, Marina
Zoumbos, Nicholas C
Lakoumentas, John
Nikiforidis, George C
Sakellaropoulos, George C
author_sort Drakos, John
collection PubMed
description BACKGROUND: The integration of biomedical information is essential for tackling medical problems. We describe a data model in the domain of flow cytometry (FC) allowing for massive management, analysis and integration with other laboratory and clinical information. The paper is concerned with the proper translation of the Flow Cytometry Standard (FCS) into a relational database schema, in a way that facilitates end users at either doing research on FC or studying specific cases of patients undergone FC analysis RESULTS: The proposed database schema provides integration of data originating from diverse acquisition settings, organized in a way that allows syntactically simple queries that provide results significantly faster than the conventional implementations of the FCS standard. The proposed schema can potentially achieve up to 8 orders of magnitude reduction in query complexity and up to 2 orders of magnitude reduction in response time for data originating from flow cytometers that record 256 colours. This is mainly achieved by managing to maintain an almost constant number of data-mining procedures regardless of the size and complexity of the stored information. CONCLUSION: It is evident that using single-file data storage standards for the design of databases without any structural transformations significantly limits the flexibility of databases. Analysis of the requirements of a specific domain for integration and massive data processing can provide the necessary schema modifications that will unlock the additional functionality of a relational database.
format Text
id pubmed-2267440
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-22674402008-03-14 A perspective for biomedical data integration: Design of databases for flow cytometry Drakos, John Karakantza, Marina Zoumbos, Nicholas C Lakoumentas, John Nikiforidis, George C Sakellaropoulos, George C BMC Bioinformatics Research Article BACKGROUND: The integration of biomedical information is essential for tackling medical problems. We describe a data model in the domain of flow cytometry (FC) allowing for massive management, analysis and integration with other laboratory and clinical information. The paper is concerned with the proper translation of the Flow Cytometry Standard (FCS) into a relational database schema, in a way that facilitates end users at either doing research on FC or studying specific cases of patients undergone FC analysis RESULTS: The proposed database schema provides integration of data originating from diverse acquisition settings, organized in a way that allows syntactically simple queries that provide results significantly faster than the conventional implementations of the FCS standard. The proposed schema can potentially achieve up to 8 orders of magnitude reduction in query complexity and up to 2 orders of magnitude reduction in response time for data originating from flow cytometers that record 256 colours. This is mainly achieved by managing to maintain an almost constant number of data-mining procedures regardless of the size and complexity of the stored information. CONCLUSION: It is evident that using single-file data storage standards for the design of databases without any structural transformations significantly limits the flexibility of databases. Analysis of the requirements of a specific domain for integration and massive data processing can provide the necessary schema modifications that will unlock the additional functionality of a relational database. BioMed Central 2008-02-14 /pmc/articles/PMC2267440/ /pubmed/18275602 http://dx.doi.org/10.1186/1471-2105-9-99 Text en Copyright © 2008 Drakos et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Drakos, John
Karakantza, Marina
Zoumbos, Nicholas C
Lakoumentas, John
Nikiforidis, George C
Sakellaropoulos, George C
A perspective for biomedical data integration: Design of databases for flow cytometry
title A perspective for biomedical data integration: Design of databases for flow cytometry
title_full A perspective for biomedical data integration: Design of databases for flow cytometry
title_fullStr A perspective for biomedical data integration: Design of databases for flow cytometry
title_full_unstemmed A perspective for biomedical data integration: Design of databases for flow cytometry
title_short A perspective for biomedical data integration: Design of databases for flow cytometry
title_sort perspective for biomedical data integration: design of databases for flow cytometry
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2267440/
https://www.ncbi.nlm.nih.gov/pubmed/18275602
http://dx.doi.org/10.1186/1471-2105-9-99
work_keys_str_mv AT drakosjohn aperspectiveforbiomedicaldataintegrationdesignofdatabasesforflowcytometry
AT karakantzamarina aperspectiveforbiomedicaldataintegrationdesignofdatabasesforflowcytometry
AT zoumbosnicholasc aperspectiveforbiomedicaldataintegrationdesignofdatabasesforflowcytometry
AT lakoumentasjohn aperspectiveforbiomedicaldataintegrationdesignofdatabasesforflowcytometry
AT nikiforidisgeorgec aperspectiveforbiomedicaldataintegrationdesignofdatabasesforflowcytometry
AT sakellaropoulosgeorgec aperspectiveforbiomedicaldataintegrationdesignofdatabasesforflowcytometry
AT drakosjohn perspectiveforbiomedicaldataintegrationdesignofdatabasesforflowcytometry
AT karakantzamarina perspectiveforbiomedicaldataintegrationdesignofdatabasesforflowcytometry
AT zoumbosnicholasc perspectiveforbiomedicaldataintegrationdesignofdatabasesforflowcytometry
AT lakoumentasjohn perspectiveforbiomedicaldataintegrationdesignofdatabasesforflowcytometry
AT nikiforidisgeorgec perspectiveforbiomedicaldataintegrationdesignofdatabasesforflowcytometry
AT sakellaropoulosgeorgec perspectiveforbiomedicaldataintegrationdesignofdatabasesforflowcytometry