Cargando…

Derived Data Storage and Exchange Workflow for Large-Scale Neuroimaging Analyses on the BIRN Grid

Organizing and annotating biomedical data in structured ways has gained much interest and focus in the last 30 years. Driven by decreases in digital storage costs and advances in genetics sequencing, imaging, electronic data collection, and microarray technologies, data is being collected at an ever...

Descripción completa

Detalles Bibliográficos
Autores principales: Keator, David B., Wei, Dingying, Gadde, Syam, Bockholt, Jeremy, Grethe, Jeffrey S., Marcus, Daniel, Aucoin, Nicole, Ozyurt, Ibrahim B.
Formato: Texto
Lenguaje:English
Publicado: Frontiers Research Foundation 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2759340/
https://www.ncbi.nlm.nih.gov/pubmed/19826494
http://dx.doi.org/10.3389/neuro.11.030.2009
_version_ 1782172660580483072
author Keator, David B.
Wei, Dingying
Gadde, Syam
Bockholt, Jeremy
Grethe, Jeffrey S.
Marcus, Daniel
Aucoin, Nicole
Ozyurt, Ibrahim B.
author_facet Keator, David B.
Wei, Dingying
Gadde, Syam
Bockholt, Jeremy
Grethe, Jeffrey S.
Marcus, Daniel
Aucoin, Nicole
Ozyurt, Ibrahim B.
author_sort Keator, David B.
collection PubMed
description Organizing and annotating biomedical data in structured ways has gained much interest and focus in the last 30 years. Driven by decreases in digital storage costs and advances in genetics sequencing, imaging, electronic data collection, and microarray technologies, data is being collected at an ever increasing rate. The need to store and exchange data in meaningful ways in support of data analysis, hypothesis testing and future collaborative use is pervasive. Because trans-disciplinary projects rely on effective use of data from many domains, there is a genuine interest in informatics community on how best to store and combine this data while maintaining a high level of data quality and documentation. The difficulties in sharing and combining raw data become amplified after post-processing and/or data analysis in which the new dataset of interest is a function of the original data and may have been collected by multiple collaborating sites. Simple meta-data, documenting which subject and version of data were used for a particular analysis, becomes complicated by the heterogeneity of the collecting sites yet is critically important to the interpretation and reuse of derived results. This manuscript will present a case study of using the XML-Based Clinical Experiment Data Exchange (XCEDE) schema and the Human Imaging Database (HID) in the Biomedical Informatics Research Network's (BIRN) distributed environment to document and exchange derived data. The discussion includes an overview of the data structures used in both the XML and the database representations, insight into the design considerations, and the extensibility of the design to support additional analysis streams.
format Text
id pubmed-2759340
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Frontiers Research Foundation
record_format MEDLINE/PubMed
spelling pubmed-27593402009-10-13 Derived Data Storage and Exchange Workflow for Large-Scale Neuroimaging Analyses on the BIRN Grid Keator, David B. Wei, Dingying Gadde, Syam Bockholt, Jeremy Grethe, Jeffrey S. Marcus, Daniel Aucoin, Nicole Ozyurt, Ibrahim B. Front Neuroinformatics Neuroscience Organizing and annotating biomedical data in structured ways has gained much interest and focus in the last 30 years. Driven by decreases in digital storage costs and advances in genetics sequencing, imaging, electronic data collection, and microarray technologies, data is being collected at an ever increasing rate. The need to store and exchange data in meaningful ways in support of data analysis, hypothesis testing and future collaborative use is pervasive. Because trans-disciplinary projects rely on effective use of data from many domains, there is a genuine interest in informatics community on how best to store and combine this data while maintaining a high level of data quality and documentation. The difficulties in sharing and combining raw data become amplified after post-processing and/or data analysis in which the new dataset of interest is a function of the original data and may have been collected by multiple collaborating sites. Simple meta-data, documenting which subject and version of data were used for a particular analysis, becomes complicated by the heterogeneity of the collecting sites yet is critically important to the interpretation and reuse of derived results. This manuscript will present a case study of using the XML-Based Clinical Experiment Data Exchange (XCEDE) schema and the Human Imaging Database (HID) in the Biomedical Informatics Research Network's (BIRN) distributed environment to document and exchange derived data. The discussion includes an overview of the data structures used in both the XML and the database representations, insight into the design considerations, and the extensibility of the design to support additional analysis streams. Frontiers Research Foundation 2009-09-07 /pmc/articles/PMC2759340/ /pubmed/19826494 http://dx.doi.org/10.3389/neuro.11.030.2009 Text en Copyright © 2009 Keator, Wei, Gadde, Bockholt, Grethe, Marcus, Aucoin and Ozyurt. http://www.frontiersin.org/licenseagreement This is an open-access article subject to an exclusive license agreement between the authors and the Frontiers Research Foundation, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are credited.
spellingShingle Neuroscience
Keator, David B.
Wei, Dingying
Gadde, Syam
Bockholt, Jeremy
Grethe, Jeffrey S.
Marcus, Daniel
Aucoin, Nicole
Ozyurt, Ibrahim B.
Derived Data Storage and Exchange Workflow for Large-Scale Neuroimaging Analyses on the BIRN Grid
title Derived Data Storage and Exchange Workflow for Large-Scale Neuroimaging Analyses on the BIRN Grid
title_full Derived Data Storage and Exchange Workflow for Large-Scale Neuroimaging Analyses on the BIRN Grid
title_fullStr Derived Data Storage and Exchange Workflow for Large-Scale Neuroimaging Analyses on the BIRN Grid
title_full_unstemmed Derived Data Storage and Exchange Workflow for Large-Scale Neuroimaging Analyses on the BIRN Grid
title_short Derived Data Storage and Exchange Workflow for Large-Scale Neuroimaging Analyses on the BIRN Grid
title_sort derived data storage and exchange workflow for large-scale neuroimaging analyses on the birn grid
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2759340/
https://www.ncbi.nlm.nih.gov/pubmed/19826494
http://dx.doi.org/10.3389/neuro.11.030.2009
work_keys_str_mv AT keatordavidb deriveddatastorageandexchangeworkflowforlargescaleneuroimaginganalysesonthebirngrid
AT weidingying deriveddatastorageandexchangeworkflowforlargescaleneuroimaginganalysesonthebirngrid
AT gaddesyam deriveddatastorageandexchangeworkflowforlargescaleneuroimaginganalysesonthebirngrid
AT bockholtjeremy deriveddatastorageandexchangeworkflowforlargescaleneuroimaginganalysesonthebirngrid
AT grethejeffreys deriveddatastorageandexchangeworkflowforlargescaleneuroimaginganalysesonthebirngrid
AT marcusdaniel deriveddatastorageandexchangeworkflowforlargescaleneuroimaginganalysesonthebirngrid
AT aucoinnicole deriveddatastorageandexchangeworkflowforlargescaleneuroimaginganalysesonthebirngrid
AT ozyurtibrahimb deriveddatastorageandexchangeworkflowforlargescaleneuroimaginganalysesonthebirngrid