Cargando…
Derived Data Storage and Exchange Workflow for Large-Scale Neuroimaging Analyses on the BIRN Grid
Organizing and annotating biomedical data in structured ways has gained much interest and focus in the last 30 years. Driven by decreases in digital storage costs and advances in genetics sequencing, imaging, electronic data collection, and microarray technologies, data is being collected at an ever...
Autores principales: | , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Frontiers Research Foundation
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2759340/ https://www.ncbi.nlm.nih.gov/pubmed/19826494 http://dx.doi.org/10.3389/neuro.11.030.2009 |
_version_ | 1782172660580483072 |
---|---|
author | Keator, David B. Wei, Dingying Gadde, Syam Bockholt, Jeremy Grethe, Jeffrey S. Marcus, Daniel Aucoin, Nicole Ozyurt, Ibrahim B. |
author_facet | Keator, David B. Wei, Dingying Gadde, Syam Bockholt, Jeremy Grethe, Jeffrey S. Marcus, Daniel Aucoin, Nicole Ozyurt, Ibrahim B. |
author_sort | Keator, David B. |
collection | PubMed |
description | Organizing and annotating biomedical data in structured ways has gained much interest and focus in the last 30 years. Driven by decreases in digital storage costs and advances in genetics sequencing, imaging, electronic data collection, and microarray technologies, data is being collected at an ever increasing rate. The need to store and exchange data in meaningful ways in support of data analysis, hypothesis testing and future collaborative use is pervasive. Because trans-disciplinary projects rely on effective use of data from many domains, there is a genuine interest in informatics community on how best to store and combine this data while maintaining a high level of data quality and documentation. The difficulties in sharing and combining raw data become amplified after post-processing and/or data analysis in which the new dataset of interest is a function of the original data and may have been collected by multiple collaborating sites. Simple meta-data, documenting which subject and version of data were used for a particular analysis, becomes complicated by the heterogeneity of the collecting sites yet is critically important to the interpretation and reuse of derived results. This manuscript will present a case study of using the XML-Based Clinical Experiment Data Exchange (XCEDE) schema and the Human Imaging Database (HID) in the Biomedical Informatics Research Network's (BIRN) distributed environment to document and exchange derived data. The discussion includes an overview of the data structures used in both the XML and the database representations, insight into the design considerations, and the extensibility of the design to support additional analysis streams. |
format | Text |
id | pubmed-2759340 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Frontiers Research Foundation |
record_format | MEDLINE/PubMed |
spelling | pubmed-27593402009-10-13 Derived Data Storage and Exchange Workflow for Large-Scale Neuroimaging Analyses on the BIRN Grid Keator, David B. Wei, Dingying Gadde, Syam Bockholt, Jeremy Grethe, Jeffrey S. Marcus, Daniel Aucoin, Nicole Ozyurt, Ibrahim B. Front Neuroinformatics Neuroscience Organizing and annotating biomedical data in structured ways has gained much interest and focus in the last 30 years. Driven by decreases in digital storage costs and advances in genetics sequencing, imaging, electronic data collection, and microarray technologies, data is being collected at an ever increasing rate. The need to store and exchange data in meaningful ways in support of data analysis, hypothesis testing and future collaborative use is pervasive. Because trans-disciplinary projects rely on effective use of data from many domains, there is a genuine interest in informatics community on how best to store and combine this data while maintaining a high level of data quality and documentation. The difficulties in sharing and combining raw data become amplified after post-processing and/or data analysis in which the new dataset of interest is a function of the original data and may have been collected by multiple collaborating sites. Simple meta-data, documenting which subject and version of data were used for a particular analysis, becomes complicated by the heterogeneity of the collecting sites yet is critically important to the interpretation and reuse of derived results. This manuscript will present a case study of using the XML-Based Clinical Experiment Data Exchange (XCEDE) schema and the Human Imaging Database (HID) in the Biomedical Informatics Research Network's (BIRN) distributed environment to document and exchange derived data. The discussion includes an overview of the data structures used in both the XML and the database representations, insight into the design considerations, and the extensibility of the design to support additional analysis streams. Frontiers Research Foundation 2009-09-07 /pmc/articles/PMC2759340/ /pubmed/19826494 http://dx.doi.org/10.3389/neuro.11.030.2009 Text en Copyright © 2009 Keator, Wei, Gadde, Bockholt, Grethe, Marcus, Aucoin and Ozyurt. http://www.frontiersin.org/licenseagreement This is an open-access article subject to an exclusive license agreement between the authors and the Frontiers Research Foundation, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are credited. |
spellingShingle | Neuroscience Keator, David B. Wei, Dingying Gadde, Syam Bockholt, Jeremy Grethe, Jeffrey S. Marcus, Daniel Aucoin, Nicole Ozyurt, Ibrahim B. Derived Data Storage and Exchange Workflow for Large-Scale Neuroimaging Analyses on the BIRN Grid |
title | Derived Data Storage and Exchange Workflow for Large-Scale Neuroimaging Analyses on the BIRN Grid |
title_full | Derived Data Storage and Exchange Workflow for Large-Scale Neuroimaging Analyses on the BIRN Grid |
title_fullStr | Derived Data Storage and Exchange Workflow for Large-Scale Neuroimaging Analyses on the BIRN Grid |
title_full_unstemmed | Derived Data Storage and Exchange Workflow for Large-Scale Neuroimaging Analyses on the BIRN Grid |
title_short | Derived Data Storage and Exchange Workflow for Large-Scale Neuroimaging Analyses on the BIRN Grid |
title_sort | derived data storage and exchange workflow for large-scale neuroimaging analyses on the birn grid |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2759340/ https://www.ncbi.nlm.nih.gov/pubmed/19826494 http://dx.doi.org/10.3389/neuro.11.030.2009 |
work_keys_str_mv | AT keatordavidb deriveddatastorageandexchangeworkflowforlargescaleneuroimaginganalysesonthebirngrid AT weidingying deriveddatastorageandexchangeworkflowforlargescaleneuroimaginganalysesonthebirngrid AT gaddesyam deriveddatastorageandexchangeworkflowforlargescaleneuroimaginganalysesonthebirngrid AT bockholtjeremy deriveddatastorageandexchangeworkflowforlargescaleneuroimaginganalysesonthebirngrid AT grethejeffreys deriveddatastorageandexchangeworkflowforlargescaleneuroimaginganalysesonthebirngrid AT marcusdaniel deriveddatastorageandexchangeworkflowforlargescaleneuroimaginganalysesonthebirngrid AT aucoinnicole deriveddatastorageandexchangeworkflowforlargescaleneuroimaginganalysesonthebirngrid AT ozyurtibrahimb deriveddatastorageandexchangeworkflowforlargescaleneuroimaginganalysesonthebirngrid |