Cargando…
Methods for Specifying Scientific Data Standards and Modeling Relationships with Applications to Neuroscience
Neuroscience continues to experience a tremendous growth in data; in terms of the volume and variety of data, the velocity at which data is acquired, and in turn the veracity of data. These challenges are a serious impediment to sharing of data, analyses, and tools within and across labs. Here, we i...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5095137/ https://www.ncbi.nlm.nih.gov/pubmed/27867355 http://dx.doi.org/10.3389/fninf.2016.00048 |
_version_ | 1782465249661681664 |
---|---|
author | Rübel, Oliver Dougherty, Max Prabhat, Denes, Peter Conant, David Chang, Edward F. Bouchard, Kristofer |
author_facet | Rübel, Oliver Dougherty, Max Prabhat, Denes, Peter Conant, David Chang, Edward F. Bouchard, Kristofer |
author_sort | Rübel, Oliver |
collection | PubMed |
description | Neuroscience continues to experience a tremendous growth in data; in terms of the volume and variety of data, the velocity at which data is acquired, and in turn the veracity of data. These challenges are a serious impediment to sharing of data, analyses, and tools within and across labs. Here, we introduce BRAINformat, a novel data standardization framework for the design and management of scientific data formats. The BRAINformat library defines application-independent design concepts and modules that together create a general framework for standardization of scientific data. We describe the formal specification of scientific data standards, which facilitates sharing and verification of data and formats. We introduce the concept of Managed Objects, enabling semantic components of data formats to be specified as self-contained units, supporting modular and reusable design of data format components and file storage. We also introduce the novel concept of Relationship Attributes for modeling and use of semantic relationships between data objects. Based on these concepts we demonstrate the application of our framework to design and implement a standard format for electrophysiology data and show how data standardization and relationship-modeling facilitate data analysis and sharing. The format uses HDF5, enabling portable, scalable, and self-describing data storage and integration with modern high-performance computing for data-driven discovery. The BRAINformat library is open source, easy-to-use, and provides detailed user and developer documentation and is freely available at: https://bitbucket.org/oruebel/brainformat. |
format | Online Article Text |
id | pubmed-5095137 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-50951372016-11-18 Methods for Specifying Scientific Data Standards and Modeling Relationships with Applications to Neuroscience Rübel, Oliver Dougherty, Max Prabhat, Denes, Peter Conant, David Chang, Edward F. Bouchard, Kristofer Front Neuroinform Neuroscience Neuroscience continues to experience a tremendous growth in data; in terms of the volume and variety of data, the velocity at which data is acquired, and in turn the veracity of data. These challenges are a serious impediment to sharing of data, analyses, and tools within and across labs. Here, we introduce BRAINformat, a novel data standardization framework for the design and management of scientific data formats. The BRAINformat library defines application-independent design concepts and modules that together create a general framework for standardization of scientific data. We describe the formal specification of scientific data standards, which facilitates sharing and verification of data and formats. We introduce the concept of Managed Objects, enabling semantic components of data formats to be specified as self-contained units, supporting modular and reusable design of data format components and file storage. We also introduce the novel concept of Relationship Attributes for modeling and use of semantic relationships between data objects. Based on these concepts we demonstrate the application of our framework to design and implement a standard format for electrophysiology data and show how data standardization and relationship-modeling facilitate data analysis and sharing. The format uses HDF5, enabling portable, scalable, and self-describing data storage and integration with modern high-performance computing for data-driven discovery. The BRAINformat library is open source, easy-to-use, and provides detailed user and developer documentation and is freely available at: https://bitbucket.org/oruebel/brainformat. Frontiers Media S.A. 2016-11-04 /pmc/articles/PMC5095137/ /pubmed/27867355 http://dx.doi.org/10.3389/fninf.2016.00048 Text en Copyright © 2016 Rübel, Dougherty, Prabhat, Denes, Conant, Chang and Bouchard. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Neuroscience Rübel, Oliver Dougherty, Max Prabhat, Denes, Peter Conant, David Chang, Edward F. Bouchard, Kristofer Methods for Specifying Scientific Data Standards and Modeling Relationships with Applications to Neuroscience |
title | Methods for Specifying Scientific Data Standards and Modeling Relationships with Applications to Neuroscience |
title_full | Methods for Specifying Scientific Data Standards and Modeling Relationships with Applications to Neuroscience |
title_fullStr | Methods for Specifying Scientific Data Standards and Modeling Relationships with Applications to Neuroscience |
title_full_unstemmed | Methods for Specifying Scientific Data Standards and Modeling Relationships with Applications to Neuroscience |
title_short | Methods for Specifying Scientific Data Standards and Modeling Relationships with Applications to Neuroscience |
title_sort | methods for specifying scientific data standards and modeling relationships with applications to neuroscience |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5095137/ https://www.ncbi.nlm.nih.gov/pubmed/27867355 http://dx.doi.org/10.3389/fninf.2016.00048 |
work_keys_str_mv | AT rubeloliver methodsforspecifyingscientificdatastandardsandmodelingrelationshipswithapplicationstoneuroscience AT doughertymax methodsforspecifyingscientificdatastandardsandmodelingrelationshipswithapplicationstoneuroscience AT prabhat methodsforspecifyingscientificdatastandardsandmodelingrelationshipswithapplicationstoneuroscience AT denespeter methodsforspecifyingscientificdatastandardsandmodelingrelationshipswithapplicationstoneuroscience AT conantdavid methodsforspecifyingscientificdatastandardsandmodelingrelationshipswithapplicationstoneuroscience AT changedwardf methodsforspecifyingscientificdatastandardsandmodelingrelationshipswithapplicationstoneuroscience AT bouchardkristofer methodsforspecifyingscientificdatastandardsandmodelingrelationshipswithapplicationstoneuroscience |