Cargando…

Methods for Specifying Scientific Data Standards and Modeling Relationships with Applications to Neuroscience

Neuroscience continues to experience a tremendous growth in data; in terms of the volume and variety of data, the velocity at which data is acquired, and in turn the veracity of data. These challenges are a serious impediment to sharing of data, analyses, and tools within and across labs. Here, we i...

Descripción completa

Detalles Bibliográficos
Autores principales: Rübel, Oliver, Dougherty, Max, Prabhat, Denes, Peter, Conant, David, Chang, Edward F., Bouchard, Kristofer
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5095137/
https://www.ncbi.nlm.nih.gov/pubmed/27867355
http://dx.doi.org/10.3389/fninf.2016.00048
_version_ 1782465249661681664
author Rübel, Oliver
Dougherty, Max
Prabhat,
Denes, Peter
Conant, David
Chang, Edward F.
Bouchard, Kristofer
author_facet Rübel, Oliver
Dougherty, Max
Prabhat,
Denes, Peter
Conant, David
Chang, Edward F.
Bouchard, Kristofer
author_sort Rübel, Oliver
collection PubMed
description Neuroscience continues to experience a tremendous growth in data; in terms of the volume and variety of data, the velocity at which data is acquired, and in turn the veracity of data. These challenges are a serious impediment to sharing of data, analyses, and tools within and across labs. Here, we introduce BRAINformat, a novel data standardization framework for the design and management of scientific data formats. The BRAINformat library defines application-independent design concepts and modules that together create a general framework for standardization of scientific data. We describe the formal specification of scientific data standards, which facilitates sharing and verification of data and formats. We introduce the concept of Managed Objects, enabling semantic components of data formats to be specified as self-contained units, supporting modular and reusable design of data format components and file storage. We also introduce the novel concept of Relationship Attributes for modeling and use of semantic relationships between data objects. Based on these concepts we demonstrate the application of our framework to design and implement a standard format for electrophysiology data and show how data standardization and relationship-modeling facilitate data analysis and sharing. The format uses HDF5, enabling portable, scalable, and self-describing data storage and integration with modern high-performance computing for data-driven discovery. The BRAINformat library is open source, easy-to-use, and provides detailed user and developer documentation and is freely available at: https://bitbucket.org/oruebel/brainformat.
format Online
Article
Text
id pubmed-5095137
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-50951372016-11-18 Methods for Specifying Scientific Data Standards and Modeling Relationships with Applications to Neuroscience Rübel, Oliver Dougherty, Max Prabhat, Denes, Peter Conant, David Chang, Edward F. Bouchard, Kristofer Front Neuroinform Neuroscience Neuroscience continues to experience a tremendous growth in data; in terms of the volume and variety of data, the velocity at which data is acquired, and in turn the veracity of data. These challenges are a serious impediment to sharing of data, analyses, and tools within and across labs. Here, we introduce BRAINformat, a novel data standardization framework for the design and management of scientific data formats. The BRAINformat library defines application-independent design concepts and modules that together create a general framework for standardization of scientific data. We describe the formal specification of scientific data standards, which facilitates sharing and verification of data and formats. We introduce the concept of Managed Objects, enabling semantic components of data formats to be specified as self-contained units, supporting modular and reusable design of data format components and file storage. We also introduce the novel concept of Relationship Attributes for modeling and use of semantic relationships between data objects. Based on these concepts we demonstrate the application of our framework to design and implement a standard format for electrophysiology data and show how data standardization and relationship-modeling facilitate data analysis and sharing. The format uses HDF5, enabling portable, scalable, and self-describing data storage and integration with modern high-performance computing for data-driven discovery. The BRAINformat library is open source, easy-to-use, and provides detailed user and developer documentation and is freely available at: https://bitbucket.org/oruebel/brainformat. Frontiers Media S.A. 2016-11-04 /pmc/articles/PMC5095137/ /pubmed/27867355 http://dx.doi.org/10.3389/fninf.2016.00048 Text en Copyright © 2016 Rübel, Dougherty, Prabhat, Denes, Conant, Chang and Bouchard. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Rübel, Oliver
Dougherty, Max
Prabhat,
Denes, Peter
Conant, David
Chang, Edward F.
Bouchard, Kristofer
Methods for Specifying Scientific Data Standards and Modeling Relationships with Applications to Neuroscience
title Methods for Specifying Scientific Data Standards and Modeling Relationships with Applications to Neuroscience
title_full Methods for Specifying Scientific Data Standards and Modeling Relationships with Applications to Neuroscience
title_fullStr Methods for Specifying Scientific Data Standards and Modeling Relationships with Applications to Neuroscience
title_full_unstemmed Methods for Specifying Scientific Data Standards and Modeling Relationships with Applications to Neuroscience
title_short Methods for Specifying Scientific Data Standards and Modeling Relationships with Applications to Neuroscience
title_sort methods for specifying scientific data standards and modeling relationships with applications to neuroscience
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5095137/
https://www.ncbi.nlm.nih.gov/pubmed/27867355
http://dx.doi.org/10.3389/fninf.2016.00048
work_keys_str_mv AT rubeloliver methodsforspecifyingscientificdatastandardsandmodelingrelationshipswithapplicationstoneuroscience
AT doughertymax methodsforspecifyingscientificdatastandardsandmodelingrelationshipswithapplicationstoneuroscience
AT prabhat methodsforspecifyingscientificdatastandardsandmodelingrelationshipswithapplicationstoneuroscience
AT denespeter methodsforspecifyingscientificdatastandardsandmodelingrelationshipswithapplicationstoneuroscience
AT conantdavid methodsforspecifyingscientificdatastandardsandmodelingrelationshipswithapplicationstoneuroscience
AT changedwardf methodsforspecifyingscientificdatastandardsandmodelingrelationshipswithapplicationstoneuroscience
AT bouchardkristofer methodsforspecifyingscientificdatastandardsandmodelingrelationshipswithapplicationstoneuroscience