Cargando…

Scientist and data architect collaborate to curate and archive an inner ear electrophysiology data collection

In the past scientists reported summaries of their findings; they did not provide their original data collections. Many stakeholders (e.g., funding agencies) are now requesting that such data be made publicly available. This mandate is being adopted to facilitate further discovery, and to mitigate w...

Descripción completa

Detalles Bibliográficos
Autores principales: Farrell, Brenda, Bengtson, Jason
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6799921/
https://www.ncbi.nlm.nih.gov/pubmed/31626635
http://dx.doi.org/10.1371/journal.pone.0223984
_version_ 1783460397098991616
author Farrell, Brenda
Bengtson, Jason
author_facet Farrell, Brenda
Bengtson, Jason
author_sort Farrell, Brenda
collection PubMed
description In the past scientists reported summaries of their findings; they did not provide their original data collections. Many stakeholders (e.g., funding agencies) are now requesting that such data be made publicly available. This mandate is being adopted to facilitate further discovery, and to mitigate waste and deficits in the research process. At the same time, the necessary infrastructure for data curation (e.g., repositories) has been evolving. The current target is to make research products FAIR (Findable, Accessible, Interoperable, Reusable), resulting in data that are curated and archived to be both human and machine compatible. However, most scientists have little training in data curation. Specifically, they are ill-equipped to annotate their data collections at a level that facilitates discoverability, aggregation, and broad reuse in a context separate from their creation or sub-field. To circumvent these deficits data architects may collaborate with scientists to transform and curate data. This paper’s example of a data collection describes the electrical properties of outer hair cells isolated from the mammalian cochlea. The data is expressed with a variant of The Ontology for Biomedical Investigations (OBI), mirrored to provide the metadata and nested data architecture used within the Hierarchical Data Format version 5 (HDF5) format. Each digital specimen is displayed in a tree configuration (like directories in a computer) and consists of six main branches based on the ontology classes. The data collections, scripts, and ontological OWL file (OBI based Inner Ear Electrophysiology (OBI_IEE)) are deposited in three repositories. We discuss the impediments to producing such data collections for public use, and the tools and processes required for effective implementation. This work illustrates the impact that small collaborations can have on the curation of our publicly-funded collections, and is particularly salient for fields where data is sparse, throughput is low, and sacrifice of animals is required for discovery.
format Online
Article
Text
id pubmed-6799921
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-67999212019-10-25 Scientist and data architect collaborate to curate and archive an inner ear electrophysiology data collection Farrell, Brenda Bengtson, Jason PLoS One Research Article In the past scientists reported summaries of their findings; they did not provide their original data collections. Many stakeholders (e.g., funding agencies) are now requesting that such data be made publicly available. This mandate is being adopted to facilitate further discovery, and to mitigate waste and deficits in the research process. At the same time, the necessary infrastructure for data curation (e.g., repositories) has been evolving. The current target is to make research products FAIR (Findable, Accessible, Interoperable, Reusable), resulting in data that are curated and archived to be both human and machine compatible. However, most scientists have little training in data curation. Specifically, they are ill-equipped to annotate their data collections at a level that facilitates discoverability, aggregation, and broad reuse in a context separate from their creation or sub-field. To circumvent these deficits data architects may collaborate with scientists to transform and curate data. This paper’s example of a data collection describes the electrical properties of outer hair cells isolated from the mammalian cochlea. The data is expressed with a variant of The Ontology for Biomedical Investigations (OBI), mirrored to provide the metadata and nested data architecture used within the Hierarchical Data Format version 5 (HDF5) format. Each digital specimen is displayed in a tree configuration (like directories in a computer) and consists of six main branches based on the ontology classes. The data collections, scripts, and ontological OWL file (OBI based Inner Ear Electrophysiology (OBI_IEE)) are deposited in three repositories. We discuss the impediments to producing such data collections for public use, and the tools and processes required for effective implementation. This work illustrates the impact that small collaborations can have on the curation of our publicly-funded collections, and is particularly salient for fields where data is sparse, throughput is low, and sacrifice of animals is required for discovery. Public Library of Science 2019-10-18 /pmc/articles/PMC6799921/ /pubmed/31626635 http://dx.doi.org/10.1371/journal.pone.0223984 Text en © 2019 Farrell, Bengtson http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Farrell, Brenda
Bengtson, Jason
Scientist and data architect collaborate to curate and archive an inner ear electrophysiology data collection
title Scientist and data architect collaborate to curate and archive an inner ear electrophysiology data collection
title_full Scientist and data architect collaborate to curate and archive an inner ear electrophysiology data collection
title_fullStr Scientist and data architect collaborate to curate and archive an inner ear electrophysiology data collection
title_full_unstemmed Scientist and data architect collaborate to curate and archive an inner ear electrophysiology data collection
title_short Scientist and data architect collaborate to curate and archive an inner ear electrophysiology data collection
title_sort scientist and data architect collaborate to curate and archive an inner ear electrophysiology data collection
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6799921/
https://www.ncbi.nlm.nih.gov/pubmed/31626635
http://dx.doi.org/10.1371/journal.pone.0223984
work_keys_str_mv AT farrellbrenda scientistanddataarchitectcollaboratetocurateandarchiveaninnerearelectrophysiologydatacollection
AT bengtsonjason scientistanddataarchitectcollaboratetocurateandarchiveaninnerearelectrophysiologydatacollection