Cargando…
Scientist and data architect collaborate to curate and archive an inner ear electrophysiology data collection
In the past scientists reported summaries of their findings; they did not provide their original data collections. Many stakeholders (e.g., funding agencies) are now requesting that such data be made publicly available. This mandate is being adopted to facilitate further discovery, and to mitigate w...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6799921/ https://www.ncbi.nlm.nih.gov/pubmed/31626635 http://dx.doi.org/10.1371/journal.pone.0223984 |
_version_ | 1783460397098991616 |
---|---|
author | Farrell, Brenda Bengtson, Jason |
author_facet | Farrell, Brenda Bengtson, Jason |
author_sort | Farrell, Brenda |
collection | PubMed |
description | In the past scientists reported summaries of their findings; they did not provide their original data collections. Many stakeholders (e.g., funding agencies) are now requesting that such data be made publicly available. This mandate is being adopted to facilitate further discovery, and to mitigate waste and deficits in the research process. At the same time, the necessary infrastructure for data curation (e.g., repositories) has been evolving. The current target is to make research products FAIR (Findable, Accessible, Interoperable, Reusable), resulting in data that are curated and archived to be both human and machine compatible. However, most scientists have little training in data curation. Specifically, they are ill-equipped to annotate their data collections at a level that facilitates discoverability, aggregation, and broad reuse in a context separate from their creation or sub-field. To circumvent these deficits data architects may collaborate with scientists to transform and curate data. This paper’s example of a data collection describes the electrical properties of outer hair cells isolated from the mammalian cochlea. The data is expressed with a variant of The Ontology for Biomedical Investigations (OBI), mirrored to provide the metadata and nested data architecture used within the Hierarchical Data Format version 5 (HDF5) format. Each digital specimen is displayed in a tree configuration (like directories in a computer) and consists of six main branches based on the ontology classes. The data collections, scripts, and ontological OWL file (OBI based Inner Ear Electrophysiology (OBI_IEE)) are deposited in three repositories. We discuss the impediments to producing such data collections for public use, and the tools and processes required for effective implementation. This work illustrates the impact that small collaborations can have on the curation of our publicly-funded collections, and is particularly salient for fields where data is sparse, throughput is low, and sacrifice of animals is required for discovery. |
format | Online Article Text |
id | pubmed-6799921 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-67999212019-10-25 Scientist and data architect collaborate to curate and archive an inner ear electrophysiology data collection Farrell, Brenda Bengtson, Jason PLoS One Research Article In the past scientists reported summaries of their findings; they did not provide their original data collections. Many stakeholders (e.g., funding agencies) are now requesting that such data be made publicly available. This mandate is being adopted to facilitate further discovery, and to mitigate waste and deficits in the research process. At the same time, the necessary infrastructure for data curation (e.g., repositories) has been evolving. The current target is to make research products FAIR (Findable, Accessible, Interoperable, Reusable), resulting in data that are curated and archived to be both human and machine compatible. However, most scientists have little training in data curation. Specifically, they are ill-equipped to annotate their data collections at a level that facilitates discoverability, aggregation, and broad reuse in a context separate from their creation or sub-field. To circumvent these deficits data architects may collaborate with scientists to transform and curate data. This paper’s example of a data collection describes the electrical properties of outer hair cells isolated from the mammalian cochlea. The data is expressed with a variant of The Ontology for Biomedical Investigations (OBI), mirrored to provide the metadata and nested data architecture used within the Hierarchical Data Format version 5 (HDF5) format. Each digital specimen is displayed in a tree configuration (like directories in a computer) and consists of six main branches based on the ontology classes. The data collections, scripts, and ontological OWL file (OBI based Inner Ear Electrophysiology (OBI_IEE)) are deposited in three repositories. We discuss the impediments to producing such data collections for public use, and the tools and processes required for effective implementation. This work illustrates the impact that small collaborations can have on the curation of our publicly-funded collections, and is particularly salient for fields where data is sparse, throughput is low, and sacrifice of animals is required for discovery. Public Library of Science 2019-10-18 /pmc/articles/PMC6799921/ /pubmed/31626635 http://dx.doi.org/10.1371/journal.pone.0223984 Text en © 2019 Farrell, Bengtson http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Farrell, Brenda Bengtson, Jason Scientist and data architect collaborate to curate and archive an inner ear electrophysiology data collection |
title | Scientist and data architect collaborate to curate and archive an inner ear electrophysiology data collection |
title_full | Scientist and data architect collaborate to curate and archive an inner ear electrophysiology data collection |
title_fullStr | Scientist and data architect collaborate to curate and archive an inner ear electrophysiology data collection |
title_full_unstemmed | Scientist and data architect collaborate to curate and archive an inner ear electrophysiology data collection |
title_short | Scientist and data architect collaborate to curate and archive an inner ear electrophysiology data collection |
title_sort | scientist and data architect collaborate to curate and archive an inner ear electrophysiology data collection |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6799921/ https://www.ncbi.nlm.nih.gov/pubmed/31626635 http://dx.doi.org/10.1371/journal.pone.0223984 |
work_keys_str_mv | AT farrellbrenda scientistanddataarchitectcollaboratetocurateandarchiveaninnerearelectrophysiologydatacollection AT bengtsonjason scientistanddataarchitectcollaboratetocurateandarchiveaninnerearelectrophysiologydatacollection |