Cargando…

Dataset search in biodiversity research: Do metadata in data repositories reflect scholarly information needs?

The increasing amount of publicly available research data provides the opportunity to link and integrate data in order to create and prove novel hypotheses, to repeat experiments or to compare recent data to data collected at a different time or place. However, recent studies have shown that retriev...

Descripción completa

Detalles Bibliográficos
Autores principales: Löffler, Felicitas, Wesp, Valentin, König-Ries, Birgitta, Klan, Friederike
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7990268/
https://www.ncbi.nlm.nih.gov/pubmed/33760822
http://dx.doi.org/10.1371/journal.pone.0246099
_version_ 1783669042060460032
author Löffler, Felicitas
Wesp, Valentin
König-Ries, Birgitta
Klan, Friederike
author_facet Löffler, Felicitas
Wesp, Valentin
König-Ries, Birgitta
Klan, Friederike
author_sort Löffler, Felicitas
collection PubMed
description The increasing amount of publicly available research data provides the opportunity to link and integrate data in order to create and prove novel hypotheses, to repeat experiments or to compare recent data to data collected at a different time or place. However, recent studies have shown that retrieving relevant data for data reuse is a time-consuming task in daily research practice. In this study, we explore what hampers dataset retrieval in biodiversity research, a field that produces a large amount of heterogeneous data. In particular, we focus on scholarly search interests and metadata, the primary source of data in a dataset retrieval system. We show that existing metadata currently poorly reflect information needs and therefore are the biggest obstacle in retrieving relevant data. Our findings indicate that for data seekers in the biodiversity domain environments, materials and chemicals, species, biological and chemical processes, locations, data parameters and data types are important information categories. These interests are well covered in metadata elements of domain-specific standards. However, instead of utilizing these standards, large data repositories tend to use metadata standards with domain-independent metadata fields that cover search interests only to some extent. A second problem are arbitrary keywords utilized in descriptive fields such as title, description or subject. Keywords support scholars in a full text search only if the provided terms syntactically match or their semantic relationship to terms used in a user query is known.
format Online
Article
Text
id pubmed-7990268
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-79902682021-04-05 Dataset search in biodiversity research: Do metadata in data repositories reflect scholarly information needs? Löffler, Felicitas Wesp, Valentin König-Ries, Birgitta Klan, Friederike PLoS One Research Article The increasing amount of publicly available research data provides the opportunity to link and integrate data in order to create and prove novel hypotheses, to repeat experiments or to compare recent data to data collected at a different time or place. However, recent studies have shown that retrieving relevant data for data reuse is a time-consuming task in daily research practice. In this study, we explore what hampers dataset retrieval in biodiversity research, a field that produces a large amount of heterogeneous data. In particular, we focus on scholarly search interests and metadata, the primary source of data in a dataset retrieval system. We show that existing metadata currently poorly reflect information needs and therefore are the biggest obstacle in retrieving relevant data. Our findings indicate that for data seekers in the biodiversity domain environments, materials and chemicals, species, biological and chemical processes, locations, data parameters and data types are important information categories. These interests are well covered in metadata elements of domain-specific standards. However, instead of utilizing these standards, large data repositories tend to use metadata standards with domain-independent metadata fields that cover search interests only to some extent. A second problem are arbitrary keywords utilized in descriptive fields such as title, description or subject. Keywords support scholars in a full text search only if the provided terms syntactically match or their semantic relationship to terms used in a user query is known. Public Library of Science 2021-03-24 /pmc/articles/PMC7990268/ /pubmed/33760822 http://dx.doi.org/10.1371/journal.pone.0246099 Text en © 2021 Löffler et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Löffler, Felicitas
Wesp, Valentin
König-Ries, Birgitta
Klan, Friederike
Dataset search in biodiversity research: Do metadata in data repositories reflect scholarly information needs?
title Dataset search in biodiversity research: Do metadata in data repositories reflect scholarly information needs?
title_full Dataset search in biodiversity research: Do metadata in data repositories reflect scholarly information needs?
title_fullStr Dataset search in biodiversity research: Do metadata in data repositories reflect scholarly information needs?
title_full_unstemmed Dataset search in biodiversity research: Do metadata in data repositories reflect scholarly information needs?
title_short Dataset search in biodiversity research: Do metadata in data repositories reflect scholarly information needs?
title_sort dataset search in biodiversity research: do metadata in data repositories reflect scholarly information needs?
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7990268/
https://www.ncbi.nlm.nih.gov/pubmed/33760822
http://dx.doi.org/10.1371/journal.pone.0246099
work_keys_str_mv AT lofflerfelicitas datasetsearchinbiodiversityresearchdometadataindatarepositoriesreflectscholarlyinformationneeds
AT wespvalentin datasetsearchinbiodiversityresearchdometadataindatarepositoriesreflectscholarlyinformationneeds
AT konigriesbirgitta datasetsearchinbiodiversityresearchdometadataindatarepositoriesreflectscholarlyinformationneeds
AT klanfriederike datasetsearchinbiodiversityresearchdometadataindatarepositoriesreflectscholarlyinformationneeds