Cargando…

A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences

Data sharing and reuse are crucial to enhance scientific progress and maximize return of investments in science. Although attitudes are increasingly favorable, data reuse remains difficult due to lack of infrastructures, standards, and policies. The FAIR (findable, accessible, interoperable, reusabl...

Descripción completa

Detalles Bibliográficos
Autores principales: Ammar, Ammar, Bonaretti, Serena, Winckers, Laurent, Quik, Joris, Bakker, Martine, Maier, Dieter, Lynch, Iseult, van Rijn, Jeaphianne, Willighagen, Egon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7594074/
https://www.ncbi.nlm.nih.gov/pubmed/33092028
http://dx.doi.org/10.3390/nano10102068
_version_ 1783601549518307328
author Ammar, Ammar
Bonaretti, Serena
Winckers, Laurent
Quik, Joris
Bakker, Martine
Maier, Dieter
Lynch, Iseult
van Rijn, Jeaphianne
Willighagen, Egon
author_facet Ammar, Ammar
Bonaretti, Serena
Winckers, Laurent
Quik, Joris
Bakker, Martine
Maier, Dieter
Lynch, Iseult
van Rijn, Jeaphianne
Willighagen, Egon
author_sort Ammar, Ammar
collection PubMed
description Data sharing and reuse are crucial to enhance scientific progress and maximize return of investments in science. Although attitudes are increasingly favorable, data reuse remains difficult due to lack of infrastructures, standards, and policies. The FAIR (findable, accessible, interoperable, reusable) principles aim to provide recommendations to increase data reuse. Because of the broad interpretation of the FAIR principles, maturity indicators are necessary to determine the FAIRness of a dataset. In this work, we propose a reproducible computational workflow to assess data FAIRness in the life sciences. Our implementation follows principles and guidelines recommended by the maturity indicator authoring group and integrates concepts from the literature. In addition, we propose a FAIR balloon plot to summarize and compare dataset FAIRness. We evaluated the feasibility of our method on three real use cases where researchers looked for six datasets to answer their scientific questions. We retrieved information from repositories (ArrayExpress, Gene Expression Omnibus, eNanoMapper, caNanoLab, NanoCommons and ChEMBL), a registry of repositories, and a searchable resource (Google Dataset Search) via application program interfaces (API) wherever possible. With our analysis, we found that the six datasets met the majority of the criteria defined by the maturity indicators, and we showed areas where improvements can easily be reached. We suggest that use of standard schema for metadata and the presence of specific attributes in registries of repositories could increase FAIRness of datasets.
format Online
Article
Text
id pubmed-7594074
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75940742020-10-30 A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences Ammar, Ammar Bonaretti, Serena Winckers, Laurent Quik, Joris Bakker, Martine Maier, Dieter Lynch, Iseult van Rijn, Jeaphianne Willighagen, Egon Nanomaterials (Basel) Article Data sharing and reuse are crucial to enhance scientific progress and maximize return of investments in science. Although attitudes are increasingly favorable, data reuse remains difficult due to lack of infrastructures, standards, and policies. The FAIR (findable, accessible, interoperable, reusable) principles aim to provide recommendations to increase data reuse. Because of the broad interpretation of the FAIR principles, maturity indicators are necessary to determine the FAIRness of a dataset. In this work, we propose a reproducible computational workflow to assess data FAIRness in the life sciences. Our implementation follows principles and guidelines recommended by the maturity indicator authoring group and integrates concepts from the literature. In addition, we propose a FAIR balloon plot to summarize and compare dataset FAIRness. We evaluated the feasibility of our method on three real use cases where researchers looked for six datasets to answer their scientific questions. We retrieved information from repositories (ArrayExpress, Gene Expression Omnibus, eNanoMapper, caNanoLab, NanoCommons and ChEMBL), a registry of repositories, and a searchable resource (Google Dataset Search) via application program interfaces (API) wherever possible. With our analysis, we found that the six datasets met the majority of the criteria defined by the maturity indicators, and we showed areas where improvements can easily be reached. We suggest that use of standard schema for metadata and the presence of specific attributes in registries of repositories could increase FAIRness of datasets. MDPI 2020-10-20 /pmc/articles/PMC7594074/ /pubmed/33092028 http://dx.doi.org/10.3390/nano10102068 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Ammar, Ammar
Bonaretti, Serena
Winckers, Laurent
Quik, Joris
Bakker, Martine
Maier, Dieter
Lynch, Iseult
van Rijn, Jeaphianne
Willighagen, Egon
A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences
title A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences
title_full A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences
title_fullStr A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences
title_full_unstemmed A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences
title_short A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences
title_sort semi-automated workflow for fair maturity indicators in the life sciences
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7594074/
https://www.ncbi.nlm.nih.gov/pubmed/33092028
http://dx.doi.org/10.3390/nano10102068
work_keys_str_mv AT ammarammar asemiautomatedworkflowforfairmaturityindicatorsinthelifesciences
AT bonarettiserena asemiautomatedworkflowforfairmaturityindicatorsinthelifesciences
AT winckerslaurent asemiautomatedworkflowforfairmaturityindicatorsinthelifesciences
AT quikjoris asemiautomatedworkflowforfairmaturityindicatorsinthelifesciences
AT bakkermartine asemiautomatedworkflowforfairmaturityindicatorsinthelifesciences
AT maierdieter asemiautomatedworkflowforfairmaturityindicatorsinthelifesciences
AT lynchiseult asemiautomatedworkflowforfairmaturityindicatorsinthelifesciences
AT vanrijnjeaphianne asemiautomatedworkflowforfairmaturityindicatorsinthelifesciences
AT willighagenegon asemiautomatedworkflowforfairmaturityindicatorsinthelifesciences
AT ammarammar semiautomatedworkflowforfairmaturityindicatorsinthelifesciences
AT bonarettiserena semiautomatedworkflowforfairmaturityindicatorsinthelifesciences
AT winckerslaurent semiautomatedworkflowforfairmaturityindicatorsinthelifesciences
AT quikjoris semiautomatedworkflowforfairmaturityindicatorsinthelifesciences
AT bakkermartine semiautomatedworkflowforfairmaturityindicatorsinthelifesciences
AT maierdieter semiautomatedworkflowforfairmaturityindicatorsinthelifesciences
AT lynchiseult semiautomatedworkflowforfairmaturityindicatorsinthelifesciences
AT vanrijnjeaphianne semiautomatedworkflowforfairmaturityindicatorsinthelifesciences
AT willighagenegon semiautomatedworkflowforfairmaturityindicatorsinthelifesciences