Cargando…
A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences
Data sharing and reuse are crucial to enhance scientific progress and maximize return of investments in science. Although attitudes are increasingly favorable, data reuse remains difficult due to lack of infrastructures, standards, and policies. The FAIR (findable, accessible, interoperable, reusabl...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7594074/ https://www.ncbi.nlm.nih.gov/pubmed/33092028 http://dx.doi.org/10.3390/nano10102068 |
_version_ | 1783601549518307328 |
---|---|
author | Ammar, Ammar Bonaretti, Serena Winckers, Laurent Quik, Joris Bakker, Martine Maier, Dieter Lynch, Iseult van Rijn, Jeaphianne Willighagen, Egon |
author_facet | Ammar, Ammar Bonaretti, Serena Winckers, Laurent Quik, Joris Bakker, Martine Maier, Dieter Lynch, Iseult van Rijn, Jeaphianne Willighagen, Egon |
author_sort | Ammar, Ammar |
collection | PubMed |
description | Data sharing and reuse are crucial to enhance scientific progress and maximize return of investments in science. Although attitudes are increasingly favorable, data reuse remains difficult due to lack of infrastructures, standards, and policies. The FAIR (findable, accessible, interoperable, reusable) principles aim to provide recommendations to increase data reuse. Because of the broad interpretation of the FAIR principles, maturity indicators are necessary to determine the FAIRness of a dataset. In this work, we propose a reproducible computational workflow to assess data FAIRness in the life sciences. Our implementation follows principles and guidelines recommended by the maturity indicator authoring group and integrates concepts from the literature. In addition, we propose a FAIR balloon plot to summarize and compare dataset FAIRness. We evaluated the feasibility of our method on three real use cases where researchers looked for six datasets to answer their scientific questions. We retrieved information from repositories (ArrayExpress, Gene Expression Omnibus, eNanoMapper, caNanoLab, NanoCommons and ChEMBL), a registry of repositories, and a searchable resource (Google Dataset Search) via application program interfaces (API) wherever possible. With our analysis, we found that the six datasets met the majority of the criteria defined by the maturity indicators, and we showed areas where improvements can easily be reached. We suggest that use of standard schema for metadata and the presence of specific attributes in registries of repositories could increase FAIRness of datasets. |
format | Online Article Text |
id | pubmed-7594074 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-75940742020-10-30 A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences Ammar, Ammar Bonaretti, Serena Winckers, Laurent Quik, Joris Bakker, Martine Maier, Dieter Lynch, Iseult van Rijn, Jeaphianne Willighagen, Egon Nanomaterials (Basel) Article Data sharing and reuse are crucial to enhance scientific progress and maximize return of investments in science. Although attitudes are increasingly favorable, data reuse remains difficult due to lack of infrastructures, standards, and policies. The FAIR (findable, accessible, interoperable, reusable) principles aim to provide recommendations to increase data reuse. Because of the broad interpretation of the FAIR principles, maturity indicators are necessary to determine the FAIRness of a dataset. In this work, we propose a reproducible computational workflow to assess data FAIRness in the life sciences. Our implementation follows principles and guidelines recommended by the maturity indicator authoring group and integrates concepts from the literature. In addition, we propose a FAIR balloon plot to summarize and compare dataset FAIRness. We evaluated the feasibility of our method on three real use cases where researchers looked for six datasets to answer their scientific questions. We retrieved information from repositories (ArrayExpress, Gene Expression Omnibus, eNanoMapper, caNanoLab, NanoCommons and ChEMBL), a registry of repositories, and a searchable resource (Google Dataset Search) via application program interfaces (API) wherever possible. With our analysis, we found that the six datasets met the majority of the criteria defined by the maturity indicators, and we showed areas where improvements can easily be reached. We suggest that use of standard schema for metadata and the presence of specific attributes in registries of repositories could increase FAIRness of datasets. MDPI 2020-10-20 /pmc/articles/PMC7594074/ /pubmed/33092028 http://dx.doi.org/10.3390/nano10102068 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Ammar, Ammar Bonaretti, Serena Winckers, Laurent Quik, Joris Bakker, Martine Maier, Dieter Lynch, Iseult van Rijn, Jeaphianne Willighagen, Egon A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences |
title | A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences |
title_full | A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences |
title_fullStr | A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences |
title_full_unstemmed | A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences |
title_short | A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences |
title_sort | semi-automated workflow for fair maturity indicators in the life sciences |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7594074/ https://www.ncbi.nlm.nih.gov/pubmed/33092028 http://dx.doi.org/10.3390/nano10102068 |
work_keys_str_mv | AT ammarammar asemiautomatedworkflowforfairmaturityindicatorsinthelifesciences AT bonarettiserena asemiautomatedworkflowforfairmaturityindicatorsinthelifesciences AT winckerslaurent asemiautomatedworkflowforfairmaturityindicatorsinthelifesciences AT quikjoris asemiautomatedworkflowforfairmaturityindicatorsinthelifesciences AT bakkermartine asemiautomatedworkflowforfairmaturityindicatorsinthelifesciences AT maierdieter asemiautomatedworkflowforfairmaturityindicatorsinthelifesciences AT lynchiseult asemiautomatedworkflowforfairmaturityindicatorsinthelifesciences AT vanrijnjeaphianne asemiautomatedworkflowforfairmaturityindicatorsinthelifesciences AT willighagenegon asemiautomatedworkflowforfairmaturityindicatorsinthelifesciences AT ammarammar semiautomatedworkflowforfairmaturityindicatorsinthelifesciences AT bonarettiserena semiautomatedworkflowforfairmaturityindicatorsinthelifesciences AT winckerslaurent semiautomatedworkflowforfairmaturityindicatorsinthelifesciences AT quikjoris semiautomatedworkflowforfairmaturityindicatorsinthelifesciences AT bakkermartine semiautomatedworkflowforfairmaturityindicatorsinthelifesciences AT maierdieter semiautomatedworkflowforfairmaturityindicatorsinthelifesciences AT lynchiseult semiautomatedworkflowforfairmaturityindicatorsinthelifesciences AT vanrijnjeaphianne semiautomatedworkflowforfairmaturityindicatorsinthelifesciences AT willighagenegon semiautomatedworkflowforfairmaturityindicatorsinthelifesciences |