Cargando…
Your Spreadsheets Can Be FAIR: A Tool and FAIRification Workflow for the eNanoMapper Database
The field of nanoinformatics is rapidly developing and provides data driven solutions in the area of nanomaterials (NM) safety. Safe by Design approaches are encouraged and promoted through regulatory initiatives and multiple scientific projects. Experimental data is at the core of nanoinformatics p...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7601422/ https://www.ncbi.nlm.nih.gov/pubmed/32987901 http://dx.doi.org/10.3390/nano10101908 |
_version_ | 1783603415319838720 |
---|---|
author | Kochev, Nikolay Jeliazkova, Nina Paskaleva, Vesselina Tancheva, Gergana Iliev, Luchesar Ritchie, Peter Jeliazkov, Vedrin |
author_facet | Kochev, Nikolay Jeliazkova, Nina Paskaleva, Vesselina Tancheva, Gergana Iliev, Luchesar Ritchie, Peter Jeliazkov, Vedrin |
author_sort | Kochev, Nikolay |
collection | PubMed |
description | The field of nanoinformatics is rapidly developing and provides data driven solutions in the area of nanomaterials (NM) safety. Safe by Design approaches are encouraged and promoted through regulatory initiatives and multiple scientific projects. Experimental data is at the core of nanoinformatics processing workflows for risk assessment. The nanosafety data is predominantly recorded in Excel spreadsheet files. Although the spreadsheets are quite convenient for the experimentalists, they also pose great challenges for the consequent processing into databases due to variability of the templates used, specific details provided by each laboratory and the need for proper metadata documentation and formatting. In this paper, we present a workflow to facilitate the conversion of spreadsheets into a FAIR (Findable, Accessible, Interoperable, and Reusable) database, with the pivotal aid of the NMDataParser tool, developed to streamline the mapping of the original file layout into the eNanoMapper semantic data model. The NMDataParser is an open source Java library and application, making use of a JSON configuration to define the mapping. We describe the JSON configuration syntax and the approaches applied for parsing different spreadsheet layouts used by the nanosafety community. Examples of using the NMDataParser tool in nanoinformatics workflows are given. Challenging cases are discussed and appropriate solutions are proposed. |
format | Online Article Text |
id | pubmed-7601422 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-76014222020-11-01 Your Spreadsheets Can Be FAIR: A Tool and FAIRification Workflow for the eNanoMapper Database Kochev, Nikolay Jeliazkova, Nina Paskaleva, Vesselina Tancheva, Gergana Iliev, Luchesar Ritchie, Peter Jeliazkov, Vedrin Nanomaterials (Basel) Article The field of nanoinformatics is rapidly developing and provides data driven solutions in the area of nanomaterials (NM) safety. Safe by Design approaches are encouraged and promoted through regulatory initiatives and multiple scientific projects. Experimental data is at the core of nanoinformatics processing workflows for risk assessment. The nanosafety data is predominantly recorded in Excel spreadsheet files. Although the spreadsheets are quite convenient for the experimentalists, they also pose great challenges for the consequent processing into databases due to variability of the templates used, specific details provided by each laboratory and the need for proper metadata documentation and formatting. In this paper, we present a workflow to facilitate the conversion of spreadsheets into a FAIR (Findable, Accessible, Interoperable, and Reusable) database, with the pivotal aid of the NMDataParser tool, developed to streamline the mapping of the original file layout into the eNanoMapper semantic data model. The NMDataParser is an open source Java library and application, making use of a JSON configuration to define the mapping. We describe the JSON configuration syntax and the approaches applied for parsing different spreadsheet layouts used by the nanosafety community. Examples of using the NMDataParser tool in nanoinformatics workflows are given. Challenging cases are discussed and appropriate solutions are proposed. MDPI 2020-09-24 /pmc/articles/PMC7601422/ /pubmed/32987901 http://dx.doi.org/10.3390/nano10101908 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Kochev, Nikolay Jeliazkova, Nina Paskaleva, Vesselina Tancheva, Gergana Iliev, Luchesar Ritchie, Peter Jeliazkov, Vedrin Your Spreadsheets Can Be FAIR: A Tool and FAIRification Workflow for the eNanoMapper Database |
title | Your Spreadsheets Can Be FAIR: A Tool and FAIRification Workflow for the eNanoMapper Database |
title_full | Your Spreadsheets Can Be FAIR: A Tool and FAIRification Workflow for the eNanoMapper Database |
title_fullStr | Your Spreadsheets Can Be FAIR: A Tool and FAIRification Workflow for the eNanoMapper Database |
title_full_unstemmed | Your Spreadsheets Can Be FAIR: A Tool and FAIRification Workflow for the eNanoMapper Database |
title_short | Your Spreadsheets Can Be FAIR: A Tool and FAIRification Workflow for the eNanoMapper Database |
title_sort | your spreadsheets can be fair: a tool and fairification workflow for the enanomapper database |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7601422/ https://www.ncbi.nlm.nih.gov/pubmed/32987901 http://dx.doi.org/10.3390/nano10101908 |
work_keys_str_mv | AT kochevnikolay yourspreadsheetscanbefairatoolandfairificationworkflowfortheenanomapperdatabase AT jeliazkovanina yourspreadsheetscanbefairatoolandfairificationworkflowfortheenanomapperdatabase AT paskalevavesselina yourspreadsheetscanbefairatoolandfairificationworkflowfortheenanomapperdatabase AT tanchevagergana yourspreadsheetscanbefairatoolandfairificationworkflowfortheenanomapperdatabase AT ilievluchesar yourspreadsheetscanbefairatoolandfairificationworkflowfortheenanomapperdatabase AT ritchiepeter yourspreadsheetscanbefairatoolandfairificationworkflowfortheenanomapperdatabase AT jeliazkovvedrin yourspreadsheetscanbefairatoolandfairificationworkflowfortheenanomapperdatabase |