Cargando…

Your Spreadsheets Can Be FAIR: A Tool and FAIRification Workflow for the eNanoMapper Database

The field of nanoinformatics is rapidly developing and provides data driven solutions in the area of nanomaterials (NM) safety. Safe by Design approaches are encouraged and promoted through regulatory initiatives and multiple scientific projects. Experimental data is at the core of nanoinformatics p...

Descripción completa

Detalles Bibliográficos
Autores principales: Kochev, Nikolay, Jeliazkova, Nina, Paskaleva, Vesselina, Tancheva, Gergana, Iliev, Luchesar, Ritchie, Peter, Jeliazkov, Vedrin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7601422/
https://www.ncbi.nlm.nih.gov/pubmed/32987901
http://dx.doi.org/10.3390/nano10101908
_version_ 1783603415319838720
author Kochev, Nikolay
Jeliazkova, Nina
Paskaleva, Vesselina
Tancheva, Gergana
Iliev, Luchesar
Ritchie, Peter
Jeliazkov, Vedrin
author_facet Kochev, Nikolay
Jeliazkova, Nina
Paskaleva, Vesselina
Tancheva, Gergana
Iliev, Luchesar
Ritchie, Peter
Jeliazkov, Vedrin
author_sort Kochev, Nikolay
collection PubMed
description The field of nanoinformatics is rapidly developing and provides data driven solutions in the area of nanomaterials (NM) safety. Safe by Design approaches are encouraged and promoted through regulatory initiatives and multiple scientific projects. Experimental data is at the core of nanoinformatics processing workflows for risk assessment. The nanosafety data is predominantly recorded in Excel spreadsheet files. Although the spreadsheets are quite convenient for the experimentalists, they also pose great challenges for the consequent processing into databases due to variability of the templates used, specific details provided by each laboratory and the need for proper metadata documentation and formatting. In this paper, we present a workflow to facilitate the conversion of spreadsheets into a FAIR (Findable, Accessible, Interoperable, and Reusable) database, with the pivotal aid of the NMDataParser tool, developed to streamline the mapping of the original file layout into the eNanoMapper semantic data model. The NMDataParser is an open source Java library and application, making use of a JSON configuration to define the mapping. We describe the JSON configuration syntax and the approaches applied for parsing different spreadsheet layouts used by the nanosafety community. Examples of using the NMDataParser tool in nanoinformatics workflows are given. Challenging cases are discussed and appropriate solutions are proposed.
format Online
Article
Text
id pubmed-7601422
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-76014222020-11-01 Your Spreadsheets Can Be FAIR: A Tool and FAIRification Workflow for the eNanoMapper Database Kochev, Nikolay Jeliazkova, Nina Paskaleva, Vesselina Tancheva, Gergana Iliev, Luchesar Ritchie, Peter Jeliazkov, Vedrin Nanomaterials (Basel) Article The field of nanoinformatics is rapidly developing and provides data driven solutions in the area of nanomaterials (NM) safety. Safe by Design approaches are encouraged and promoted through regulatory initiatives and multiple scientific projects. Experimental data is at the core of nanoinformatics processing workflows for risk assessment. The nanosafety data is predominantly recorded in Excel spreadsheet files. Although the spreadsheets are quite convenient for the experimentalists, they also pose great challenges for the consequent processing into databases due to variability of the templates used, specific details provided by each laboratory and the need for proper metadata documentation and formatting. In this paper, we present a workflow to facilitate the conversion of spreadsheets into a FAIR (Findable, Accessible, Interoperable, and Reusable) database, with the pivotal aid of the NMDataParser tool, developed to streamline the mapping of the original file layout into the eNanoMapper semantic data model. The NMDataParser is an open source Java library and application, making use of a JSON configuration to define the mapping. We describe the JSON configuration syntax and the approaches applied for parsing different spreadsheet layouts used by the nanosafety community. Examples of using the NMDataParser tool in nanoinformatics workflows are given. Challenging cases are discussed and appropriate solutions are proposed. MDPI 2020-09-24 /pmc/articles/PMC7601422/ /pubmed/32987901 http://dx.doi.org/10.3390/nano10101908 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kochev, Nikolay
Jeliazkova, Nina
Paskaleva, Vesselina
Tancheva, Gergana
Iliev, Luchesar
Ritchie, Peter
Jeliazkov, Vedrin
Your Spreadsheets Can Be FAIR: A Tool and FAIRification Workflow for the eNanoMapper Database
title Your Spreadsheets Can Be FAIR: A Tool and FAIRification Workflow for the eNanoMapper Database
title_full Your Spreadsheets Can Be FAIR: A Tool and FAIRification Workflow for the eNanoMapper Database
title_fullStr Your Spreadsheets Can Be FAIR: A Tool and FAIRification Workflow for the eNanoMapper Database
title_full_unstemmed Your Spreadsheets Can Be FAIR: A Tool and FAIRification Workflow for the eNanoMapper Database
title_short Your Spreadsheets Can Be FAIR: A Tool and FAIRification Workflow for the eNanoMapper Database
title_sort your spreadsheets can be fair: a tool and fairification workflow for the enanomapper database
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7601422/
https://www.ncbi.nlm.nih.gov/pubmed/32987901
http://dx.doi.org/10.3390/nano10101908
work_keys_str_mv AT kochevnikolay yourspreadsheetscanbefairatoolandfairificationworkflowfortheenanomapperdatabase
AT jeliazkovanina yourspreadsheetscanbefairatoolandfairificationworkflowfortheenanomapperdatabase
AT paskalevavesselina yourspreadsheetscanbefairatoolandfairificationworkflowfortheenanomapperdatabase
AT tanchevagergana yourspreadsheetscanbefairatoolandfairificationworkflowfortheenanomapperdatabase
AT ilievluchesar yourspreadsheetscanbefairatoolandfairificationworkflowfortheenanomapperdatabase
AT ritchiepeter yourspreadsheetscanbefairatoolandfairificationworkflowfortheenanomapperdatabase
AT jeliazkovvedrin yourspreadsheetscanbefairatoolandfairificationworkflowfortheenanomapperdatabase