Cargando…

An ontology-based approach for developing a harmonised data-validation tool for European cancer registration

BACKGROUND: Population-based cancer registries constitute an important information source in cancer epidemiology. Studies collating and comparing data across regional and national boundaries have proved important for deploying and evaluating effective cancer-control strategies. A critical aspect in...

Descripción completa

Detalles Bibliográficos
Autores principales: Nicholson, Nicholas Charles, Giusti, Francesco, Bettio, Manola, Negrao Carvalho, Raquel, Dimitrova, Nadya, Dyba, Tadeusz, Flego, Manuela, Neamtiu, Luciana, Randi, Giorgia, Martos, Carmen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7789225/
https://www.ncbi.nlm.nih.gov/pubmed/33407816
http://dx.doi.org/10.1186/s13326-020-00233-x
_version_ 1783633194583588864
author Nicholson, Nicholas Charles
Giusti, Francesco
Bettio, Manola
Negrao Carvalho, Raquel
Dimitrova, Nadya
Dyba, Tadeusz
Flego, Manuela
Neamtiu, Luciana
Randi, Giorgia
Martos, Carmen
author_facet Nicholson, Nicholas Charles
Giusti, Francesco
Bettio, Manola
Negrao Carvalho, Raquel
Dimitrova, Nadya
Dyba, Tadeusz
Flego, Manuela
Neamtiu, Luciana
Randi, Giorgia
Martos, Carmen
author_sort Nicholson, Nicholas Charles
collection PubMed
description BACKGROUND: Population-based cancer registries constitute an important information source in cancer epidemiology. Studies collating and comparing data across regional and national boundaries have proved important for deploying and evaluating effective cancer-control strategies. A critical aspect in correctly comparing cancer indicators across regional and national boundaries lies in ensuring a good and harmonised level of data quality, which is a primary motivator for a centralised collection of pseudonymised data. The recent introduction of the European Union’s general data-protection regulation (GDPR) imposes stricter conditions on the collection, processing, and sharing of personal data. It also considers pseudonymised data as personal data. The new regulation motivates the need to find solutions that allow a continuation of the smooth processes leading to harmonised European cancer-registry data. One element in this regard would be the availability of a data-validation software tool based on a formalised depiction of the harmonised data-validation rules, allowing an eventual devolution of the data-validation process to the local level. RESULTS: A semantic data model was derived from the data-validation rules for harmonising cancer-data variables at European level. The data model was encapsulated in an ontology developed using the Web-Ontology Language (OWL) with the data-model entities forming the main OWL classes. The data-validation rules were added as axioms in the ontology. The reasoning function of the resulting ontology demonstrated its ability to trap registry-coding errors and in some instances to be able to correct errors. CONCLUSIONS: Describing the European cancer-registry core data set in terms of an OWL ontology affords a tool based on a formalised set of axioms for validating a cancer-registry’s data set according to harmonised, supra-national rules. The fact that the data checks are inherently linked to the data model would lead to less maintenance overheads and also allow automatic versioning synchronisation, important for distributed data-quality checking processes.
format Online
Article
Text
id pubmed-7789225
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-77892252021-01-07 An ontology-based approach for developing a harmonised data-validation tool for European cancer registration Nicholson, Nicholas Charles Giusti, Francesco Bettio, Manola Negrao Carvalho, Raquel Dimitrova, Nadya Dyba, Tadeusz Flego, Manuela Neamtiu, Luciana Randi, Giorgia Martos, Carmen J Biomed Semantics Software BACKGROUND: Population-based cancer registries constitute an important information source in cancer epidemiology. Studies collating and comparing data across regional and national boundaries have proved important for deploying and evaluating effective cancer-control strategies. A critical aspect in correctly comparing cancer indicators across regional and national boundaries lies in ensuring a good and harmonised level of data quality, which is a primary motivator for a centralised collection of pseudonymised data. The recent introduction of the European Union’s general data-protection regulation (GDPR) imposes stricter conditions on the collection, processing, and sharing of personal data. It also considers pseudonymised data as personal data. The new regulation motivates the need to find solutions that allow a continuation of the smooth processes leading to harmonised European cancer-registry data. One element in this regard would be the availability of a data-validation software tool based on a formalised depiction of the harmonised data-validation rules, allowing an eventual devolution of the data-validation process to the local level. RESULTS: A semantic data model was derived from the data-validation rules for harmonising cancer-data variables at European level. The data model was encapsulated in an ontology developed using the Web-Ontology Language (OWL) with the data-model entities forming the main OWL classes. The data-validation rules were added as axioms in the ontology. The reasoning function of the resulting ontology demonstrated its ability to trap registry-coding errors and in some instances to be able to correct errors. CONCLUSIONS: Describing the European cancer-registry core data set in terms of an OWL ontology affords a tool based on a formalised set of axioms for validating a cancer-registry’s data set according to harmonised, supra-national rules. The fact that the data checks are inherently linked to the data model would lead to less maintenance overheads and also allow automatic versioning synchronisation, important for distributed data-quality checking processes. BioMed Central 2021-01-06 /pmc/articles/PMC7789225/ /pubmed/33407816 http://dx.doi.org/10.1186/s13326-020-00233-x Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Nicholson, Nicholas Charles
Giusti, Francesco
Bettio, Manola
Negrao Carvalho, Raquel
Dimitrova, Nadya
Dyba, Tadeusz
Flego, Manuela
Neamtiu, Luciana
Randi, Giorgia
Martos, Carmen
An ontology-based approach for developing a harmonised data-validation tool for European cancer registration
title An ontology-based approach for developing a harmonised data-validation tool for European cancer registration
title_full An ontology-based approach for developing a harmonised data-validation tool for European cancer registration
title_fullStr An ontology-based approach for developing a harmonised data-validation tool for European cancer registration
title_full_unstemmed An ontology-based approach for developing a harmonised data-validation tool for European cancer registration
title_short An ontology-based approach for developing a harmonised data-validation tool for European cancer registration
title_sort ontology-based approach for developing a harmonised data-validation tool for european cancer registration
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7789225/
https://www.ncbi.nlm.nih.gov/pubmed/33407816
http://dx.doi.org/10.1186/s13326-020-00233-x
work_keys_str_mv AT nicholsonnicholascharles anontologybasedapproachfordevelopingaharmoniseddatavalidationtoolforeuropeancancerregistration
AT giustifrancesco anontologybasedapproachfordevelopingaharmoniseddatavalidationtoolforeuropeancancerregistration
AT bettiomanola anontologybasedapproachfordevelopingaharmoniseddatavalidationtoolforeuropeancancerregistration
AT negraocarvalhoraquel anontologybasedapproachfordevelopingaharmoniseddatavalidationtoolforeuropeancancerregistration
AT dimitrovanadya anontologybasedapproachfordevelopingaharmoniseddatavalidationtoolforeuropeancancerregistration
AT dybatadeusz anontologybasedapproachfordevelopingaharmoniseddatavalidationtoolforeuropeancancerregistration
AT flegomanuela anontologybasedapproachfordevelopingaharmoniseddatavalidationtoolforeuropeancancerregistration
AT neamtiuluciana anontologybasedapproachfordevelopingaharmoniseddatavalidationtoolforeuropeancancerregistration
AT randigiorgia anontologybasedapproachfordevelopingaharmoniseddatavalidationtoolforeuropeancancerregistration
AT martoscarmen anontologybasedapproachfordevelopingaharmoniseddatavalidationtoolforeuropeancancerregistration
AT nicholsonnicholascharles ontologybasedapproachfordevelopingaharmoniseddatavalidationtoolforeuropeancancerregistration
AT giustifrancesco ontologybasedapproachfordevelopingaharmoniseddatavalidationtoolforeuropeancancerregistration
AT bettiomanola ontologybasedapproachfordevelopingaharmoniseddatavalidationtoolforeuropeancancerregistration
AT negraocarvalhoraquel ontologybasedapproachfordevelopingaharmoniseddatavalidationtoolforeuropeancancerregistration
AT dimitrovanadya ontologybasedapproachfordevelopingaharmoniseddatavalidationtoolforeuropeancancerregistration
AT dybatadeusz ontologybasedapproachfordevelopingaharmoniseddatavalidationtoolforeuropeancancerregistration
AT flegomanuela ontologybasedapproachfordevelopingaharmoniseddatavalidationtoolforeuropeancancerregistration
AT neamtiuluciana ontologybasedapproachfordevelopingaharmoniseddatavalidationtoolforeuropeancancerregistration
AT randigiorgia ontologybasedapproachfordevelopingaharmoniseddatavalidationtoolforeuropeancancerregistration
AT martoscarmen ontologybasedapproachfordevelopingaharmoniseddatavalidationtoolforeuropeancancerregistration