Semantic Description of Data Mining Datasets: An Ontology-Based Annotation Schema

With the pervasiveness of data mining (DM) in many areas of our society, the management of digital data, readily available for analysis, has become increasingly important. Consequently, nearly all community accepted guidelines and principles (e.g. FAIR and TRUST) for publishing such data in the digi...


Bibliographic Details
Main authors: Kostovska, Ana, Džeroski, Sašo, Panov, Panče
Format: Online Article Text
Language: English
Published: 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7556383/
http://dx.doi.org/10.1007/978-3-030-61527-7_10
author Kostovska, Ana
Džeroski, Sašo
Panov, Panče
collection PubMed
description With the pervasiveness of data mining (DM) in many areas of our society, the management of digital data, readily available for analysis, has become increasingly important. Consequently, nearly all community-accepted guidelines and principles (e.g., FAIR and TRUST) for publishing such data in the digital ecosystem stress the importance of semantic data enhancement. Having rich semantic annotation of DM datasets would support the data mining process at various choice points, such as data understanding, automatic identification of the analysis task, and reasoning over the obtained results. In this paper, we report on the development of an ontology-based annotation schema for semantic description of DM datasets. The annotation schema combines three different aspects of semantic annotation, i.e., annotation of provenance, data mining-specific, and domain-specific information. We demonstrate the utility of these annotations in two use cases: semantic annotation of remote sensing data and data about neurodegenerative diseases.
format Online
Article
Text
id pubmed-7556383
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-7556383 2020-10-15 Semantic Description of Data Mining Datasets: An Ontology-Based Annotation Schema Kostovska, Ana Džeroski, Sašo Panov, Panče Discovery Science Article
2020-09-19 /pmc/articles/PMC7556383/ http://dx.doi.org/10.1007/978-3-030-61527-7_10 Text en
© The Author(s) 2020. Open Access. This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made. The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material.
If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
title Semantic Description of Data Mining Datasets: An Ontology-Based Annotation Schema
topic Article