Cargando…
Semantic Description of Data Mining Datasets: An Ontology-Based Annotation Schema
With the pervasiveness of data mining (DM) in many areas of our society, the management of digital data, readily available for analysis, has become increasingly important. Consequently, nearly all community accepted guidelines and principles (e.g. FAIR and TRUST) for publishing such data in the digi...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7556383/ http://dx.doi.org/10.1007/978-3-030-61527-7_10 |
_version_ | 1783594207232917504 |
---|---|
author | Kostovska, Ana Džeroski, Sašo Panov, Panče |
author_facet | Kostovska, Ana Džeroski, Sašo Panov, Panče |
author_sort | Kostovska, Ana |
collection | PubMed |
description | With the pervasiveness of data mining (DM) in many areas of our society, the management of digital data, readily available for analysis, has become increasingly important. Consequently, nearly all community accepted guidelines and principles (e.g. FAIR and TRUST) for publishing such data in the digital ecosystem, stress the importance of semantic data enhancement. Having rich semantic annotation of DM datasets would support the data mining process at various choice points, such as data understanding, automatic identification of the analysis task, and reasoning over the obtained results. In this paper, we report on the developments of an ontology-based annotation schema for semantic description of DM datasets. The annotation schema combines three different aspects of semantic annotation, i.e., annotation of provenance, data mining specific, and domain-specific information. We demonstrate the utility of these annotations in two use cases: semantic annotation of remote sensing data and data about neurodegenerative diseases. |
format | Online Article Text |
id | pubmed-7556383 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-75563832020-10-15 Semantic Description of Data Mining Datasets: An Ontology-Based Annotation Schema Kostovska, Ana Džeroski, Sašo Panov, Panče Discovery Science Article With the pervasiveness of data mining (DM) in many areas of our society, the management of digital data, readily available for analysis, has become increasingly important. Consequently, nearly all community accepted guidelines and principles (e.g. FAIR and TRUST) for publishing such data in the digital ecosystem, stress the importance of semantic data enhancement. Having rich semantic annotation of DM datasets would support the data mining process at various choice points, such as data understanding, automatic identification of the analysis task, and reasoning over the obtained results. In this paper, we report on the developments of an ontology-based annotation schema for semantic description of DM datasets. The annotation schema combines three different aspects of semantic annotation, i.e., annotation of provenance, data mining specific, and domain-specific information. We demonstrate the utility of these annotations in two use cases: semantic annotation of remote sensing data and data about neurodegenerative diseases. 2020-09-19 /pmc/articles/PMC7556383/ http://dx.doi.org/10.1007/978-3-030-61527-7_10 Text en © The Author(s) 2020 Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made. The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. |
spellingShingle | Article Kostovska, Ana Džeroski, Sašo Panov, Panče Semantic Description of Data Mining Datasets: An Ontology-Based Annotation Schema |
title | Semantic Description of Data Mining Datasets: An Ontology-Based Annotation Schema |
title_full | Semantic Description of Data Mining Datasets: An Ontology-Based Annotation Schema |
title_fullStr | Semantic Description of Data Mining Datasets: An Ontology-Based Annotation Schema |
title_full_unstemmed | Semantic Description of Data Mining Datasets: An Ontology-Based Annotation Schema |
title_short | Semantic Description of Data Mining Datasets: An Ontology-Based Annotation Schema |
title_sort | semantic description of data mining datasets: an ontology-based annotation schema |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7556383/ http://dx.doi.org/10.1007/978-3-030-61527-7_10 |
work_keys_str_mv | AT kostovskaana semanticdescriptionofdataminingdatasetsanontologybasedannotationschema AT dzeroskisaso semanticdescriptionofdataminingdatasetsanontologybasedannotationschema AT panovpance semanticdescriptionofdataminingdatasetsanontologybasedannotationschema |