Cargando…
A catalogue with semantic annotations makes multilabel datasets FAIR
Multilabel classification (MLC) is a machine learning task where the goal is to learn to label an example with multiple labels simultaneously. It receives increasing interest from the machine learning community, as evidenced by the increasing number of papers and methods that appear in the literatur...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9068705/ https://www.ncbi.nlm.nih.gov/pubmed/35508507 http://dx.doi.org/10.1038/s41598-022-11316-3 |
_version_ | 1784700275217399808 |
---|---|
author | Kostovska, Ana Bogatinovski, Jasmin Džeroski, Sašo Kocev, Dragi Panov, Panče |
author_facet | Kostovska, Ana Bogatinovski, Jasmin Džeroski, Sašo Kocev, Dragi Panov, Panče |
author_sort | Kostovska, Ana |
collection | PubMed |
description | Multilabel classification (MLC) is a machine learning task where the goal is to learn to label an example with multiple labels simultaneously. It receives increasing interest from the machine learning community, as evidenced by the increasing number of papers and methods that appear in the literature. Hence, ensuring proper, correct, robust, and trustworthy benchmarking is of utmost importance for the further development of the field. We believe that this can be achieved by adhering to the recently emerged data management standards, such as the FAIR (Findable, Accessible, Interoperable, and Reusable) and TRUST (Transparency, Responsibility, User focus, Sustainability, and Technology) principles. We introduce an ontology-based online catalogue of MLC datasets originating from various application domains following these principles. The catalogue extensively describes many MLC datasets with comprehensible meta-features, MLC-specific semantic descriptions, and different data provenance information. The MLC data catalogue is available at: http://semantichub.ijs.si/MLCdatasets. |
format | Online Article Text |
id | pubmed-9068705 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-90687052022-05-05 A catalogue with semantic annotations makes multilabel datasets FAIR Kostovska, Ana Bogatinovski, Jasmin Džeroski, Sašo Kocev, Dragi Panov, Panče Sci Rep Article Multilabel classification (MLC) is a machine learning task where the goal is to learn to label an example with multiple labels simultaneously. It receives increasing interest from the machine learning community, as evidenced by the increasing number of papers and methods that appear in the literature. Hence, ensuring proper, correct, robust, and trustworthy benchmarking is of utmost importance for the further development of the field. We believe that this can be achieved by adhering to the recently emerged data management standards, such as the FAIR (Findable, Accessible, Interoperable, and Reusable) and TRUST (Transparency, Responsibility, User focus, Sustainability, and Technology) principles. We introduce an ontology-based online catalogue of MLC datasets originating from various application domains following these principles. The catalogue extensively describes many MLC datasets with comprehensible meta-features, MLC-specific semantic descriptions, and different data provenance information. The MLC data catalogue is available at: http://semantichub.ijs.si/MLCdatasets. Nature Publishing Group UK 2022-05-04 /pmc/articles/PMC9068705/ /pubmed/35508507 http://dx.doi.org/10.1038/s41598-022-11316-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Kostovska, Ana Bogatinovski, Jasmin Džeroski, Sašo Kocev, Dragi Panov, Panče A catalogue with semantic annotations makes multilabel datasets FAIR |
title | A catalogue with semantic annotations makes multilabel datasets FAIR |
title_full | A catalogue with semantic annotations makes multilabel datasets FAIR |
title_fullStr | A catalogue with semantic annotations makes multilabel datasets FAIR |
title_full_unstemmed | A catalogue with semantic annotations makes multilabel datasets FAIR |
title_short | A catalogue with semantic annotations makes multilabel datasets FAIR |
title_sort | catalogue with semantic annotations makes multilabel datasets fair |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9068705/ https://www.ncbi.nlm.nih.gov/pubmed/35508507 http://dx.doi.org/10.1038/s41598-022-11316-3 |
work_keys_str_mv | AT kostovskaana acataloguewithsemanticannotationsmakesmultilabeldatasetsfair AT bogatinovskijasmin acataloguewithsemanticannotationsmakesmultilabeldatasetsfair AT dzeroskisaso acataloguewithsemanticannotationsmakesmultilabeldatasetsfair AT kocevdragi acataloguewithsemanticannotationsmakesmultilabeldatasetsfair AT panovpance acataloguewithsemanticannotationsmakesmultilabeldatasetsfair AT kostovskaana cataloguewithsemanticannotationsmakesmultilabeldatasetsfair AT bogatinovskijasmin cataloguewithsemanticannotationsmakesmultilabeldatasetsfair AT dzeroskisaso cataloguewithsemanticannotationsmakesmultilabeldatasetsfair AT kocevdragi cataloguewithsemanticannotationsmakesmultilabeldatasetsfair AT panovpance cataloguewithsemanticannotationsmakesmultilabeldatasetsfair |