Cargando…
A dataset of pairs of an image and tags for cataloging image-based archives
The dataset described in this paper contains pairs of images collected from the Web and their tags of keywords, which are linked to appropriate entity pages of Wikipedia, and programs to reproduce experiments. It is assumed for evaluating the disambiguation task, in which given an image and its tags...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9679671/ https://www.ncbi.nlm.nih.gov/pubmed/36426020 http://dx.doi.org/10.1016/j.dib.2022.108722 |
_version_ | 1784834247089979392 |
---|---|
author | Suzuki, Tokinori Nagamizo, Kota Ikeda, Daisuke |
author_facet | Suzuki, Tokinori Nagamizo, Kota Ikeda, Daisuke |
author_sort | Suzuki, Tokinori |
collection | PubMed |
description | The dataset described in this paper contains pairs of images collected from the Web and their tags of keywords, which are linked to appropriate entity pages of Wikipedia, and programs to reproduce experiments. It is assumed for evaluating the disambiguation task, in which given an image and its tags to be disambiguated, an appropriate Wikipedia page is selected for each of the given tag. We collected images tagged keywords of animal names for that ambiguity and their tags since animal names may refer to not only names of animal but names of other types of objects, e.g., nicknames of sports teams from the photo sharing site Flickr. The tags are linked to the correspondence Wikipedia page judged by annotators. The dataset includes 420 images and 2,464 tags. It is useful for developing a system to link a keyword of an image to an entry of a knowledgebase as well as an image classification system, which include fine-grained classes, e.g. proper nouns of objects, as their classification targets. |
format | Online Article Text |
id | pubmed-9679671 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-96796712022-11-23 A dataset of pairs of an image and tags for cataloging image-based archives Suzuki, Tokinori Nagamizo, Kota Ikeda, Daisuke Data Brief Data Article The dataset described in this paper contains pairs of images collected from the Web and their tags of keywords, which are linked to appropriate entity pages of Wikipedia, and programs to reproduce experiments. It is assumed for evaluating the disambiguation task, in which given an image and its tags to be disambiguated, an appropriate Wikipedia page is selected for each of the given tag. We collected images tagged keywords of animal names for that ambiguity and their tags since animal names may refer to not only names of animal but names of other types of objects, e.g., nicknames of sports teams from the photo sharing site Flickr. The tags are linked to the correspondence Wikipedia page judged by annotators. The dataset includes 420 images and 2,464 tags. It is useful for developing a system to link a keyword of an image to an entry of a knowledgebase as well as an image classification system, which include fine-grained classes, e.g. proper nouns of objects, as their classification targets. Elsevier 2022-11-04 /pmc/articles/PMC9679671/ /pubmed/36426020 http://dx.doi.org/10.1016/j.dib.2022.108722 Text en © 2022 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Data Article Suzuki, Tokinori Nagamizo, Kota Ikeda, Daisuke A dataset of pairs of an image and tags for cataloging image-based archives |
title | A dataset of pairs of an image and tags for cataloging image-based archives |
title_full | A dataset of pairs of an image and tags for cataloging image-based archives |
title_fullStr | A dataset of pairs of an image and tags for cataloging image-based archives |
title_full_unstemmed | A dataset of pairs of an image and tags for cataloging image-based archives |
title_short | A dataset of pairs of an image and tags for cataloging image-based archives |
title_sort | dataset of pairs of an image and tags for cataloging image-based archives |
topic | Data Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9679671/ https://www.ncbi.nlm.nih.gov/pubmed/36426020 http://dx.doi.org/10.1016/j.dib.2022.108722 |
work_keys_str_mv | AT suzukitokinori adatasetofpairsofanimageandtagsforcatalogingimagebasedarchives AT nagamizokota adatasetofpairsofanimageandtagsforcatalogingimagebasedarchives AT ikedadaisuke adatasetofpairsofanimageandtagsforcatalogingimagebasedarchives AT suzukitokinori datasetofpairsofanimageandtagsforcatalogingimagebasedarchives AT nagamizokota datasetofpairsofanimageandtagsforcatalogingimagebasedarchives AT ikedadaisuke datasetofpairsofanimageandtagsforcatalogingimagebasedarchives |