Cargando…

A dataset of pairs of an image and tags for cataloging image-based archives

The dataset described in this paper contains pairs of images collected from the Web and their tags of keywords, which are linked to appropriate entity pages of Wikipedia, and programs to reproduce experiments. It is assumed for evaluating the disambiguation task, in which given an image and its tags...

Descripción completa

Detalles Bibliográficos
Autores principales: Suzuki, Tokinori, Nagamizo, Kota, Ikeda, Daisuke
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9679671/
https://www.ncbi.nlm.nih.gov/pubmed/36426020
http://dx.doi.org/10.1016/j.dib.2022.108722
_version_ 1784834247089979392
author Suzuki, Tokinori
Nagamizo, Kota
Ikeda, Daisuke
author_facet Suzuki, Tokinori
Nagamizo, Kota
Ikeda, Daisuke
author_sort Suzuki, Tokinori
collection PubMed
description The dataset described in this paper contains pairs of images collected from the Web and their tags of keywords, which are linked to appropriate entity pages of Wikipedia, and programs to reproduce experiments. It is assumed for evaluating the disambiguation task, in which given an image and its tags to be disambiguated, an appropriate Wikipedia page is selected for each of the given tag. We collected images tagged keywords of animal names for that ambiguity and their tags since animal names may refer to not only names of animal but names of other types of objects, e.g., nicknames of sports teams from the photo sharing site Flickr. The tags are linked to the correspondence Wikipedia page judged by annotators. The dataset includes 420 images and 2,464 tags. It is useful for developing a system to link a keyword of an image to an entry of a knowledgebase as well as an image classification system, which include fine-grained classes, e.g. proper nouns of objects, as their classification targets.
format Online
Article
Text
id pubmed-9679671
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-96796712022-11-23 A dataset of pairs of an image and tags for cataloging image-based archives Suzuki, Tokinori Nagamizo, Kota Ikeda, Daisuke Data Brief Data Article The dataset described in this paper contains pairs of images collected from the Web and their tags of keywords, which are linked to appropriate entity pages of Wikipedia, and programs to reproduce experiments. It is assumed for evaluating the disambiguation task, in which given an image and its tags to be disambiguated, an appropriate Wikipedia page is selected for each of the given tag. We collected images tagged keywords of animal names for that ambiguity and their tags since animal names may refer to not only names of animal but names of other types of objects, e.g., nicknames of sports teams from the photo sharing site Flickr. The tags are linked to the correspondence Wikipedia page judged by annotators. The dataset includes 420 images and 2,464 tags. It is useful for developing a system to link a keyword of an image to an entry of a knowledgebase as well as an image classification system, which include fine-grained classes, e.g. proper nouns of objects, as their classification targets. Elsevier 2022-11-04 /pmc/articles/PMC9679671/ /pubmed/36426020 http://dx.doi.org/10.1016/j.dib.2022.108722 Text en © 2022 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Data Article
Suzuki, Tokinori
Nagamizo, Kota
Ikeda, Daisuke
A dataset of pairs of an image and tags for cataloging image-based archives
title A dataset of pairs of an image and tags for cataloging image-based archives
title_full A dataset of pairs of an image and tags for cataloging image-based archives
title_fullStr A dataset of pairs of an image and tags for cataloging image-based archives
title_full_unstemmed A dataset of pairs of an image and tags for cataloging image-based archives
title_short A dataset of pairs of an image and tags for cataloging image-based archives
title_sort dataset of pairs of an image and tags for cataloging image-based archives
topic Data Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9679671/
https://www.ncbi.nlm.nih.gov/pubmed/36426020
http://dx.doi.org/10.1016/j.dib.2022.108722
work_keys_str_mv AT suzukitokinori adatasetofpairsofanimageandtagsforcatalogingimagebasedarchives
AT nagamizokota adatasetofpairsofanimageandtagsforcatalogingimagebasedarchives
AT ikedadaisuke adatasetofpairsofanimageandtagsforcatalogingimagebasedarchives
AT suzukitokinori datasetofpairsofanimageandtagsforcatalogingimagebasedarchives
AT nagamizokota datasetofpairsofanimageandtagsforcatalogingimagebasedarchives
AT ikedadaisuke datasetofpairsofanimageandtagsforcatalogingimagebasedarchives