Cargando…

A dataset of pairs of an image and tags for cataloging image-based archives

The dataset described in this paper contains pairs of images collected from the Web and their tags of keywords, which are linked to appropriate entity pages of Wikipedia, and programs to reproduce experiments. It is assumed for evaluating the disambiguation task, in which given an image and its tags...

Descripción completa

Detalles Bibliográficos
Autores principales: Suzuki, Tokinori, Nagamizo, Kota, Ikeda, Daisuke
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9679671/
https://www.ncbi.nlm.nih.gov/pubmed/36426020
http://dx.doi.org/10.1016/j.dib.2022.108722
Descripción
Sumario:The dataset described in this paper contains pairs of images collected from the Web and their tags of keywords, which are linked to appropriate entity pages of Wikipedia, and programs to reproduce experiments. It is assumed for evaluating the disambiguation task, in which given an image and its tags to be disambiguated, an appropriate Wikipedia page is selected for each of the given tag. We collected images tagged keywords of animal names for that ambiguity and their tags since animal names may refer to not only names of animal but names of other types of objects, e.g., nicknames of sports teams from the photo sharing site Flickr. The tags are linked to the correspondence Wikipedia page judged by annotators. The dataset includes 420 images and 2,464 tags. It is useful for developing a system to link a keyword of an image to an entry of a knowledgebase as well as an image classification system, which include fine-grained classes, e.g. proper nouns of objects, as their classification targets.