Annotated Flickr dataset for identification of professional photographers

We collected and computed various data and statistics from a sample of Flickr users who uploaded photos to the platform in December 2021 and their photos, obtaining a final number of 27,516 users and 2,647,928 photos. Having the total number of photos uploaded and the number of photos uploaded in De...

Bibliographic Details
Main authors: Gaspar Marco, Rubén, Strukova, Sofia, Gómez Mármol, Félix, Ruipérez-Valiente, José A.
Format: Online Article Text
Language: English
Published: Elsevier 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10481170/
https://www.ncbi.nlm.nih.gov/pubmed/37680346
http://dx.doi.org/10.1016/j.dib.2023.109511
_version_ 1785101916634611712
author Gaspar Marco, Rubén
Strukova, Sofia
Gómez Mármol, Félix
Ruipérez-Valiente, José A.
collection PubMed
description We collected and computed various data and statistics from a sample of Flickr users who uploaded photos to the platform in December 2021 and their photos, obtaining a final number of 27,516 users and 2,647,928 photos. Having the total number of photos uploaded and the number of photos uploaded in December by each user, we selected a representative sample of those whose activity was not overly concentrated in December and obtained data from those who specified their occupation. In addition to the data collected directly from Flickr, we enriched the dataset with new features resulting from the automated analysis of the photos and their comments. One of the most valuable features of this data collection is that each photo has three Image Quality Assessment scores representing aesthetic and technical aspects. For this, we used Convolutional Neural Networks trained with human-labeled data. Furthermore, we added labels to indicate whether the user is a professional photographer, so the data are specially prepared for supervised training.
format Online
Article
Text
id pubmed-10481170
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-10481170 2023-09-07 Annotated Flickr dataset for identification of professional photographers Gaspar Marco, Rubén Strukova, Sofia Gómez Mármol, Félix Ruipérez-Valiente, José A. Data Brief Data Article We collected and computed various data and statistics from a sample of Flickr users who uploaded photos to the platform in December 2021 and their photos, obtaining a final number of 27,516 users and 2,647,928 photos. Having the total number of photos uploaded and the number of photos uploaded in December by each user, we selected a representative sample of those whose activity was not overly concentrated in December and obtained data from those who specified their occupation. In addition to the data collected directly from Flickr, we enriched the dataset with new features resulting from the automated analysis of the photos and their comments. One of the most valuable features of this data collection is that each photo has three Image Quality Assessment scores representing aesthetic and technical aspects. For this, we used Convolutional Neural Networks trained with human-labeled data. Furthermore, we added labels to indicate whether the user is a professional photographer, so the data are specially prepared for supervised training. Elsevier 2023-08-23 /pmc/articles/PMC10481170/ /pubmed/37680346 http://dx.doi.org/10.1016/j.dib.2023.109511 Text en © 2023 The Authors https://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
title Annotated Flickr dataset for identification of professional photographers
topic Data Article