Cargando…
Annotated Flickr dataset for identification of professional photographers
We collected and computed various data and statistics from a sample of Flickr users who uploaded photos to the platform in December 2021 and their photos, obtaining a final number of 27,516 users and 2,647,928 photos. Having the total number of photos uploaded and the number of photos uploaded in De...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10481170/ https://www.ncbi.nlm.nih.gov/pubmed/37680346 http://dx.doi.org/10.1016/j.dib.2023.109511 |
_version_ | 1785101916634611712 |
---|---|
author | Gaspar Marco, Rubén Strukova, Sofia Gómez Mármol, Félix Ruipérez-Valiente, José A. |
author_facet | Gaspar Marco, Rubén Strukova, Sofia Gómez Mármol, Félix Ruipérez-Valiente, José A. |
author_sort | Gaspar Marco, Rubén |
collection | PubMed |
description | We collected and computed various data and statistics from a sample of Flickr users who uploaded photos to the platform in December 2021 and their photos, obtaining a final number of 27,516 users and 2,647,928 photos. Having the total number of photos uploaded and the number of photos uploaded in December by each user, we selected a representative sample of those whose activity was not overly concentrated in December and obtained data from those who specified their occupation. In addition to the data collected directly from Flickr, we enriched the dataset with new features resulting from the automated analysis of the photos and their comments. One of the most valuable features of this data collection is that each photo has three Image Quality Assessment scores representing aesthetic and technical aspects. For this, we used Convolutional Neural Networks trained with human-labeled data. Furthermore, we added labels to indicate whether the user is a professional photographer, so the data are specially prepared for supervised training. |
format | Online Article Text |
id | pubmed-10481170 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-104811702023-09-07 Annotated Flickr dataset for identification of professional photographers Gaspar Marco, Rubén Strukova, Sofia Gómez Mármol, Félix Ruipérez-Valiente, José A. Data Brief Data Article We collected and computed various data and statistics from a sample of Flickr users who uploaded photos to the platform in December 2021 and their photos, obtaining a final number of 27,516 users and 2,647,928 photos. Having the total number of photos uploaded and the number of photos uploaded in December by each user, we selected a representative sample of those whose activity was not overly concentrated in December and obtained data from those who specified their occupation. In addition to the data collected directly from Flickr, we enriched the dataset with new features resulting from the automated analysis of the photos and their comments. One of the most valuable features of this data collection is that each photo has three Image Quality Assessment scores representing aesthetic and technical aspects. For this, we used Convolutional Neural Networks trained with human-labeled data. Furthermore, we added labels to indicate whether the user is a professional photographer, so the data are specially prepared for supervised training. Elsevier 2023-08-23 /pmc/articles/PMC10481170/ /pubmed/37680346 http://dx.doi.org/10.1016/j.dib.2023.109511 Text en © 2023 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Data Article Gaspar Marco, Rubén Strukova, Sofia Gómez Mármol, Félix Ruipérez-Valiente, José A. Annotated Flickr dataset for identification of professional photographers |
title | Annotated Flickr dataset for identification of professional photographers |
title_full | Annotated Flickr dataset for identification of professional photographers |
title_fullStr | Annotated Flickr dataset for identification of professional photographers |
title_full_unstemmed | Annotated Flickr dataset for identification of professional photographers |
title_short | Annotated Flickr dataset for identification of professional photographers |
title_sort | annotated flickr dataset for identification of professional photographers |
topic | Data Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10481170/ https://www.ncbi.nlm.nih.gov/pubmed/37680346 http://dx.doi.org/10.1016/j.dib.2023.109511 |
work_keys_str_mv | AT gasparmarcoruben annotatedflickrdatasetforidentificationofprofessionalphotographers AT strukovasofia annotatedflickrdatasetforidentificationofprofessionalphotographers AT gomezmarmolfelix annotatedflickrdatasetforidentificationofprofessionalphotographers AT ruiperezvalientejosea annotatedflickrdatasetforidentificationofprofessionalphotographers |