Cargando…

WikiArtVectors: Style and Color Representations of Artworks for Cultural Analysis via Information Theoretic Measures

With the increase in massive digitized datasets of cultural artefacts, social and cultural scientists have an unprecedented opportunity for the discovery and expansion of cultural theory. The WikiArt dataset is one such example, with over 250,000 high quality images of historically significant artwo...

Descripción completa

Detalles Bibliográficos
Autores principales: Srinivasa Desikan, Bhargav, Shimao, Hajime, Miton, Helena
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9497612/
https://www.ncbi.nlm.nih.gov/pubmed/36141061
http://dx.doi.org/10.3390/e24091175
_version_ 1784794548805828608
author Srinivasa Desikan, Bhargav
Shimao, Hajime
Miton, Helena
author_facet Srinivasa Desikan, Bhargav
Shimao, Hajime
Miton, Helena
author_sort Srinivasa Desikan, Bhargav
collection PubMed
description With the increase in massive digitized datasets of cultural artefacts, social and cultural scientists have an unprecedented opportunity for the discovery and expansion of cultural theory. The WikiArt dataset is one such example, with over 250,000 high quality images of historically significant artworks by over 3000 artists, ranging from the 15th century to the present day; it is a rich source for the potential mining of patterns and differences among artists, genres, and styles. However, such datasets are often difficult to analyse and use for answering complex questions of cultural evolution and divergence because of their raw formats as image files, which are represented as multi-dimensional tensors/matrices. Recent developments in machine learning, multi-modal data analysis and image processing, however, open the door for us to create representations of images that extract important, domain-specific features from images. Art historians have long emphasised the importance of art style, and the colors used in art, as ways to characterise and retrieve art across genre, style, and artist. In this paper, we release a massive vector-based dataset of paintings (WikiArtVectors), with style representations and color distributions, which provides cultural and social scientists with a framework and database to explore relationships across these two vital dimensions. We use state-of-the-art deep learning and human perceptual color distributions to extract the representations for each painting, and aggregate them across artist, style, and genre. These vector representations and distributions can then be used in tandem with information-theoretic and distance metrics to identify large-scale patterns across art style, genre, and artist. We demonstrate the consistency of these vectors, and provide early explorations, while detailing future work and directions. All of our data and code is publicly available on GitHub.
format Online
Article
Text
id pubmed-9497612
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-94976122022-09-23 WikiArtVectors: Style and Color Representations of Artworks for Cultural Analysis via Information Theoretic Measures Srinivasa Desikan, Bhargav Shimao, Hajime Miton, Helena Entropy (Basel) Article With the increase in massive digitized datasets of cultural artefacts, social and cultural scientists have an unprecedented opportunity for the discovery and expansion of cultural theory. The WikiArt dataset is one such example, with over 250,000 high quality images of historically significant artworks by over 3000 artists, ranging from the 15th century to the present day; it is a rich source for the potential mining of patterns and differences among artists, genres, and styles. However, such datasets are often difficult to analyse and use for answering complex questions of cultural evolution and divergence because of their raw formats as image files, which are represented as multi-dimensional tensors/matrices. Recent developments in machine learning, multi-modal data analysis and image processing, however, open the door for us to create representations of images that extract important, domain-specific features from images. Art historians have long emphasised the importance of art style, and the colors used in art, as ways to characterise and retrieve art across genre, style, and artist. In this paper, we release a massive vector-based dataset of paintings (WikiArtVectors), with style representations and color distributions, which provides cultural and social scientists with a framework and database to explore relationships across these two vital dimensions. We use state-of-the-art deep learning and human perceptual color distributions to extract the representations for each painting, and aggregate them across artist, style, and genre. These vector representations and distributions can then be used in tandem with information-theoretic and distance metrics to identify large-scale patterns across art style, genre, and artist. We demonstrate the consistency of these vectors, and provide early explorations, while detailing future work and directions. All of our data and code is publicly available on GitHub. MDPI 2022-08-23 /pmc/articles/PMC9497612/ /pubmed/36141061 http://dx.doi.org/10.3390/e24091175 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Srinivasa Desikan, Bhargav
Shimao, Hajime
Miton, Helena
WikiArtVectors: Style and Color Representations of Artworks for Cultural Analysis via Information Theoretic Measures
title WikiArtVectors: Style and Color Representations of Artworks for Cultural Analysis via Information Theoretic Measures
title_full WikiArtVectors: Style and Color Representations of Artworks for Cultural Analysis via Information Theoretic Measures
title_fullStr WikiArtVectors: Style and Color Representations of Artworks for Cultural Analysis via Information Theoretic Measures
title_full_unstemmed WikiArtVectors: Style and Color Representations of Artworks for Cultural Analysis via Information Theoretic Measures
title_short WikiArtVectors: Style and Color Representations of Artworks for Cultural Analysis via Information Theoretic Measures
title_sort wikiartvectors: style and color representations of artworks for cultural analysis via information theoretic measures
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9497612/
https://www.ncbi.nlm.nih.gov/pubmed/36141061
http://dx.doi.org/10.3390/e24091175
work_keys_str_mv AT srinivasadesikanbhargav wikiartvectorsstyleandcolorrepresentationsofartworksforculturalanalysisviainformationtheoreticmeasures
AT shimaohajime wikiartvectorsstyleandcolorrepresentationsofartworksforculturalanalysisviainformationtheoreticmeasures
AT mitonhelena wikiartvectorsstyleandcolorrepresentationsofartworksforculturalanalysisviainformationtheoreticmeasures