Cargando…

Methodology for using a Bayesian nonparametric model to uncover universal patterns in color naming

Language is an integral part of society which enables communication among its members. To shed light on how words gain their meaning and how their meaning evolves over time, color naming is often used as a case study. The color domain can be defined by a physical space, making it a useful concept fo...

Descripción completa

Detalles Bibliográficos
Autores principales: Joe, Kirbi, Gooyabadi, Maryam
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8720911/
https://www.ncbi.nlm.nih.gov/pubmed/35004206
http://dx.doi.org/10.1016/j.mex.2021.101572
_version_ 1784625225822896128
author Joe, Kirbi
Gooyabadi, Maryam
author_facet Joe, Kirbi
Gooyabadi, Maryam
author_sort Joe, Kirbi
collection PubMed
description Language is an integral part of society which enables communication among its members. To shed light on how words gain their meaning and how their meaning evolves over time, color naming is often used as a case study. The color domain can be defined by a physical space, making it a useful concept for studying denotation of meaning. Though humans can distinguish millions of colors, language provides us with a small, manageable set of terms for categorizing the space. Partitions of the color space vary across different language groups and evolve over time (e.g. new color terms may enter a language). Investigating universal patterns in color naming provides insight into the mechanisms that give rise to the observed data. Recently, computational techniques have been utilized to study this phenomenon. Here, we develop a methodology for transforming a color naming data set—namely, the World Color Survey—which is based on constraints imposed by the stimulus space. This transformed data is used to initialize a nonparametric Bayesian machine learning model in order to implement a culture and theory-independent study of universal color naming patterns across different language groups. All of the methods described are executed by our Python software package called ColorBBDP. • Data from the World Color Survey is transformed from its original format into binary features vectors which can be given as input to the Beta-Bernoulli Dirichlet Process Mixture Model. • This paper provides a specific application of Variational Inference on the Beta-Bernoulli Dirichlet Process Mixture Model towards a color naming data set. • New mathematical measures for performing post-cluster analyses are also detailed in this paper.
format Online
Article
Text
id pubmed-8720911
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-87209112022-01-07 Methodology for using a Bayesian nonparametric model to uncover universal patterns in color naming Joe, Kirbi Gooyabadi, Maryam MethodsX Method Article Language is an integral part of society which enables communication among its members. To shed light on how words gain their meaning and how their meaning evolves over time, color naming is often used as a case study. The color domain can be defined by a physical space, making it a useful concept for studying denotation of meaning. Though humans can distinguish millions of colors, language provides us with a small, manageable set of terms for categorizing the space. Partitions of the color space vary across different language groups and evolve over time (e.g. new color terms may enter a language). Investigating universal patterns in color naming provides insight into the mechanisms that give rise to the observed data. Recently, computational techniques have been utilized to study this phenomenon. Here, we develop a methodology for transforming a color naming data set—namely, the World Color Survey—which is based on constraints imposed by the stimulus space. This transformed data is used to initialize a nonparametric Bayesian machine learning model in order to implement a culture and theory-independent study of universal color naming patterns across different language groups. All of the methods described are executed by our Python software package called ColorBBDP. • Data from the World Color Survey is transformed from its original format into binary features vectors which can be given as input to the Beta-Bernoulli Dirichlet Process Mixture Model. • This paper provides a specific application of Variational Inference on the Beta-Bernoulli Dirichlet Process Mixture Model towards a color naming data set. • New mathematical measures for performing post-cluster analyses are also detailed in this paper. Elsevier 2021-11-02 /pmc/articles/PMC8720911/ /pubmed/35004206 http://dx.doi.org/10.1016/j.mex.2021.101572 Text en © 2021 The Authors. Published by Elsevier B.V. https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Method Article
Joe, Kirbi
Gooyabadi, Maryam
Methodology for using a Bayesian nonparametric model to uncover universal patterns in color naming
title Methodology for using a Bayesian nonparametric model to uncover universal patterns in color naming
title_full Methodology for using a Bayesian nonparametric model to uncover universal patterns in color naming
title_fullStr Methodology for using a Bayesian nonparametric model to uncover universal patterns in color naming
title_full_unstemmed Methodology for using a Bayesian nonparametric model to uncover universal patterns in color naming
title_short Methodology for using a Bayesian nonparametric model to uncover universal patterns in color naming
title_sort methodology for using a bayesian nonparametric model to uncover universal patterns in color naming
topic Method Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8720911/
https://www.ncbi.nlm.nih.gov/pubmed/35004206
http://dx.doi.org/10.1016/j.mex.2021.101572
work_keys_str_mv AT joekirbi methodologyforusingabayesiannonparametricmodeltouncoveruniversalpatternsincolornaming
AT gooyabadimaryam methodologyforusingabayesiannonparametricmodeltouncoveruniversalpatternsincolornaming