Cargando…

Contrastive Lexical Diffusion Coefficient: Quantifying the Stickiness of the Ordinary

Lexical phenomena, such as clusters of words, disseminate through social networks at different rates but most models of diffusion focus on the discrete adoption of new lexical phenomena (i.e. new topics or memes). It is possible much of lexical diffusion happens via the changing rates of existing wo...

Descripción completa

Detalles Bibliográficos
Autores principales: Zamani, Mohammadzaman, Schwartz, H. Andrew
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8211116/
https://www.ncbi.nlm.nih.gov/pubmed/34151320
http://dx.doi.org/10.1145/3442381.3449819
_version_ 1783709416017625088
author Zamani, Mohammadzaman
Schwartz, H. Andrew
author_facet Zamani, Mohammadzaman
Schwartz, H. Andrew
author_sort Zamani, Mohammadzaman
collection PubMed
description Lexical phenomena, such as clusters of words, disseminate through social networks at different rates but most models of diffusion focus on the discrete adoption of new lexical phenomena (i.e. new topics or memes). It is possible much of lexical diffusion happens via the changing rates of existing word categories or concepts (those that are already being used, at least to some extent, regularly) rather than new ones. In this study we introduce a new metric, contrastive lexical diffusion (CLD) coefficient, which attempts to measure the degree to which ordinary language (here clusters of common words) catch on over friendship connections over time. For instance topics related to meeting and job are found to be sticky, while negative thinking and emotion, and global events, like ‘school orientation’ were found to be less sticky even though they change rates over time. We evaluate CLD coefficient over both quantitative and qualitative tests, studied over 6 years of language on Twitter. We find CLD predicts the spread of tweets and friendship connections, scores converge with human judgments of lexical diffusion (r=0.92), and CLD coefficients replicate across disjoint networks (r=0.85). Comparing CLD scores can help understand lexical diffusion: positive emotion words appear more diffusive than negative emotions, first-person plurals (we) score higher than other pronouns, and numbers and time appear non-contagious.
format Online
Article
Text
id pubmed-8211116
institution National Center for Biotechnology Information
language English
publishDate 2021
record_format MEDLINE/PubMed
spelling pubmed-82111162021-06-17 Contrastive Lexical Diffusion Coefficient: Quantifying the Stickiness of the Ordinary Zamani, Mohammadzaman Schwartz, H. Andrew Proc Int World Wide Web Conf Article Lexical phenomena, such as clusters of words, disseminate through social networks at different rates but most models of diffusion focus on the discrete adoption of new lexical phenomena (i.e. new topics or memes). It is possible much of lexical diffusion happens via the changing rates of existing word categories or concepts (those that are already being used, at least to some extent, regularly) rather than new ones. In this study we introduce a new metric, contrastive lexical diffusion (CLD) coefficient, which attempts to measure the degree to which ordinary language (here clusters of common words) catch on over friendship connections over time. For instance topics related to meeting and job are found to be sticky, while negative thinking and emotion, and global events, like ‘school orientation’ were found to be less sticky even though they change rates over time. We evaluate CLD coefficient over both quantitative and qualitative tests, studied over 6 years of language on Twitter. We find CLD predicts the spread of tweets and friendship connections, scores converge with human judgments of lexical diffusion (r=0.92), and CLD coefficients replicate across disjoint networks (r=0.85). Comparing CLD scores can help understand lexical diffusion: positive emotion words appear more diffusive than negative emotions, first-person plurals (we) score higher than other pronouns, and numbers and time appear non-contagious. 2021-04 /pmc/articles/PMC8211116/ /pubmed/34151320 http://dx.doi.org/10.1145/3442381.3449819 Text en https://creativecommons.org/licenses/by/4.0/This paper is published under the Creative Commons Attribution 4.0 International (CC-BY 4.0) license. Authors reserve their rights to disseminate the work on their personal and corporate Web sites with the appropriate attribution.
spellingShingle Article
Zamani, Mohammadzaman
Schwartz, H. Andrew
Contrastive Lexical Diffusion Coefficient: Quantifying the Stickiness of the Ordinary
title Contrastive Lexical Diffusion Coefficient: Quantifying the Stickiness of the Ordinary
title_full Contrastive Lexical Diffusion Coefficient: Quantifying the Stickiness of the Ordinary
title_fullStr Contrastive Lexical Diffusion Coefficient: Quantifying the Stickiness of the Ordinary
title_full_unstemmed Contrastive Lexical Diffusion Coefficient: Quantifying the Stickiness of the Ordinary
title_short Contrastive Lexical Diffusion Coefficient: Quantifying the Stickiness of the Ordinary
title_sort contrastive lexical diffusion coefficient: quantifying the stickiness of the ordinary
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8211116/
https://www.ncbi.nlm.nih.gov/pubmed/34151320
http://dx.doi.org/10.1145/3442381.3449819
work_keys_str_mv AT zamanimohammadzaman contrastivelexicaldiffusioncoefficientquantifyingthestickinessoftheordinary
AT schwartzhandrew contrastivelexicaldiffusioncoefficientquantifyingthestickinessoftheordinary