Cargando…
Contrastive Lexical Diffusion Coefficient: Quantifying the Stickiness of the Ordinary
Lexical phenomena, such as clusters of words, disseminate through social networks at different rates but most models of diffusion focus on the discrete adoption of new lexical phenomena (i.e. new topics or memes). It is possible much of lexical diffusion happens via the changing rates of existing wo...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8211116/ https://www.ncbi.nlm.nih.gov/pubmed/34151320 http://dx.doi.org/10.1145/3442381.3449819 |
_version_ | 1783709416017625088 |
---|---|
author | Zamani, Mohammadzaman Schwartz, H. Andrew |
author_facet | Zamani, Mohammadzaman Schwartz, H. Andrew |
author_sort | Zamani, Mohammadzaman |
collection | PubMed |
description | Lexical phenomena, such as clusters of words, disseminate through social networks at different rates but most models of diffusion focus on the discrete adoption of new lexical phenomena (i.e. new topics or memes). It is possible much of lexical diffusion happens via the changing rates of existing word categories or concepts (those that are already being used, at least to some extent, regularly) rather than new ones. In this study we introduce a new metric, contrastive lexical diffusion (CLD) coefficient, which attempts to measure the degree to which ordinary language (here clusters of common words) catch on over friendship connections over time. For instance topics related to meeting and job are found to be sticky, while negative thinking and emotion, and global events, like ‘school orientation’ were found to be less sticky even though they change rates over time. We evaluate CLD coefficient over both quantitative and qualitative tests, studied over 6 years of language on Twitter. We find CLD predicts the spread of tweets and friendship connections, scores converge with human judgments of lexical diffusion (r=0.92), and CLD coefficients replicate across disjoint networks (r=0.85). Comparing CLD scores can help understand lexical diffusion: positive emotion words appear more diffusive than negative emotions, first-person plurals (we) score higher than other pronouns, and numbers and time appear non-contagious. |
format | Online Article Text |
id | pubmed-8211116 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
record_format | MEDLINE/PubMed |
spelling | pubmed-82111162021-06-17 Contrastive Lexical Diffusion Coefficient: Quantifying the Stickiness of the Ordinary Zamani, Mohammadzaman Schwartz, H. Andrew Proc Int World Wide Web Conf Article Lexical phenomena, such as clusters of words, disseminate through social networks at different rates but most models of diffusion focus on the discrete adoption of new lexical phenomena (i.e. new topics or memes). It is possible much of lexical diffusion happens via the changing rates of existing word categories or concepts (those that are already being used, at least to some extent, regularly) rather than new ones. In this study we introduce a new metric, contrastive lexical diffusion (CLD) coefficient, which attempts to measure the degree to which ordinary language (here clusters of common words) catch on over friendship connections over time. For instance topics related to meeting and job are found to be sticky, while negative thinking and emotion, and global events, like ‘school orientation’ were found to be less sticky even though they change rates over time. We evaluate CLD coefficient over both quantitative and qualitative tests, studied over 6 years of language on Twitter. We find CLD predicts the spread of tweets and friendship connections, scores converge with human judgments of lexical diffusion (r=0.92), and CLD coefficients replicate across disjoint networks (r=0.85). Comparing CLD scores can help understand lexical diffusion: positive emotion words appear more diffusive than negative emotions, first-person plurals (we) score higher than other pronouns, and numbers and time appear non-contagious. 2021-04 /pmc/articles/PMC8211116/ /pubmed/34151320 http://dx.doi.org/10.1145/3442381.3449819 Text en https://creativecommons.org/licenses/by/4.0/This paper is published under the Creative Commons Attribution 4.0 International (CC-BY 4.0) license. Authors reserve their rights to disseminate the work on their personal and corporate Web sites with the appropriate attribution. |
spellingShingle | Article Zamani, Mohammadzaman Schwartz, H. Andrew Contrastive Lexical Diffusion Coefficient: Quantifying the Stickiness of the Ordinary |
title | Contrastive Lexical Diffusion Coefficient: Quantifying the Stickiness of the Ordinary |
title_full | Contrastive Lexical Diffusion Coefficient: Quantifying the Stickiness of the Ordinary |
title_fullStr | Contrastive Lexical Diffusion Coefficient: Quantifying the Stickiness of the Ordinary |
title_full_unstemmed | Contrastive Lexical Diffusion Coefficient: Quantifying the Stickiness of the Ordinary |
title_short | Contrastive Lexical Diffusion Coefficient: Quantifying the Stickiness of the Ordinary |
title_sort | contrastive lexical diffusion coefficient: quantifying the stickiness of the ordinary |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8211116/ https://www.ncbi.nlm.nih.gov/pubmed/34151320 http://dx.doi.org/10.1145/3442381.3449819 |
work_keys_str_mv | AT zamanimohammadzaman contrastivelexicaldiffusioncoefficientquantifyingthestickinessoftheordinary AT schwartzhandrew contrastivelexicaldiffusioncoefficientquantifyingthestickinessoftheordinary |