Cargando…

Comparative Analysis of Preference in Contemporary and Earlier Texts Using Entropy Measures

Research in computational textual aesthetics has shown that there are textual correlates of preference in prose texts. The present study investigates whether textual correlates of preference vary across different time periods (contemporary texts versus texts from the 19th and early 20th centuries)....

Descripción completa

Detalles Bibliográficos
Autores principales: Mohseni, Mahdi, Redies, Christoph, Gast, Volker
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10048171/
https://www.ncbi.nlm.nih.gov/pubmed/36981375
http://dx.doi.org/10.3390/e25030486
_version_ 1785014113539194880
author Mohseni, Mahdi
Redies, Christoph
Gast, Volker
author_facet Mohseni, Mahdi
Redies, Christoph
Gast, Volker
author_sort Mohseni, Mahdi
collection PubMed
description Research in computational textual aesthetics has shown that there are textual correlates of preference in prose texts. The present study investigates whether textual correlates of preference vary across different time periods (contemporary texts versus texts from the 19th and early 20th centuries). Preference is operationalized in different ways for the two periods, in terms of canonization for the earlier texts, and through sales figures for the contemporary texts. As potential textual correlates of preference, we measure degrees of (un)predictability in the distributions of two types of low-level observables, parts of speech and sentence length. Specifically, we calculate two entropy measures, Shannon Entropy as a global measure of unpredictability, and Approximate Entropy as a local measure of surprise (unpredictability in a specific context). Preferred texts from both periods (contemporary bestsellers and canonical earlier texts) are characterized by higher degrees of unpredictability. However, unlike canonicity in the earlier texts, sales figures in contemporary texts are reflected in global (text-level) distributions only (as measured with Shannon Entropy), while surprise in local distributions (as measured with Approximate Entropy) does not have an additional discriminating effect. Our findings thus suggest that there are both time-invariant correlates of preference, and period-specific correlates.
format Online
Article
Text
id pubmed-10048171
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100481712023-03-29 Comparative Analysis of Preference in Contemporary and Earlier Texts Using Entropy Measures Mohseni, Mahdi Redies, Christoph Gast, Volker Entropy (Basel) Article Research in computational textual aesthetics has shown that there are textual correlates of preference in prose texts. The present study investigates whether textual correlates of preference vary across different time periods (contemporary texts versus texts from the 19th and early 20th centuries). Preference is operationalized in different ways for the two periods, in terms of canonization for the earlier texts, and through sales figures for the contemporary texts. As potential textual correlates of preference, we measure degrees of (un)predictability in the distributions of two types of low-level observables, parts of speech and sentence length. Specifically, we calculate two entropy measures, Shannon Entropy as a global measure of unpredictability, and Approximate Entropy as a local measure of surprise (unpredictability in a specific context). Preferred texts from both periods (contemporary bestsellers and canonical earlier texts) are characterized by higher degrees of unpredictability. However, unlike canonicity in the earlier texts, sales figures in contemporary texts are reflected in global (text-level) distributions only (as measured with Shannon Entropy), while surprise in local distributions (as measured with Approximate Entropy) does not have an additional discriminating effect. Our findings thus suggest that there are both time-invariant correlates of preference, and period-specific correlates. MDPI 2023-03-10 /pmc/articles/PMC10048171/ /pubmed/36981375 http://dx.doi.org/10.3390/e25030486 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Mohseni, Mahdi
Redies, Christoph
Gast, Volker
Comparative Analysis of Preference in Contemporary and Earlier Texts Using Entropy Measures
title Comparative Analysis of Preference in Contemporary and Earlier Texts Using Entropy Measures
title_full Comparative Analysis of Preference in Contemporary and Earlier Texts Using Entropy Measures
title_fullStr Comparative Analysis of Preference in Contemporary and Earlier Texts Using Entropy Measures
title_full_unstemmed Comparative Analysis of Preference in Contemporary and Earlier Texts Using Entropy Measures
title_short Comparative Analysis of Preference in Contemporary and Earlier Texts Using Entropy Measures
title_sort comparative analysis of preference in contemporary and earlier texts using entropy measures
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10048171/
https://www.ncbi.nlm.nih.gov/pubmed/36981375
http://dx.doi.org/10.3390/e25030486
work_keys_str_mv AT mohsenimahdi comparativeanalysisofpreferenceincontemporaryandearliertextsusingentropymeasures
AT redieschristoph comparativeanalysisofpreferenceincontemporaryandearliertextsusingentropymeasures
AT gastvolker comparativeanalysisofpreferenceincontemporaryandearliertextsusingentropymeasures