Cargando…

Fractality and Variability in Canonical and Non-Canonical English Fiction and in Non-Fictional Texts

This study investigates global properties of three categories of English text: canonical fiction, non-canonical fiction, and non-fictional texts. The central hypothesis of the study is that there are systematic differences with respect to structural design features between canonical and non-canonica...

Descripción completa

Detalles Bibliográficos
Autores principales: Mohseni, Mahdi, Gast, Volker, Redies, Christoph
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8044424/
https://www.ncbi.nlm.nih.gov/pubmed/33868078
http://dx.doi.org/10.3389/fpsyg.2021.599063
_version_ 1783678481944543232
author Mohseni, Mahdi
Gast, Volker
Redies, Christoph
author_facet Mohseni, Mahdi
Gast, Volker
Redies, Christoph
author_sort Mohseni, Mahdi
collection PubMed
description This study investigates global properties of three categories of English text: canonical fiction, non-canonical fiction, and non-fictional texts. The central hypothesis of the study is that there are systematic differences with respect to structural design features between canonical and non-canonical fiction, and between fictional and non-fictional texts. To investigate these differences, we compiled a corpus containing texts of the three categories of interest, the Jena Corpus of Expository and Fictional Prose (JEFP Corpus). Two aspects of global structure are investigated, variability and self-similar (fractal) patterns, which reflect long-range correlations along texts. We use four types of basic observations, (i) the frequency of POS-tags per sentence, (ii) sentence length, (iii) lexical diversity, and (iv) the distribution of topic probabilities in segments of texts. These basic observations are grouped into two more general categories, (a) the lower-level properties (i) and (ii), which are observed at the level of the sentence (reflecting linguistic decoding), and (b) the higher-level properties (iii) and (iv), which are observed at the textual level (reflecting comprehension/integration). The observations for each property are transformed into series, which are analyzed in terms of variance and subjected to Multi-Fractal Detrended Fluctuation Analysis (MFDFA), giving rise to three statistics: (i) the degree of fractality ([Formula: see text]), (ii) the degree of multifractality ([Formula: see text]), i.e., the width of the fractal spectrum, and (iii) the degree of asymmetry ([Formula: see text]) of the fractal spectrum. The statistics thus obtained are compared individually across text categories and jointly fed into a classification model (Support Vector Machine). Our results show that there are in fact differences between the three text categories of interest. In general, lower-level text properties are better discriminators than higher-level text properties. Canonical fictional texts differ from non-canonical ones primarily in terms of variability in lower-level text properties. Fractality seems to be a universal feature of text, slightly more pronounced in non-fictional than in fictional texts. On the basis of our results obtained on the basis of corpus data we point out some avenues for future research leading toward a more comprehensive analysis of textual aesthetics, e.g., using experimental methodologies.
format Online
Article
Text
id pubmed-8044424
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-80444242021-04-15 Fractality and Variability in Canonical and Non-Canonical English Fiction and in Non-Fictional Texts Mohseni, Mahdi Gast, Volker Redies, Christoph Front Psychol Psychology This study investigates global properties of three categories of English text: canonical fiction, non-canonical fiction, and non-fictional texts. The central hypothesis of the study is that there are systematic differences with respect to structural design features between canonical and non-canonical fiction, and between fictional and non-fictional texts. To investigate these differences, we compiled a corpus containing texts of the three categories of interest, the Jena Corpus of Expository and Fictional Prose (JEFP Corpus). Two aspects of global structure are investigated, variability and self-similar (fractal) patterns, which reflect long-range correlations along texts. We use four types of basic observations, (i) the frequency of POS-tags per sentence, (ii) sentence length, (iii) lexical diversity, and (iv) the distribution of topic probabilities in segments of texts. These basic observations are grouped into two more general categories, (a) the lower-level properties (i) and (ii), which are observed at the level of the sentence (reflecting linguistic decoding), and (b) the higher-level properties (iii) and (iv), which are observed at the textual level (reflecting comprehension/integration). The observations for each property are transformed into series, which are analyzed in terms of variance and subjected to Multi-Fractal Detrended Fluctuation Analysis (MFDFA), giving rise to three statistics: (i) the degree of fractality ([Formula: see text]), (ii) the degree of multifractality ([Formula: see text]), i.e., the width of the fractal spectrum, and (iii) the degree of asymmetry ([Formula: see text]) of the fractal spectrum. The statistics thus obtained are compared individually across text categories and jointly fed into a classification model (Support Vector Machine). Our results show that there are in fact differences between the three text categories of interest. In general, lower-level text properties are better discriminators than higher-level text properties. Canonical fictional texts differ from non-canonical ones primarily in terms of variability in lower-level text properties. Fractality seems to be a universal feature of text, slightly more pronounced in non-fictional than in fictional texts. On the basis of our results obtained on the basis of corpus data we point out some avenues for future research leading toward a more comprehensive analysis of textual aesthetics, e.g., using experimental methodologies. Frontiers Media S.A. 2021-03-31 /pmc/articles/PMC8044424/ /pubmed/33868078 http://dx.doi.org/10.3389/fpsyg.2021.599063 Text en Copyright © 2021 Mohseni, Gast and Redies. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychology
Mohseni, Mahdi
Gast, Volker
Redies, Christoph
Fractality and Variability in Canonical and Non-Canonical English Fiction and in Non-Fictional Texts
title Fractality and Variability in Canonical and Non-Canonical English Fiction and in Non-Fictional Texts
title_full Fractality and Variability in Canonical and Non-Canonical English Fiction and in Non-Fictional Texts
title_fullStr Fractality and Variability in Canonical and Non-Canonical English Fiction and in Non-Fictional Texts
title_full_unstemmed Fractality and Variability in Canonical and Non-Canonical English Fiction and in Non-Fictional Texts
title_short Fractality and Variability in Canonical and Non-Canonical English Fiction and in Non-Fictional Texts
title_sort fractality and variability in canonical and non-canonical english fiction and in non-fictional texts
topic Psychology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8044424/
https://www.ncbi.nlm.nih.gov/pubmed/33868078
http://dx.doi.org/10.3389/fpsyg.2021.599063
work_keys_str_mv AT mohsenimahdi fractalityandvariabilityincanonicalandnoncanonicalenglishfictionandinnonfictionaltexts
AT gastvolker fractalityandvariabilityincanonicalandnoncanonicalenglishfictionandinnonfictionaltexts
AT redieschristoph fractalityandvariabilityincanonicalandnoncanonicalenglishfictionandinnonfictionaltexts