Cargando…

Information Density and the Extraposition of German Relative Clauses

This paper aims to find a correlation between Information Density (ID) and extraposition of Relative Clauses (RC) in Early New High German. Since surprisal is connected to perceiving difficulties, the impact on the working memory is lower for frequent combinations with low surprisal-values than it i...

Descripción completa

Detalles Bibliográficos
Autores principales: Voigtmann, Sophia, Speyer, Augustin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8660694/
https://www.ncbi.nlm.nih.gov/pubmed/34899450
http://dx.doi.org/10.3389/fpsyg.2021.650969
_version_ 1784613243179761664
author Voigtmann, Sophia
Speyer, Augustin
author_facet Voigtmann, Sophia
Speyer, Augustin
author_sort Voigtmann, Sophia
collection PubMed
description This paper aims to find a correlation between Information Density (ID) and extraposition of Relative Clauses (RC) in Early New High German. Since surprisal is connected to perceiving difficulties, the impact on the working memory is lower for frequent combinations with low surprisal-values than it is for rare combinations with higher surprisal-values. To improve text comprehension, producers therefore distribute information as evenly as possible across a discourse. Extraposed RC are expected to have a higher surprisal-value than embedded RC. We intend to find evidence for this idea in RC taken from scientific texts from the 17th to 19th century. We built a corpus of tokenized, lemmatized and normalized papers about medicine from the 17th and 19th century, manually determined the RC-variants and calculated a skipgram-Language Model to compute the 2-Skip-bigram surprisal of every word of the relevant sentences. A logistic regression over the summed up surprisal values shows a significant result, which indicates a correlation between surprisal values and extraposition. So, for these periods it can be said that RC are more likely to be extraposed when they have a high total surprisal value. The influence of surprisal values also seems to be stable across time. The comparison of the analyzed language periods shows no significant change.
format Online
Article
Text
id pubmed-8660694
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-86606942021-12-11 Information Density and the Extraposition of German Relative Clauses Voigtmann, Sophia Speyer, Augustin Front Psychol Psychology This paper aims to find a correlation between Information Density (ID) and extraposition of Relative Clauses (RC) in Early New High German. Since surprisal is connected to perceiving difficulties, the impact on the working memory is lower for frequent combinations with low surprisal-values than it is for rare combinations with higher surprisal-values. To improve text comprehension, producers therefore distribute information as evenly as possible across a discourse. Extraposed RC are expected to have a higher surprisal-value than embedded RC. We intend to find evidence for this idea in RC taken from scientific texts from the 17th to 19th century. We built a corpus of tokenized, lemmatized and normalized papers about medicine from the 17th and 19th century, manually determined the RC-variants and calculated a skipgram-Language Model to compute the 2-Skip-bigram surprisal of every word of the relevant sentences. A logistic regression over the summed up surprisal values shows a significant result, which indicates a correlation between surprisal values and extraposition. So, for these periods it can be said that RC are more likely to be extraposed when they have a high total surprisal value. The influence of surprisal values also seems to be stable across time. The comparison of the analyzed language periods shows no significant change. Frontiers Media S.A. 2021-11-26 /pmc/articles/PMC8660694/ /pubmed/34899450 http://dx.doi.org/10.3389/fpsyg.2021.650969 Text en Copyright © 2021 Voigtmann and Speyer. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychology
Voigtmann, Sophia
Speyer, Augustin
Information Density and the Extraposition of German Relative Clauses
title Information Density and the Extraposition of German Relative Clauses
title_full Information Density and the Extraposition of German Relative Clauses
title_fullStr Information Density and the Extraposition of German Relative Clauses
title_full_unstemmed Information Density and the Extraposition of German Relative Clauses
title_short Information Density and the Extraposition of German Relative Clauses
title_sort information density and the extraposition of german relative clauses
topic Psychology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8660694/
https://www.ncbi.nlm.nih.gov/pubmed/34899450
http://dx.doi.org/10.3389/fpsyg.2021.650969
work_keys_str_mv AT voigtmannsophia informationdensityandtheextrapositionofgermanrelativeclauses
AT speyeraugustin informationdensityandtheextrapositionofgermanrelativeclauses