Cargando…

On the Correlation of Context-Aware Language Models With the Intelligibility of Polish Target Words to Czech Readers

This contribution seeks to provide a rational probabilistic explanation for the intelligibility of words in a genetically related language that is unknown to the reader, a phenomenon referred to as intercomprehension. In this research domain, linguistic distance, among other factors, was proved to c...

Descripción completa

Detalles Bibliográficos
Autores principales:	Jágrová, Klára, Hedderich, Michael, Mosbach, Marius, Avgustinova, Tania, Klakow, Dietrich
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2021
Materias:	Psychology
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8278517/ https://www.ncbi.nlm.nih.gov/pubmed/34276486 http://dx.doi.org/10.3389/fpsyg.2021.662277

_version_	1783722277588697088
author	Jágrová, Klára Hedderich, Michael Mosbach, Marius Avgustinova, Tania Klakow, Dietrich
author_facet	Jágrová, Klára Hedderich, Michael Mosbach, Marius Avgustinova, Tania Klakow, Dietrich
author_sort	Jágrová, Klára
collection	PubMed
description	This contribution seeks to provide a rational probabilistic explanation for the intelligibility of words in a genetically related language that is unknown to the reader, a phenomenon referred to as intercomprehension. In this research domain, linguistic distance, among other factors, was proved to correlate well with the mutual intelligibility of individual words. However, the role of context for the intelligibility of target words in sentences was subject to very few studies. To address this, we analyze data from web-based experiments in which Czech (CS) respondents were asked to translate highly predictable target words at the final position of Polish sentences. We compare correlations of target word intelligibility with data from 3-g language models (LMs) to their correlations with data obtained from context-aware LMs. More specifically, we evaluate two context-aware LM architectures: Long Short-Term Memory (LSTMs) that can, theoretically, take infinitely long-distance dependencies into account and Transformer-based LMs which can access the whole input sequence at the same time. We investigate how their use of context affects surprisal and its correlation with intelligibility.
format	Online Article Text
id	pubmed-8278517
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-82785172021-07-15 On the Correlation of Context-Aware Language Models With the Intelligibility of Polish Target Words to Czech Readers Jágrová, Klára Hedderich, Michael Mosbach, Marius Avgustinova, Tania Klakow, Dietrich Front Psychol Psychology This contribution seeks to provide a rational probabilistic explanation for the intelligibility of words in a genetically related language that is unknown to the reader, a phenomenon referred to as intercomprehension. In this research domain, linguistic distance, among other factors, was proved to correlate well with the mutual intelligibility of individual words. However, the role of context for the intelligibility of target words in sentences was subject to very few studies. To address this, we analyze data from web-based experiments in which Czech (CS) respondents were asked to translate highly predictable target words at the final position of Polish sentences. We compare correlations of target word intelligibility with data from 3-g language models (LMs) to their correlations with data obtained from context-aware LMs. More specifically, we evaluate two context-aware LM architectures: Long Short-Term Memory (LSTMs) that can, theoretically, take infinitely long-distance dependencies into account and Transformer-based LMs which can access the whole input sequence at the same time. We investigate how their use of context affects surprisal and its correlation with intelligibility. Frontiers Media S.A. 2021-06-30 /pmc/articles/PMC8278517/ /pubmed/34276486 http://dx.doi.org/10.3389/fpsyg.2021.662277 Text en Copyright © 2021 Jágrová, Hedderich, Mosbach, Avgustinova and Klakow. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Psychology Jágrová, Klára Hedderich, Michael Mosbach, Marius Avgustinova, Tania Klakow, Dietrich On the Correlation of Context-Aware Language Models With the Intelligibility of Polish Target Words to Czech Readers
title	On the Correlation of Context-Aware Language Models With the Intelligibility of Polish Target Words to Czech Readers
title_full	On the Correlation of Context-Aware Language Models With the Intelligibility of Polish Target Words to Czech Readers
title_fullStr	On the Correlation of Context-Aware Language Models With the Intelligibility of Polish Target Words to Czech Readers
title_full_unstemmed	On the Correlation of Context-Aware Language Models With the Intelligibility of Polish Target Words to Czech Readers
title_short	On the Correlation of Context-Aware Language Models With the Intelligibility of Polish Target Words to Czech Readers
title_sort	on the correlation of context-aware language models with the intelligibility of polish target words to czech readers
topic	Psychology
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8278517/ https://www.ncbi.nlm.nih.gov/pubmed/34276486 http://dx.doi.org/10.3389/fpsyg.2021.662277
work_keys_str_mv	AT jagrovaklara onthecorrelationofcontextawarelanguagemodelswiththeintelligibilityofpolishtargetwordstoczechreaders AT hedderichmichael onthecorrelationofcontextawarelanguagemodelswiththeintelligibilityofpolishtargetwordstoczechreaders AT mosbachmarius onthecorrelationofcontextawarelanguagemodelswiththeintelligibilityofpolishtargetwordstoczechreaders AT avgustinovatania onthecorrelationofcontextawarelanguagemodelswiththeintelligibilityofpolishtargetwordstoczechreaders AT klakowdietrich onthecorrelationofcontextawarelanguagemodelswiththeintelligibilityofpolishtargetwordstoczechreaders

On the Correlation of Context-Aware Language Models With the Intelligibility of Polish Target Words to Czech Readers

Ejemplares similares