Cargando…

A study on surprisal and semantic relatedness for eye-tracking data prediction

Previous research in computational linguistics dedicated a lot of effort to using language modeling and/or distributional semantic models to predict metrics extracted from eye-tracking data. However, it is not clear whether the two components have a distinct contribution, with recent studies claimin...

Descripción completa

Detalles Bibliográficos
Autores principales: Salicchi, Lavinia, Chersoni, Emmanuele, Lenci, Alessandro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9931754/
https://www.ncbi.nlm.nih.gov/pubmed/36818086
http://dx.doi.org/10.3389/fpsyg.2023.1112365
_version_ 1784889302615851008
author Salicchi, Lavinia
Chersoni, Emmanuele
Lenci, Alessandro
author_facet Salicchi, Lavinia
Chersoni, Emmanuele
Lenci, Alessandro
author_sort Salicchi, Lavinia
collection PubMed
description Previous research in computational linguistics dedicated a lot of effort to using language modeling and/or distributional semantic models to predict metrics extracted from eye-tracking data. However, it is not clear whether the two components have a distinct contribution, with recent studies claiming that surprisal scores estimated with large-scale, deep learning-based language models subsume the semantic relatedness component. In our study, we propose a regression experiment for estimating different eye-tracking metrics on two English corpora, contrasting the quality of the predictions with and without the surprisal and the relatedness components. Different types of relatedness scores derived from both static and contextual models have also been tested. Our results suggest that both components play a role in the prediction, with semantic relatedness surprisingly contributing also to the prediction of function words. Moreover, they show that when the metric is computed with the contextual embeddings of the BERT model, it is able to explain a higher amount of variance.
format Online
Article
Text
id pubmed-9931754
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-99317542023-02-17 A study on surprisal and semantic relatedness for eye-tracking data prediction Salicchi, Lavinia Chersoni, Emmanuele Lenci, Alessandro Front Psychol Psychology Previous research in computational linguistics dedicated a lot of effort to using language modeling and/or distributional semantic models to predict metrics extracted from eye-tracking data. However, it is not clear whether the two components have a distinct contribution, with recent studies claiming that surprisal scores estimated with large-scale, deep learning-based language models subsume the semantic relatedness component. In our study, we propose a regression experiment for estimating different eye-tracking metrics on two English corpora, contrasting the quality of the predictions with and without the surprisal and the relatedness components. Different types of relatedness scores derived from both static and contextual models have also been tested. Our results suggest that both components play a role in the prediction, with semantic relatedness surprisingly contributing also to the prediction of function words. Moreover, they show that when the metric is computed with the contextual embeddings of the BERT model, it is able to explain a higher amount of variance. Frontiers Media S.A. 2023-02-02 /pmc/articles/PMC9931754/ /pubmed/36818086 http://dx.doi.org/10.3389/fpsyg.2023.1112365 Text en Copyright © 2023 Salicchi, Chersoni and Lenci. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychology
Salicchi, Lavinia
Chersoni, Emmanuele
Lenci, Alessandro
A study on surprisal and semantic relatedness for eye-tracking data prediction
title A study on surprisal and semantic relatedness for eye-tracking data prediction
title_full A study on surprisal and semantic relatedness for eye-tracking data prediction
title_fullStr A study on surprisal and semantic relatedness for eye-tracking data prediction
title_full_unstemmed A study on surprisal and semantic relatedness for eye-tracking data prediction
title_short A study on surprisal and semantic relatedness for eye-tracking data prediction
title_sort study on surprisal and semantic relatedness for eye-tracking data prediction
topic Psychology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9931754/
https://www.ncbi.nlm.nih.gov/pubmed/36818086
http://dx.doi.org/10.3389/fpsyg.2023.1112365
work_keys_str_mv AT salicchilavinia astudyonsurprisalandsemanticrelatednessforeyetrackingdataprediction
AT chersoniemmanuele astudyonsurprisalandsemanticrelatednessforeyetrackingdataprediction
AT lencialessandro astudyonsurprisalandsemanticrelatednessforeyetrackingdataprediction
AT salicchilavinia studyonsurprisalandsemanticrelatednessforeyetrackingdataprediction
AT chersoniemmanuele studyonsurprisalandsemanticrelatednessforeyetrackingdataprediction
AT lencialessandro studyonsurprisalandsemanticrelatednessforeyetrackingdataprediction