Cargando…

Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT)

For any research program examining how ambiguous words are processed in broader linguistic contexts, a first step is to establish factors relating to the frequency balance or dominance of those words’ multiple meanings, as well as the similarity of those meanings to one other. Homonyms—words with di...

Descripción completa

Detalles Bibliográficos
Autores principales: DeLong, Katherine A., Trott, Sean, Kutas, Marta
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10040203/
https://www.ncbi.nlm.nih.gov/pubmed/35689168
http://dx.doi.org/10.3758/s13428-022-01869-6
_version_ 1784912431550562304
author DeLong, Katherine A.
Trott, Sean
Kutas, Marta
author_facet DeLong, Katherine A.
Trott, Sean
Kutas, Marta
author_sort DeLong, Katherine A.
collection PubMed
description For any research program examining how ambiguous words are processed in broader linguistic contexts, a first step is to establish factors relating to the frequency balance or dominance of those words’ multiple meanings, as well as the similarity of those meanings to one other. Homonyms—words with divergent meanings—are one ambiguous word type commonly utilized in psycholinguistic research. In contrast, although polysemes—words with multiple related senses—are far more common in English, they have been less frequently used as tools for understanding one-to-many word-to-meaning mappings. The current paper details two norming studies of a relatively large number of ambiguous English words. In the first, offline dominance norming is detailed for 547 homonyms and polysemes via a free association task suitable for words across the ambiguity continuum, with a goal of identifying words with more equibiased meanings. The second norming assesses offline meaning similarity for a partial subset of 318 ambiguous words (including homonyms, unambiguous words, and polysemes divided into regular and irregular types) using a novel, continuous rating method reliant on the linguistic phenomenon of zeugma. In addition, we conduct computational analyses on the human similarity norming data using the BERT pretrained neural language model (Devlin et al., 2018, BERT: Pre-training of deep bidirectional transformers for language understanding. ArXiv Preprint. arXiv:1810.04805) to evaluate factors that may explain variance beyond that accounted for by dictionary-criteria ambiguity categories. Finally, we make available the summarized item dominance values and similarity ratings in resultant appendices (see supplementary material), as well as individual item and participant norming data, which can be accessed online (https://osf.io/g7fmv/). SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.3758/s13428-022-01869-6.
format Online
Article
Text
id pubmed-10040203
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-100402032023-06-10 Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT) DeLong, Katherine A. Trott, Sean Kutas, Marta Behav Res Methods Article For any research program examining how ambiguous words are processed in broader linguistic contexts, a first step is to establish factors relating to the frequency balance or dominance of those words’ multiple meanings, as well as the similarity of those meanings to one other. Homonyms—words with divergent meanings—are one ambiguous word type commonly utilized in psycholinguistic research. In contrast, although polysemes—words with multiple related senses—are far more common in English, they have been less frequently used as tools for understanding one-to-many word-to-meaning mappings. The current paper details two norming studies of a relatively large number of ambiguous English words. In the first, offline dominance norming is detailed for 547 homonyms and polysemes via a free association task suitable for words across the ambiguity continuum, with a goal of identifying words with more equibiased meanings. The second norming assesses offline meaning similarity for a partial subset of 318 ambiguous words (including homonyms, unambiguous words, and polysemes divided into regular and irregular types) using a novel, continuous rating method reliant on the linguistic phenomenon of zeugma. In addition, we conduct computational analyses on the human similarity norming data using the BERT pretrained neural language model (Devlin et al., 2018, BERT: Pre-training of deep bidirectional transformers for language understanding. ArXiv Preprint. arXiv:1810.04805) to evaluate factors that may explain variance beyond that accounted for by dictionary-criteria ambiguity categories. Finally, we make available the summarized item dominance values and similarity ratings in resultant appendices (see supplementary material), as well as individual item and participant norming data, which can be accessed online (https://osf.io/g7fmv/). SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.3758/s13428-022-01869-6. Springer US 2022-06-10 2023 /pmc/articles/PMC10040203/ /pubmed/35689168 http://dx.doi.org/10.3758/s13428-022-01869-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
DeLong, Katherine A.
Trott, Sean
Kutas, Marta
Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT)
title Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT)
title_full Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT)
title_fullStr Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT)
title_full_unstemmed Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT)
title_short Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT)
title_sort offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (bert)
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10040203/
https://www.ncbi.nlm.nih.gov/pubmed/35689168
http://dx.doi.org/10.3758/s13428-022-01869-6
work_keys_str_mv AT delongkatherinea offlinedominanceandzeugmaticsimilaritynormingsofvariablyambiguouswordsassessedagainstaneurallanguagemodelbert
AT trottsean offlinedominanceandzeugmaticsimilaritynormingsofvariablyambiguouswordsassessedagainstaneurallanguagemodelbert
AT kutasmarta offlinedominanceandzeugmaticsimilaritynormingsofvariablyambiguouswordsassessedagainstaneurallanguagemodelbert