Cargando…
Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT)
For any research program examining how ambiguous words are processed in broader linguistic contexts, a first step is to establish factors relating to the frequency balance or dominance of those words’ multiple meanings, as well as the similarity of those meanings to one other. Homonyms—words with di...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer US
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10040203/ https://www.ncbi.nlm.nih.gov/pubmed/35689168 http://dx.doi.org/10.3758/s13428-022-01869-6 |
_version_ | 1784912431550562304 |
---|---|
author | DeLong, Katherine A. Trott, Sean Kutas, Marta |
author_facet | DeLong, Katherine A. Trott, Sean Kutas, Marta |
author_sort | DeLong, Katherine A. |
collection | PubMed |
description | For any research program examining how ambiguous words are processed in broader linguistic contexts, a first step is to establish factors relating to the frequency balance or dominance of those words’ multiple meanings, as well as the similarity of those meanings to one other. Homonyms—words with divergent meanings—are one ambiguous word type commonly utilized in psycholinguistic research. In contrast, although polysemes—words with multiple related senses—are far more common in English, they have been less frequently used as tools for understanding one-to-many word-to-meaning mappings. The current paper details two norming studies of a relatively large number of ambiguous English words. In the first, offline dominance norming is detailed for 547 homonyms and polysemes via a free association task suitable for words across the ambiguity continuum, with a goal of identifying words with more equibiased meanings. The second norming assesses offline meaning similarity for a partial subset of 318 ambiguous words (including homonyms, unambiguous words, and polysemes divided into regular and irregular types) using a novel, continuous rating method reliant on the linguistic phenomenon of zeugma. In addition, we conduct computational analyses on the human similarity norming data using the BERT pretrained neural language model (Devlin et al., 2018, BERT: Pre-training of deep bidirectional transformers for language understanding. ArXiv Preprint. arXiv:1810.04805) to evaluate factors that may explain variance beyond that accounted for by dictionary-criteria ambiguity categories. Finally, we make available the summarized item dominance values and similarity ratings in resultant appendices (see supplementary material), as well as individual item and participant norming data, which can be accessed online (https://osf.io/g7fmv/). SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.3758/s13428-022-01869-6. |
format | Online Article Text |
id | pubmed-10040203 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer US |
record_format | MEDLINE/PubMed |
spelling | pubmed-100402032023-06-10 Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT) DeLong, Katherine A. Trott, Sean Kutas, Marta Behav Res Methods Article For any research program examining how ambiguous words are processed in broader linguistic contexts, a first step is to establish factors relating to the frequency balance or dominance of those words’ multiple meanings, as well as the similarity of those meanings to one other. Homonyms—words with divergent meanings—are one ambiguous word type commonly utilized in psycholinguistic research. In contrast, although polysemes—words with multiple related senses—are far more common in English, they have been less frequently used as tools for understanding one-to-many word-to-meaning mappings. The current paper details two norming studies of a relatively large number of ambiguous English words. In the first, offline dominance norming is detailed for 547 homonyms and polysemes via a free association task suitable for words across the ambiguity continuum, with a goal of identifying words with more equibiased meanings. The second norming assesses offline meaning similarity for a partial subset of 318 ambiguous words (including homonyms, unambiguous words, and polysemes divided into regular and irregular types) using a novel, continuous rating method reliant on the linguistic phenomenon of zeugma. In addition, we conduct computational analyses on the human similarity norming data using the BERT pretrained neural language model (Devlin et al., 2018, BERT: Pre-training of deep bidirectional transformers for language understanding. ArXiv Preprint. arXiv:1810.04805) to evaluate factors that may explain variance beyond that accounted for by dictionary-criteria ambiguity categories. Finally, we make available the summarized item dominance values and similarity ratings in resultant appendices (see supplementary material), as well as individual item and participant norming data, which can be accessed online (https://osf.io/g7fmv/). SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.3758/s13428-022-01869-6. Springer US 2022-06-10 2023 /pmc/articles/PMC10040203/ /pubmed/35689168 http://dx.doi.org/10.3758/s13428-022-01869-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article DeLong, Katherine A. Trott, Sean Kutas, Marta Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT) |
title | Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT) |
title_full | Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT) |
title_fullStr | Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT) |
title_full_unstemmed | Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT) |
title_short | Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT) |
title_sort | offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (bert) |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10040203/ https://www.ncbi.nlm.nih.gov/pubmed/35689168 http://dx.doi.org/10.3758/s13428-022-01869-6 |
work_keys_str_mv | AT delongkatherinea offlinedominanceandzeugmaticsimilaritynormingsofvariablyambiguouswordsassessedagainstaneurallanguagemodelbert AT trottsean offlinedominanceandzeugmaticsimilaritynormingsofvariablyambiguouswordsassessedagainstaneurallanguagemodelbert AT kutasmarta offlinedominanceandzeugmaticsimilaritynormingsofvariablyambiguouswordsassessedagainstaneurallanguagemodelbert |