Cargando…

Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT)

For any research program examining how ambiguous words are processed in broader linguistic contexts, a first step is to establish factors relating to the frequency balance or dominance of those words’ multiple meanings, as well as the similarity of those meanings to one other. Homonyms—words with di...

Descripción completa

Detalles Bibliográficos
Autores principales:	DeLong, Katherine A., Trott, Sean, Kutas, Marta
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer US 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10040203/ https://www.ncbi.nlm.nih.gov/pubmed/35689168 http://dx.doi.org/10.3758/s13428-022-01869-6

_version_	1784912431550562304
author	DeLong, Katherine A. Trott, Sean Kutas, Marta
author_facet	DeLong, Katherine A. Trott, Sean Kutas, Marta
author_sort	DeLong, Katherine A.
collection	PubMed
description	For any research program examining how ambiguous words are processed in broader linguistic contexts, a first step is to establish factors relating to the frequency balance or dominance of those words’ multiple meanings, as well as the similarity of those meanings to one other. Homonyms—words with divergent meanings—are one ambiguous word type commonly utilized in psycholinguistic research. In contrast, although polysemes—words with multiple related senses—are far more common in English, they have been less frequently used as tools for understanding one-to-many word-to-meaning mappings. The current paper details two norming studies of a relatively large number of ambiguous English words. In the first, offline dominance norming is detailed for 547 homonyms and polysemes via a free association task suitable for words across the ambiguity continuum, with a goal of identifying words with more equibiased meanings. The second norming assesses offline meaning similarity for a partial subset of 318 ambiguous words (including homonyms, unambiguous words, and polysemes divided into regular and irregular types) using a novel, continuous rating method reliant on the linguistic phenomenon of zeugma. In addition, we conduct computational analyses on the human similarity norming data using the BERT pretrained neural language model (Devlin et al., 2018, BERT: Pre-training of deep bidirectional transformers for language understanding. ArXiv Preprint. arXiv:1810.04805) to evaluate factors that may explain variance beyond that accounted for by dictionary-criteria ambiguity categories. Finally, we make available the summarized item dominance values and similarity ratings in resultant appendices (see supplementary material), as well as individual item and participant norming data, which can be accessed online (https://osf.io/g7fmv/). SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.3758/s13428-022-01869-6.
format	Online Article Text
id	pubmed-10040203
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Springer US
record_format	MEDLINE/PubMed
spelling	pubmed-100402032023-06-10 Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT) DeLong, Katherine A. Trott, Sean Kutas, Marta Behav Res Methods Article For any research program examining how ambiguous words are processed in broader linguistic contexts, a first step is to establish factors relating to the frequency balance or dominance of those words’ multiple meanings, as well as the similarity of those meanings to one other. Homonyms—words with divergent meanings—are one ambiguous word type commonly utilized in psycholinguistic research. In contrast, although polysemes—words with multiple related senses—are far more common in English, they have been less frequently used as tools for understanding one-to-many word-to-meaning mappings. The current paper details two norming studies of a relatively large number of ambiguous English words. In the first, offline dominance norming is detailed for 547 homonyms and polysemes via a free association task suitable for words across the ambiguity continuum, with a goal of identifying words with more equibiased meanings. The second norming assesses offline meaning similarity for a partial subset of 318 ambiguous words (including homonyms, unambiguous words, and polysemes divided into regular and irregular types) using a novel, continuous rating method reliant on the linguistic phenomenon of zeugma. In addition, we conduct computational analyses on the human similarity norming data using the BERT pretrained neural language model (Devlin et al., 2018, BERT: Pre-training of deep bidirectional transformers for language understanding. ArXiv Preprint. arXiv:1810.04805) to evaluate factors that may explain variance beyond that accounted for by dictionary-criteria ambiguity categories. Finally, we make available the summarized item dominance values and similarity ratings in resultant appendices (see supplementary material), as well as individual item and participant norming data, which can be accessed online (https://osf.io/g7fmv/). SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.3758/s13428-022-01869-6. Springer US 2022-06-10 2023 /pmc/articles/PMC10040203/ /pubmed/35689168 http://dx.doi.org/10.3758/s13428-022-01869-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Article DeLong, Katherine A. Trott, Sean Kutas, Marta Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT)
title	Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT)
title_full	Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT)
title_fullStr	Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT)
title_full_unstemmed	Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT)
title_short	Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT)
title_sort	offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (bert)
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10040203/ https://www.ncbi.nlm.nih.gov/pubmed/35689168 http://dx.doi.org/10.3758/s13428-022-01869-6
work_keys_str_mv	AT delongkatherinea offlinedominanceandzeugmaticsimilaritynormingsofvariablyambiguouswordsassessedagainstaneurallanguagemodelbert AT trottsean offlinedominanceandzeugmaticsimilaritynormingsofvariablyambiguouswordsassessedagainstaneurallanguagemodelbert AT kutasmarta offlinedominanceandzeugmaticsimilaritynormingsofvariablyambiguouswordsassessedagainstaneurallanguagemodelbert

Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT)

Ejemplares similares