Cargando…

Semantic diversity is best measured with unscaled vectors: Reply to Cevoli, Watkins and Rastle (2020)

Semantic diversity refers to the degree of semantic variability in the contexts in which a particular word is used. We have previously proposed a method for measuring semantic diversity based on latent semantic analysis (LSA). In a recent paper, Cevoli et al. (2020) attempted to replicate our method...

Descripción completa

Detalles Bibliográficos
Autores principales:	Hoffman, Paul, Lambon Ralph, Matthew A., Rogers, Timothy T.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer US 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9374602/ https://www.ncbi.nlm.nih.gov/pubmed/34591284 http://dx.doi.org/10.3758/s13428-021-01693-4

_version_	1784767819945082880
author	Hoffman, Paul Lambon Ralph, Matthew A. Rogers, Timothy T.
author_facet	Hoffman, Paul Lambon Ralph, Matthew A. Rogers, Timothy T.
author_sort	Hoffman, Paul
collection	PubMed
description	Semantic diversity refers to the degree of semantic variability in the contexts in which a particular word is used. We have previously proposed a method for measuring semantic diversity based on latent semantic analysis (LSA). In a recent paper, Cevoli et al. (2020) attempted to replicate our method and obtained different semantic diversity values. They suggested that this discrepancy occurred because they scaled their LSA vectors by their singular values, while we did not. Using their new results, they argued that semantic diversity is not related to ambiguity in word meaning, as we originally proposed. In this reply, we demonstrate that the use of unscaled vectors provides better fits to human semantic judgements than scaled ones. Thus we argue that our original semantic diversity measure should be preferred over the Cevoli et al. version. We replicate Cevoli et al.’s analysis using the original semantic diversity measure and find (a) our original measure is a better predictor of word recognition latencies than the Cevoli et al. equivalent and (b) that, unlike Cevoli et al.’s measure, our semantic diversity is reliably associated with a measure of polysemy based on dictionary definitions. We conclude that the Hoffman et al. semantic diversity measure is better-suited to capturing the contextual variability among words and that words appearing in a more diverse set of contexts have more variable semantic representations. However, we found that homonyms did not have higher semantic diversity values than non-homonyms, suggesting that the measure does not capture this special case of ambiguity. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.3758/s13428-021-01693-4.
format	Online Article Text
id	pubmed-9374602
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Springer US
record_format	MEDLINE/PubMed
spelling	pubmed-93746022022-08-14 Semantic diversity is best measured with unscaled vectors: Reply to Cevoli, Watkins and Rastle (2020) Hoffman, Paul Lambon Ralph, Matthew A. Rogers, Timothy T. Behav Res Methods Article Semantic diversity refers to the degree of semantic variability in the contexts in which a particular word is used. We have previously proposed a method for measuring semantic diversity based on latent semantic analysis (LSA). In a recent paper, Cevoli et al. (2020) attempted to replicate our method and obtained different semantic diversity values. They suggested that this discrepancy occurred because they scaled their LSA vectors by their singular values, while we did not. Using their new results, they argued that semantic diversity is not related to ambiguity in word meaning, as we originally proposed. In this reply, we demonstrate that the use of unscaled vectors provides better fits to human semantic judgements than scaled ones. Thus we argue that our original semantic diversity measure should be preferred over the Cevoli et al. version. We replicate Cevoli et al.’s analysis using the original semantic diversity measure and find (a) our original measure is a better predictor of word recognition latencies than the Cevoli et al. equivalent and (b) that, unlike Cevoli et al.’s measure, our semantic diversity is reliably associated with a measure of polysemy based on dictionary definitions. We conclude that the Hoffman et al. semantic diversity measure is better-suited to capturing the contextual variability among words and that words appearing in a more diverse set of contexts have more variable semantic representations. However, we found that homonyms did not have higher semantic diversity values than non-homonyms, suggesting that the measure does not capture this special case of ambiguity. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.3758/s13428-021-01693-4. Springer US 2021-09-29 2022 /pmc/articles/PMC9374602/ /pubmed/34591284 http://dx.doi.org/10.3758/s13428-021-01693-4 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Article Hoffman, Paul Lambon Ralph, Matthew A. Rogers, Timothy T. Semantic diversity is best measured with unscaled vectors: Reply to Cevoli, Watkins and Rastle (2020)
title	Semantic diversity is best measured with unscaled vectors: Reply to Cevoli, Watkins and Rastle (2020)
title_full	Semantic diversity is best measured with unscaled vectors: Reply to Cevoli, Watkins and Rastle (2020)
title_fullStr	Semantic diversity is best measured with unscaled vectors: Reply to Cevoli, Watkins and Rastle (2020)
title_full_unstemmed	Semantic diversity is best measured with unscaled vectors: Reply to Cevoli, Watkins and Rastle (2020)
title_short	Semantic diversity is best measured with unscaled vectors: Reply to Cevoli, Watkins and Rastle (2020)
title_sort	semantic diversity is best measured with unscaled vectors: reply to cevoli, watkins and rastle (2020)
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9374602/ https://www.ncbi.nlm.nih.gov/pubmed/34591284 http://dx.doi.org/10.3758/s13428-021-01693-4
work_keys_str_mv	AT hoffmanpaul semanticdiversityisbestmeasuredwithunscaledvectorsreplytocevoliwatkinsandrastle2020 AT lambonralphmatthewa semanticdiversityisbestmeasuredwithunscaledvectorsreplytocevoliwatkinsandrastle2020 AT rogerstimothyt semanticdiversityisbestmeasuredwithunscaledvectorsreplytocevoliwatkinsandrastle2020

Semantic diversity is best measured with unscaled vectors: Reply to Cevoli, Watkins and Rastle (2020)

Ejemplares similares