Cargando…
Semantic diversity is best measured with unscaled vectors: Reply to Cevoli, Watkins and Rastle (2020)
Semantic diversity refers to the degree of semantic variability in the contexts in which a particular word is used. We have previously proposed a method for measuring semantic diversity based on latent semantic analysis (LSA). In a recent paper, Cevoli et al. (2020) attempted to replicate our method...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer US
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9374602/ https://www.ncbi.nlm.nih.gov/pubmed/34591284 http://dx.doi.org/10.3758/s13428-021-01693-4 |
_version_ | 1784767819945082880 |
---|---|
author | Hoffman, Paul Lambon Ralph, Matthew A. Rogers, Timothy T. |
author_facet | Hoffman, Paul Lambon Ralph, Matthew A. Rogers, Timothy T. |
author_sort | Hoffman, Paul |
collection | PubMed |
description | Semantic diversity refers to the degree of semantic variability in the contexts in which a particular word is used. We have previously proposed a method for measuring semantic diversity based on latent semantic analysis (LSA). In a recent paper, Cevoli et al. (2020) attempted to replicate our method and obtained different semantic diversity values. They suggested that this discrepancy occurred because they scaled their LSA vectors by their singular values, while we did not. Using their new results, they argued that semantic diversity is not related to ambiguity in word meaning, as we originally proposed. In this reply, we demonstrate that the use of unscaled vectors provides better fits to human semantic judgements than scaled ones. Thus we argue that our original semantic diversity measure should be preferred over the Cevoli et al. version. We replicate Cevoli et al.’s analysis using the original semantic diversity measure and find (a) our original measure is a better predictor of word recognition latencies than the Cevoli et al. equivalent and (b) that, unlike Cevoli et al.’s measure, our semantic diversity is reliably associated with a measure of polysemy based on dictionary definitions. We conclude that the Hoffman et al. semantic diversity measure is better-suited to capturing the contextual variability among words and that words appearing in a more diverse set of contexts have more variable semantic representations. However, we found that homonyms did not have higher semantic diversity values than non-homonyms, suggesting that the measure does not capture this special case of ambiguity. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.3758/s13428-021-01693-4. |
format | Online Article Text |
id | pubmed-9374602 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer US |
record_format | MEDLINE/PubMed |
spelling | pubmed-93746022022-08-14 Semantic diversity is best measured with unscaled vectors: Reply to Cevoli, Watkins and Rastle (2020) Hoffman, Paul Lambon Ralph, Matthew A. Rogers, Timothy T. Behav Res Methods Article Semantic diversity refers to the degree of semantic variability in the contexts in which a particular word is used. We have previously proposed a method for measuring semantic diversity based on latent semantic analysis (LSA). In a recent paper, Cevoli et al. (2020) attempted to replicate our method and obtained different semantic diversity values. They suggested that this discrepancy occurred because they scaled their LSA vectors by their singular values, while we did not. Using their new results, they argued that semantic diversity is not related to ambiguity in word meaning, as we originally proposed. In this reply, we demonstrate that the use of unscaled vectors provides better fits to human semantic judgements than scaled ones. Thus we argue that our original semantic diversity measure should be preferred over the Cevoli et al. version. We replicate Cevoli et al.’s analysis using the original semantic diversity measure and find (a) our original measure is a better predictor of word recognition latencies than the Cevoli et al. equivalent and (b) that, unlike Cevoli et al.’s measure, our semantic diversity is reliably associated with a measure of polysemy based on dictionary definitions. We conclude that the Hoffman et al. semantic diversity measure is better-suited to capturing the contextual variability among words and that words appearing in a more diverse set of contexts have more variable semantic representations. However, we found that homonyms did not have higher semantic diversity values than non-homonyms, suggesting that the measure does not capture this special case of ambiguity. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.3758/s13428-021-01693-4. Springer US 2021-09-29 2022 /pmc/articles/PMC9374602/ /pubmed/34591284 http://dx.doi.org/10.3758/s13428-021-01693-4 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Hoffman, Paul Lambon Ralph, Matthew A. Rogers, Timothy T. Semantic diversity is best measured with unscaled vectors: Reply to Cevoli, Watkins and Rastle (2020) |
title | Semantic diversity is best measured with unscaled vectors: Reply to Cevoli, Watkins and Rastle (2020) |
title_full | Semantic diversity is best measured with unscaled vectors: Reply to Cevoli, Watkins and Rastle (2020) |
title_fullStr | Semantic diversity is best measured with unscaled vectors: Reply to Cevoli, Watkins and Rastle (2020) |
title_full_unstemmed | Semantic diversity is best measured with unscaled vectors: Reply to Cevoli, Watkins and Rastle (2020) |
title_short | Semantic diversity is best measured with unscaled vectors: Reply to Cevoli, Watkins and Rastle (2020) |
title_sort | semantic diversity is best measured with unscaled vectors: reply to cevoli, watkins and rastle (2020) |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9374602/ https://www.ncbi.nlm.nih.gov/pubmed/34591284 http://dx.doi.org/10.3758/s13428-021-01693-4 |
work_keys_str_mv | AT hoffmanpaul semanticdiversityisbestmeasuredwithunscaledvectorsreplytocevoliwatkinsandrastle2020 AT lambonralphmatthewa semanticdiversityisbestmeasuredwithunscaledvectorsreplytocevoliwatkinsandrastle2020 AT rogerstimothyt semanticdiversityisbestmeasuredwithunscaledvectorsreplytocevoliwatkinsandrastle2020 |