Cargando…

Understanding the spatial dimension of natural language by measuring the spatial semantic similarity of words through a scalable geospatial context window

Measuring the semantic similarity between words is important for natural language processing tasks. The traditional models of semantic similarity perform well in most cases, but when dealing with words that involve geographical context, spatial semantics of implied spatial information are rarely pre...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Bozhi, Fei, Teng, Kang, Yuhao, Li, Meng, Du, Qingyun, Han, Meng, Dong, Ning
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7377466/
https://www.ncbi.nlm.nih.gov/pubmed/32702022
http://dx.doi.org/10.1371/journal.pone.0236347
_version_ 1783562222045233152
author Wang, Bozhi
Fei, Teng
Kang, Yuhao
Li, Meng
Du, Qingyun
Han, Meng
Dong, Ning
author_facet Wang, Bozhi
Fei, Teng
Kang, Yuhao
Li, Meng
Du, Qingyun
Han, Meng
Dong, Ning
author_sort Wang, Bozhi
collection PubMed
description Measuring the semantic similarity between words is important for natural language processing tasks. The traditional models of semantic similarity perform well in most cases, but when dealing with words that involve geographical context, spatial semantics of implied spatial information are rarely preserved. Geographic information retrieval (GIR) methods have focused on this issue; however, they sometimes fail to solve the problem because the spatial and textual similarities of words are considered and calculated separately. In this paper, from the perspective of spatial context, we consider the two parts as a whole—spatial context semantics, and we propose a method that measures spatial semantic similarity using a sliding geospatial context window for geo-tagged words. The proposed method was first validated with a set of simulated data and then applied to a real-world dataset from Flickr. As a result, a spatial semantic similarity model at different scales is presented. We believe this model is a necessary supplement for traditional textual-language semantic analyses of words obtained by word-embedding technologies. This study has the potential to improve the quality of recommendation systems by considering relevant spatial context semantics, and benefits linguistic semantic research by emphasising the spatial cognition among words.
format Online
Article
Text
id pubmed-7377466
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-73774662020-07-27 Understanding the spatial dimension of natural language by measuring the spatial semantic similarity of words through a scalable geospatial context window Wang, Bozhi Fei, Teng Kang, Yuhao Li, Meng Du, Qingyun Han, Meng Dong, Ning PLoS One Research Article Measuring the semantic similarity between words is important for natural language processing tasks. The traditional models of semantic similarity perform well in most cases, but when dealing with words that involve geographical context, spatial semantics of implied spatial information are rarely preserved. Geographic information retrieval (GIR) methods have focused on this issue; however, they sometimes fail to solve the problem because the spatial and textual similarities of words are considered and calculated separately. In this paper, from the perspective of spatial context, we consider the two parts as a whole—spatial context semantics, and we propose a method that measures spatial semantic similarity using a sliding geospatial context window for geo-tagged words. The proposed method was first validated with a set of simulated data and then applied to a real-world dataset from Flickr. As a result, a spatial semantic similarity model at different scales is presented. We believe this model is a necessary supplement for traditional textual-language semantic analyses of words obtained by word-embedding technologies. This study has the potential to improve the quality of recommendation systems by considering relevant spatial context semantics, and benefits linguistic semantic research by emphasising the spatial cognition among words. Public Library of Science 2020-07-23 /pmc/articles/PMC7377466/ /pubmed/32702022 http://dx.doi.org/10.1371/journal.pone.0236347 Text en © 2020 Wang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wang, Bozhi
Fei, Teng
Kang, Yuhao
Li, Meng
Du, Qingyun
Han, Meng
Dong, Ning
Understanding the spatial dimension of natural language by measuring the spatial semantic similarity of words through a scalable geospatial context window
title Understanding the spatial dimension of natural language by measuring the spatial semantic similarity of words through a scalable geospatial context window
title_full Understanding the spatial dimension of natural language by measuring the spatial semantic similarity of words through a scalable geospatial context window
title_fullStr Understanding the spatial dimension of natural language by measuring the spatial semantic similarity of words through a scalable geospatial context window
title_full_unstemmed Understanding the spatial dimension of natural language by measuring the spatial semantic similarity of words through a scalable geospatial context window
title_short Understanding the spatial dimension of natural language by measuring the spatial semantic similarity of words through a scalable geospatial context window
title_sort understanding the spatial dimension of natural language by measuring the spatial semantic similarity of words through a scalable geospatial context window
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7377466/
https://www.ncbi.nlm.nih.gov/pubmed/32702022
http://dx.doi.org/10.1371/journal.pone.0236347
work_keys_str_mv AT wangbozhi understandingthespatialdimensionofnaturallanguagebymeasuringthespatialsemanticsimilarityofwordsthroughascalablegeospatialcontextwindow
AT feiteng understandingthespatialdimensionofnaturallanguagebymeasuringthespatialsemanticsimilarityofwordsthroughascalablegeospatialcontextwindow
AT kangyuhao understandingthespatialdimensionofnaturallanguagebymeasuringthespatialsemanticsimilarityofwordsthroughascalablegeospatialcontextwindow
AT limeng understandingthespatialdimensionofnaturallanguagebymeasuringthespatialsemanticsimilarityofwordsthroughascalablegeospatialcontextwindow
AT duqingyun understandingthespatialdimensionofnaturallanguagebymeasuringthespatialsemanticsimilarityofwordsthroughascalablegeospatialcontextwindow
AT hanmeng understandingthespatialdimensionofnaturallanguagebymeasuringthespatialsemanticsimilarityofwordsthroughascalablegeospatialcontextwindow
AT dongning understandingthespatialdimensionofnaturallanguagebymeasuringthespatialsemanticsimilarityofwordsthroughascalablegeospatialcontextwindow