Cargando…
Understanding the spatial dimension of natural language by measuring the spatial semantic similarity of words through a scalable geospatial context window
Measuring the semantic similarity between words is important for natural language processing tasks. The traditional models of semantic similarity perform well in most cases, but when dealing with words that involve geographical context, spatial semantics of implied spatial information are rarely pre...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7377466/ https://www.ncbi.nlm.nih.gov/pubmed/32702022 http://dx.doi.org/10.1371/journal.pone.0236347 |
_version_ | 1783562222045233152 |
---|---|
author | Wang, Bozhi Fei, Teng Kang, Yuhao Li, Meng Du, Qingyun Han, Meng Dong, Ning |
author_facet | Wang, Bozhi Fei, Teng Kang, Yuhao Li, Meng Du, Qingyun Han, Meng Dong, Ning |
author_sort | Wang, Bozhi |
collection | PubMed |
description | Measuring the semantic similarity between words is important for natural language processing tasks. The traditional models of semantic similarity perform well in most cases, but when dealing with words that involve geographical context, spatial semantics of implied spatial information are rarely preserved. Geographic information retrieval (GIR) methods have focused on this issue; however, they sometimes fail to solve the problem because the spatial and textual similarities of words are considered and calculated separately. In this paper, from the perspective of spatial context, we consider the two parts as a whole—spatial context semantics, and we propose a method that measures spatial semantic similarity using a sliding geospatial context window for geo-tagged words. The proposed method was first validated with a set of simulated data and then applied to a real-world dataset from Flickr. As a result, a spatial semantic similarity model at different scales is presented. We believe this model is a necessary supplement for traditional textual-language semantic analyses of words obtained by word-embedding technologies. This study has the potential to improve the quality of recommendation systems by considering relevant spatial context semantics, and benefits linguistic semantic research by emphasising the spatial cognition among words. |
format | Online Article Text |
id | pubmed-7377466 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-73774662020-07-27 Understanding the spatial dimension of natural language by measuring the spatial semantic similarity of words through a scalable geospatial context window Wang, Bozhi Fei, Teng Kang, Yuhao Li, Meng Du, Qingyun Han, Meng Dong, Ning PLoS One Research Article Measuring the semantic similarity between words is important for natural language processing tasks. The traditional models of semantic similarity perform well in most cases, but when dealing with words that involve geographical context, spatial semantics of implied spatial information are rarely preserved. Geographic information retrieval (GIR) methods have focused on this issue; however, they sometimes fail to solve the problem because the spatial and textual similarities of words are considered and calculated separately. In this paper, from the perspective of spatial context, we consider the two parts as a whole—spatial context semantics, and we propose a method that measures spatial semantic similarity using a sliding geospatial context window for geo-tagged words. The proposed method was first validated with a set of simulated data and then applied to a real-world dataset from Flickr. As a result, a spatial semantic similarity model at different scales is presented. We believe this model is a necessary supplement for traditional textual-language semantic analyses of words obtained by word-embedding technologies. This study has the potential to improve the quality of recommendation systems by considering relevant spatial context semantics, and benefits linguistic semantic research by emphasising the spatial cognition among words. Public Library of Science 2020-07-23 /pmc/articles/PMC7377466/ /pubmed/32702022 http://dx.doi.org/10.1371/journal.pone.0236347 Text en © 2020 Wang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Wang, Bozhi Fei, Teng Kang, Yuhao Li, Meng Du, Qingyun Han, Meng Dong, Ning Understanding the spatial dimension of natural language by measuring the spatial semantic similarity of words through a scalable geospatial context window |
title | Understanding the spatial dimension of natural language by measuring the spatial semantic similarity of words through a scalable geospatial context window |
title_full | Understanding the spatial dimension of natural language by measuring the spatial semantic similarity of words through a scalable geospatial context window |
title_fullStr | Understanding the spatial dimension of natural language by measuring the spatial semantic similarity of words through a scalable geospatial context window |
title_full_unstemmed | Understanding the spatial dimension of natural language by measuring the spatial semantic similarity of words through a scalable geospatial context window |
title_short | Understanding the spatial dimension of natural language by measuring the spatial semantic similarity of words through a scalable geospatial context window |
title_sort | understanding the spatial dimension of natural language by measuring the spatial semantic similarity of words through a scalable geospatial context window |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7377466/ https://www.ncbi.nlm.nih.gov/pubmed/32702022 http://dx.doi.org/10.1371/journal.pone.0236347 |
work_keys_str_mv | AT wangbozhi understandingthespatialdimensionofnaturallanguagebymeasuringthespatialsemanticsimilarityofwordsthroughascalablegeospatialcontextwindow AT feiteng understandingthespatialdimensionofnaturallanguagebymeasuringthespatialsemanticsimilarityofwordsthroughascalablegeospatialcontextwindow AT kangyuhao understandingthespatialdimensionofnaturallanguagebymeasuringthespatialsemanticsimilarityofwordsthroughascalablegeospatialcontextwindow AT limeng understandingthespatialdimensionofnaturallanguagebymeasuringthespatialsemanticsimilarityofwordsthroughascalablegeospatialcontextwindow AT duqingyun understandingthespatialdimensionofnaturallanguagebymeasuringthespatialsemanticsimilarityofwordsthroughascalablegeospatialcontextwindow AT hanmeng understandingthespatialdimensionofnaturallanguagebymeasuringthespatialsemanticsimilarityofwordsthroughascalablegeospatialcontextwindow AT dongning understandingthespatialdimensionofnaturallanguagebymeasuringthespatialsemanticsimilarityofwordsthroughascalablegeospatialcontextwindow |