Cargando…

Applying the Bell’s Test to Chinese Texts

Search engines are able to find documents containing patterns from a query. This approach can be used for alphabetic languages such as English. However, Chinese is highly dependent on context. The significant problem of Chinese text processing is the missing blanks between words, so it is necessary...

Descripción completa

Detalles Bibliográficos
Autores principales: Bessmertny, Igor A., Huang, Xiaoxi, Platonov, Aleksei V., Yu, Chuqiao, Koroleva, Julia A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7516728/
https://www.ncbi.nlm.nih.gov/pubmed/33286049
http://dx.doi.org/10.3390/e22030275
_version_ 1783587068165750784
author Bessmertny, Igor A.
Huang, Xiaoxi
Platonov, Aleksei V.
Yu, Chuqiao
Koroleva, Julia A.
author_facet Bessmertny, Igor A.
Huang, Xiaoxi
Platonov, Aleksei V.
Yu, Chuqiao
Koroleva, Julia A.
author_sort Bessmertny, Igor A.
collection PubMed
description Search engines are able to find documents containing patterns from a query. This approach can be used for alphabetic languages such as English. However, Chinese is highly dependent on context. The significant problem of Chinese text processing is the missing blanks between words, so it is necessary to segment the text to words before any other action. Algorithms for Chinese text segmentation should consider context; that is, the word segmentation process depends on other ideograms. As the existing segmentation algorithms are imperfect, we have considered an approach to build the context from all possible n-grams surrounding the query words. This paper proposes a quantum-inspired approach to rank Chinese text documents by their relevancy to the query. Particularly, this approach uses Bell’s test, which measures the quantum entanglement of two words within the context. The contexts of words are built using the hyperspace analogue to language (HAL) algorithm. Experiments fulfilled in three domains demonstrated that the proposed approach provides acceptable results.
format Online
Article
Text
id pubmed-7516728
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75167282020-11-09 Applying the Bell’s Test to Chinese Texts Bessmertny, Igor A. Huang, Xiaoxi Platonov, Aleksei V. Yu, Chuqiao Koroleva, Julia A. Entropy (Basel) Article Search engines are able to find documents containing patterns from a query. This approach can be used for alphabetic languages such as English. However, Chinese is highly dependent on context. The significant problem of Chinese text processing is the missing blanks between words, so it is necessary to segment the text to words before any other action. Algorithms for Chinese text segmentation should consider context; that is, the word segmentation process depends on other ideograms. As the existing segmentation algorithms are imperfect, we have considered an approach to build the context from all possible n-grams surrounding the query words. This paper proposes a quantum-inspired approach to rank Chinese text documents by their relevancy to the query. Particularly, this approach uses Bell’s test, which measures the quantum entanglement of two words within the context. The contexts of words are built using the hyperspace analogue to language (HAL) algorithm. Experiments fulfilled in three domains demonstrated that the proposed approach provides acceptable results. MDPI 2020-02-28 /pmc/articles/PMC7516728/ /pubmed/33286049 http://dx.doi.org/10.3390/e22030275 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Bessmertny, Igor A.
Huang, Xiaoxi
Platonov, Aleksei V.
Yu, Chuqiao
Koroleva, Julia A.
Applying the Bell’s Test to Chinese Texts
title Applying the Bell’s Test to Chinese Texts
title_full Applying the Bell’s Test to Chinese Texts
title_fullStr Applying the Bell’s Test to Chinese Texts
title_full_unstemmed Applying the Bell’s Test to Chinese Texts
title_short Applying the Bell’s Test to Chinese Texts
title_sort applying the bell’s test to chinese texts
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7516728/
https://www.ncbi.nlm.nih.gov/pubmed/33286049
http://dx.doi.org/10.3390/e22030275
work_keys_str_mv AT bessmertnyigora applyingthebellstesttochinesetexts
AT huangxiaoxi applyingthebellstesttochinesetexts
AT platonovalekseiv applyingthebellstesttochinesetexts
AT yuchuqiao applyingthebellstesttochinesetexts
AT korolevajuliaa applyingthebellstesttochinesetexts