Cargando…
Applying the Bell’s Test to Chinese Texts
Search engines are able to find documents containing patterns from a query. This approach can be used for alphabetic languages such as English. However, Chinese is highly dependent on context. The significant problem of Chinese text processing is the missing blanks between words, so it is necessary...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7516728/ https://www.ncbi.nlm.nih.gov/pubmed/33286049 http://dx.doi.org/10.3390/e22030275 |
_version_ | 1783587068165750784 |
---|---|
author | Bessmertny, Igor A. Huang, Xiaoxi Platonov, Aleksei V. Yu, Chuqiao Koroleva, Julia A. |
author_facet | Bessmertny, Igor A. Huang, Xiaoxi Platonov, Aleksei V. Yu, Chuqiao Koroleva, Julia A. |
author_sort | Bessmertny, Igor A. |
collection | PubMed |
description | Search engines are able to find documents containing patterns from a query. This approach can be used for alphabetic languages such as English. However, Chinese is highly dependent on context. The significant problem of Chinese text processing is the missing blanks between words, so it is necessary to segment the text to words before any other action. Algorithms for Chinese text segmentation should consider context; that is, the word segmentation process depends on other ideograms. As the existing segmentation algorithms are imperfect, we have considered an approach to build the context from all possible n-grams surrounding the query words. This paper proposes a quantum-inspired approach to rank Chinese text documents by their relevancy to the query. Particularly, this approach uses Bell’s test, which measures the quantum entanglement of two words within the context. The contexts of words are built using the hyperspace analogue to language (HAL) algorithm. Experiments fulfilled in three domains demonstrated that the proposed approach provides acceptable results. |
format | Online Article Text |
id | pubmed-7516728 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-75167282020-11-09 Applying the Bell’s Test to Chinese Texts Bessmertny, Igor A. Huang, Xiaoxi Platonov, Aleksei V. Yu, Chuqiao Koroleva, Julia A. Entropy (Basel) Article Search engines are able to find documents containing patterns from a query. This approach can be used for alphabetic languages such as English. However, Chinese is highly dependent on context. The significant problem of Chinese text processing is the missing blanks between words, so it is necessary to segment the text to words before any other action. Algorithms for Chinese text segmentation should consider context; that is, the word segmentation process depends on other ideograms. As the existing segmentation algorithms are imperfect, we have considered an approach to build the context from all possible n-grams surrounding the query words. This paper proposes a quantum-inspired approach to rank Chinese text documents by their relevancy to the query. Particularly, this approach uses Bell’s test, which measures the quantum entanglement of two words within the context. The contexts of words are built using the hyperspace analogue to language (HAL) algorithm. Experiments fulfilled in three domains demonstrated that the proposed approach provides acceptable results. MDPI 2020-02-28 /pmc/articles/PMC7516728/ /pubmed/33286049 http://dx.doi.org/10.3390/e22030275 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Bessmertny, Igor A. Huang, Xiaoxi Platonov, Aleksei V. Yu, Chuqiao Koroleva, Julia A. Applying the Bell’s Test to Chinese Texts |
title | Applying the Bell’s Test to Chinese Texts |
title_full | Applying the Bell’s Test to Chinese Texts |
title_fullStr | Applying the Bell’s Test to Chinese Texts |
title_full_unstemmed | Applying the Bell’s Test to Chinese Texts |
title_short | Applying the Bell’s Test to Chinese Texts |
title_sort | applying the bell’s test to chinese texts |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7516728/ https://www.ncbi.nlm.nih.gov/pubmed/33286049 http://dx.doi.org/10.3390/e22030275 |
work_keys_str_mv | AT bessmertnyigora applyingthebellstesttochinesetexts AT huangxiaoxi applyingthebellstesttochinesetexts AT platonovalekseiv applyingthebellstesttochinesetexts AT yuchuqiao applyingthebellstesttochinesetexts AT korolevajuliaa applyingthebellstesttochinesetexts |