Cargando…

Chinese Sentence Semantic Matching Based on Multi-Granularity Fusion Model

Sentence semantic matching is the cornerstone of many natural language processing tasks, including Chinese language processing. It is well known that Chinese sentences with different polysemous words or word order may have totally different semantic meanings. Thus, to represent and match the sentenc...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Xu, Lu, Wenpeng, Zhang, Guoqiang, Li, Fangfang, Wang, Shoujin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206256/
http://dx.doi.org/10.1007/978-3-030-47436-2_19
_version_ 1783530379149312000
author Zhang, Xu
Lu, Wenpeng
Zhang, Guoqiang
Li, Fangfang
Wang, Shoujin
author_facet Zhang, Xu
Lu, Wenpeng
Zhang, Guoqiang
Li, Fangfang
Wang, Shoujin
author_sort Zhang, Xu
collection PubMed
description Sentence semantic matching is the cornerstone of many natural language processing tasks, including Chinese language processing. It is well known that Chinese sentences with different polysemous words or word order may have totally different semantic meanings. Thus, to represent and match the sentence semantic meaning accurately, one challenge that must be solved is how to capture the semantic features from the multi-granularity perspective, e.g., characters and words. To address the above challenge, we propose a novel sentence semantic matching model which is based on the fusion of semantic features from character-granularity and word-granularity, respectively. Particularly, the multi-granularity fusion intends to extract more semantic features to better optimize the downstream sentence semantic matching. In addition, we propose the equilibrium cross-entropy, a novel loss function, by setting mean square error (MSE) as an equilibrium factor of cross-entropy. The experimental results conducted on Chinese open data set demonstrate that our proposed model combined with binary equilibrium cross-entropy loss function is superior to the existing state-of-the-art sentence semantic matching models.
format Online
Article
Text
id pubmed-7206256
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-72062562020-05-08 Chinese Sentence Semantic Matching Based on Multi-Granularity Fusion Model Zhang, Xu Lu, Wenpeng Zhang, Guoqiang Li, Fangfang Wang, Shoujin Advances in Knowledge Discovery and Data Mining Article Sentence semantic matching is the cornerstone of many natural language processing tasks, including Chinese language processing. It is well known that Chinese sentences with different polysemous words or word order may have totally different semantic meanings. Thus, to represent and match the sentence semantic meaning accurately, one challenge that must be solved is how to capture the semantic features from the multi-granularity perspective, e.g., characters and words. To address the above challenge, we propose a novel sentence semantic matching model which is based on the fusion of semantic features from character-granularity and word-granularity, respectively. Particularly, the multi-granularity fusion intends to extract more semantic features to better optimize the downstream sentence semantic matching. In addition, we propose the equilibrium cross-entropy, a novel loss function, by setting mean square error (MSE) as an equilibrium factor of cross-entropy. The experimental results conducted on Chinese open data set demonstrate that our proposed model combined with binary equilibrium cross-entropy loss function is superior to the existing state-of-the-art sentence semantic matching models. 2020-04-17 /pmc/articles/PMC7206256/ http://dx.doi.org/10.1007/978-3-030-47436-2_19 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Zhang, Xu
Lu, Wenpeng
Zhang, Guoqiang
Li, Fangfang
Wang, Shoujin
Chinese Sentence Semantic Matching Based on Multi-Granularity Fusion Model
title Chinese Sentence Semantic Matching Based on Multi-Granularity Fusion Model
title_full Chinese Sentence Semantic Matching Based on Multi-Granularity Fusion Model
title_fullStr Chinese Sentence Semantic Matching Based on Multi-Granularity Fusion Model
title_full_unstemmed Chinese Sentence Semantic Matching Based on Multi-Granularity Fusion Model
title_short Chinese Sentence Semantic Matching Based on Multi-Granularity Fusion Model
title_sort chinese sentence semantic matching based on multi-granularity fusion model
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206256/
http://dx.doi.org/10.1007/978-3-030-47436-2_19
work_keys_str_mv AT zhangxu chinesesentencesemanticmatchingbasedonmultigranularityfusionmodel
AT luwenpeng chinesesentencesemanticmatchingbasedonmultigranularityfusionmodel
AT zhangguoqiang chinesesentencesemanticmatchingbasedonmultigranularityfusionmodel
AT lifangfang chinesesentencesemanticmatchingbasedonmultigranularityfusionmodel
AT wangshoujin chinesesentencesemanticmatchingbasedonmultigranularityfusionmodel