Cargando…
Chinese Sentence Semantic Matching Based on Multi-Granularity Fusion Model
Sentence semantic matching is the cornerstone of many natural language processing tasks, including Chinese language processing. It is well known that Chinese sentences with different polysemous words or word order may have totally different semantic meanings. Thus, to represent and match the sentenc...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206256/ http://dx.doi.org/10.1007/978-3-030-47436-2_19 |
_version_ | 1783530379149312000 |
---|---|
author | Zhang, Xu Lu, Wenpeng Zhang, Guoqiang Li, Fangfang Wang, Shoujin |
author_facet | Zhang, Xu Lu, Wenpeng Zhang, Guoqiang Li, Fangfang Wang, Shoujin |
author_sort | Zhang, Xu |
collection | PubMed |
description | Sentence semantic matching is the cornerstone of many natural language processing tasks, including Chinese language processing. It is well known that Chinese sentences with different polysemous words or word order may have totally different semantic meanings. Thus, to represent and match the sentence semantic meaning accurately, one challenge that must be solved is how to capture the semantic features from the multi-granularity perspective, e.g., characters and words. To address the above challenge, we propose a novel sentence semantic matching model which is based on the fusion of semantic features from character-granularity and word-granularity, respectively. Particularly, the multi-granularity fusion intends to extract more semantic features to better optimize the downstream sentence semantic matching. In addition, we propose the equilibrium cross-entropy, a novel loss function, by setting mean square error (MSE) as an equilibrium factor of cross-entropy. The experimental results conducted on Chinese open data set demonstrate that our proposed model combined with binary equilibrium cross-entropy loss function is superior to the existing state-of-the-art sentence semantic matching models. |
format | Online Article Text |
id | pubmed-7206256 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-72062562020-05-08 Chinese Sentence Semantic Matching Based on Multi-Granularity Fusion Model Zhang, Xu Lu, Wenpeng Zhang, Guoqiang Li, Fangfang Wang, Shoujin Advances in Knowledge Discovery and Data Mining Article Sentence semantic matching is the cornerstone of many natural language processing tasks, including Chinese language processing. It is well known that Chinese sentences with different polysemous words or word order may have totally different semantic meanings. Thus, to represent and match the sentence semantic meaning accurately, one challenge that must be solved is how to capture the semantic features from the multi-granularity perspective, e.g., characters and words. To address the above challenge, we propose a novel sentence semantic matching model which is based on the fusion of semantic features from character-granularity and word-granularity, respectively. Particularly, the multi-granularity fusion intends to extract more semantic features to better optimize the downstream sentence semantic matching. In addition, we propose the equilibrium cross-entropy, a novel loss function, by setting mean square error (MSE) as an equilibrium factor of cross-entropy. The experimental results conducted on Chinese open data set demonstrate that our proposed model combined with binary equilibrium cross-entropy loss function is superior to the existing state-of-the-art sentence semantic matching models. 2020-04-17 /pmc/articles/PMC7206256/ http://dx.doi.org/10.1007/978-3-030-47436-2_19 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Zhang, Xu Lu, Wenpeng Zhang, Guoqiang Li, Fangfang Wang, Shoujin Chinese Sentence Semantic Matching Based on Multi-Granularity Fusion Model |
title | Chinese Sentence Semantic Matching Based on Multi-Granularity Fusion Model |
title_full | Chinese Sentence Semantic Matching Based on Multi-Granularity Fusion Model |
title_fullStr | Chinese Sentence Semantic Matching Based on Multi-Granularity Fusion Model |
title_full_unstemmed | Chinese Sentence Semantic Matching Based on Multi-Granularity Fusion Model |
title_short | Chinese Sentence Semantic Matching Based on Multi-Granularity Fusion Model |
title_sort | chinese sentence semantic matching based on multi-granularity fusion model |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206256/ http://dx.doi.org/10.1007/978-3-030-47436-2_19 |
work_keys_str_mv | AT zhangxu chinesesentencesemanticmatchingbasedonmultigranularityfusionmodel AT luwenpeng chinesesentencesemanticmatchingbasedonmultigranularityfusionmodel AT zhangguoqiang chinesesentencesemanticmatchingbasedonmultigranularityfusionmodel AT lifangfang chinesesentencesemanticmatchingbasedonmultigranularityfusionmodel AT wangshoujin chinesesentencesemanticmatchingbasedonmultigranularityfusionmodel |