Cargando…
A Study on Differences between Simplified and Traditional Chinese Based on Complex Network Analysis of the Word Co-Occurrence Networks
Currently, most work on comparing differences between simplified and traditional Chinese only focuses on the character or lexical level, without taking the global differences into consideration. In order to solve this problem, this paper proposes to use complex network analysis of word co-occurrence...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728479/ https://www.ncbi.nlm.nih.gov/pubmed/33343654 http://dx.doi.org/10.1155/2020/8863847 |
_version_ | 1783621284212506624 |
---|---|
author | Jiang, Zhongqiang Zhao, Dongmei Zheng, Jiangbin Chen, Yidong |
author_facet | Jiang, Zhongqiang Zhao, Dongmei Zheng, Jiangbin Chen, Yidong |
author_sort | Jiang, Zhongqiang |
collection | PubMed |
description | Currently, most work on comparing differences between simplified and traditional Chinese only focuses on the character or lexical level, without taking the global differences into consideration. In order to solve this problem, this paper proposes to use complex network analysis of word co-occurrence networks, which have been successfully applied to the language analysis research and can tackle global characters and explore the differences between simplified and traditional Chinese. Specially, we first constructed a word co-occurrence network for simplified and traditional Chinese using selected news corpora. Then, the complex network analysis methods were performed, including network statistics analysis, kernel lexicon comparison, and motif analysis, to gain a global understanding of these networks. After that, the networks were compared based on the properties obtained. Through comparison, we can obtain three interesting results: first, the co-occurrence networks of simplified Chinese and traditional Chinese are both small-world and scale-free networks. However, given the same corpus size, the co-occurrence networks of traditional Chinese tend to have more nodes, which may be due to a large number of one-to-many character/word mappings from simplified Chinese to traditional Chinese; second, since traditional Chinese retains more ancient Chinese words and uses fewer weak verbs, the traditional Chinese kernel lexicons have more entries than the simplified Chinese kernel lexicons; third, motif analysis shows that there is no difference between the simplified Chinese network and the corresponding traditional Chinese network, which means that simplified and traditional Chinese are semantically consistent. |
format | Online Article Text |
id | pubmed-7728479 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-77284792020-12-17 A Study on Differences between Simplified and Traditional Chinese Based on Complex Network Analysis of the Word Co-Occurrence Networks Jiang, Zhongqiang Zhao, Dongmei Zheng, Jiangbin Chen, Yidong Comput Intell Neurosci Research Article Currently, most work on comparing differences between simplified and traditional Chinese only focuses on the character or lexical level, without taking the global differences into consideration. In order to solve this problem, this paper proposes to use complex network analysis of word co-occurrence networks, which have been successfully applied to the language analysis research and can tackle global characters and explore the differences between simplified and traditional Chinese. Specially, we first constructed a word co-occurrence network for simplified and traditional Chinese using selected news corpora. Then, the complex network analysis methods were performed, including network statistics analysis, kernel lexicon comparison, and motif analysis, to gain a global understanding of these networks. After that, the networks were compared based on the properties obtained. Through comparison, we can obtain three interesting results: first, the co-occurrence networks of simplified Chinese and traditional Chinese are both small-world and scale-free networks. However, given the same corpus size, the co-occurrence networks of traditional Chinese tend to have more nodes, which may be due to a large number of one-to-many character/word mappings from simplified Chinese to traditional Chinese; second, since traditional Chinese retains more ancient Chinese words and uses fewer weak verbs, the traditional Chinese kernel lexicons have more entries than the simplified Chinese kernel lexicons; third, motif analysis shows that there is no difference between the simplified Chinese network and the corresponding traditional Chinese network, which means that simplified and traditional Chinese are semantically consistent. Hindawi 2020-12-03 /pmc/articles/PMC7728479/ /pubmed/33343654 http://dx.doi.org/10.1155/2020/8863847 Text en Copyright © 2020 Zhongqiang Jiang et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Jiang, Zhongqiang Zhao, Dongmei Zheng, Jiangbin Chen, Yidong A Study on Differences between Simplified and Traditional Chinese Based on Complex Network Analysis of the Word Co-Occurrence Networks |
title | A Study on Differences between Simplified and Traditional Chinese Based on Complex Network Analysis of the Word Co-Occurrence Networks |
title_full | A Study on Differences between Simplified and Traditional Chinese Based on Complex Network Analysis of the Word Co-Occurrence Networks |
title_fullStr | A Study on Differences between Simplified and Traditional Chinese Based on Complex Network Analysis of the Word Co-Occurrence Networks |
title_full_unstemmed | A Study on Differences between Simplified and Traditional Chinese Based on Complex Network Analysis of the Word Co-Occurrence Networks |
title_short | A Study on Differences between Simplified and Traditional Chinese Based on Complex Network Analysis of the Word Co-Occurrence Networks |
title_sort | study on differences between simplified and traditional chinese based on complex network analysis of the word co-occurrence networks |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728479/ https://www.ncbi.nlm.nih.gov/pubmed/33343654 http://dx.doi.org/10.1155/2020/8863847 |
work_keys_str_mv | AT jiangzhongqiang astudyondifferencesbetweensimplifiedandtraditionalchinesebasedoncomplexnetworkanalysisofthewordcooccurrencenetworks AT zhaodongmei astudyondifferencesbetweensimplifiedandtraditionalchinesebasedoncomplexnetworkanalysisofthewordcooccurrencenetworks AT zhengjiangbin astudyondifferencesbetweensimplifiedandtraditionalchinesebasedoncomplexnetworkanalysisofthewordcooccurrencenetworks AT chenyidong astudyondifferencesbetweensimplifiedandtraditionalchinesebasedoncomplexnetworkanalysisofthewordcooccurrencenetworks AT jiangzhongqiang studyondifferencesbetweensimplifiedandtraditionalchinesebasedoncomplexnetworkanalysisofthewordcooccurrencenetworks AT zhaodongmei studyondifferencesbetweensimplifiedandtraditionalchinesebasedoncomplexnetworkanalysisofthewordcooccurrencenetworks AT zhengjiangbin studyondifferencesbetweensimplifiedandtraditionalchinesebasedoncomplexnetworkanalysisofthewordcooccurrencenetworks AT chenyidong studyondifferencesbetweensimplifiedandtraditionalchinesebasedoncomplexnetworkanalysisofthewordcooccurrencenetworks |