Cargando…

A Study on Differences between Simplified and Traditional Chinese Based on Complex Network Analysis of the Word Co-Occurrence Networks

Currently, most work on comparing differences between simplified and traditional Chinese only focuses on the character or lexical level, without taking the global differences into consideration. In order to solve this problem, this paper proposes to use complex network analysis of word co-occurrence...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Zhongqiang, Zhao, Dongmei, Zheng, Jiangbin, Chen, Yidong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728479/
https://www.ncbi.nlm.nih.gov/pubmed/33343654
http://dx.doi.org/10.1155/2020/8863847
_version_ 1783621284212506624
author Jiang, Zhongqiang
Zhao, Dongmei
Zheng, Jiangbin
Chen, Yidong
author_facet Jiang, Zhongqiang
Zhao, Dongmei
Zheng, Jiangbin
Chen, Yidong
author_sort Jiang, Zhongqiang
collection PubMed
description Currently, most work on comparing differences between simplified and traditional Chinese only focuses on the character or lexical level, without taking the global differences into consideration. In order to solve this problem, this paper proposes to use complex network analysis of word co-occurrence networks, which have been successfully applied to the language analysis research and can tackle global characters and explore the differences between simplified and traditional Chinese. Specially, we first constructed a word co-occurrence network for simplified and traditional Chinese using selected news corpora. Then, the complex network analysis methods were performed, including network statistics analysis, kernel lexicon comparison, and motif analysis, to gain a global understanding of these networks. After that, the networks were compared based on the properties obtained. Through comparison, we can obtain three interesting results: first, the co-occurrence networks of simplified Chinese and traditional Chinese are both small-world and scale-free networks. However, given the same corpus size, the co-occurrence networks of traditional Chinese tend to have more nodes, which may be due to a large number of one-to-many character/word mappings from simplified Chinese to traditional Chinese; second, since traditional Chinese retains more ancient Chinese words and uses fewer weak verbs, the traditional Chinese kernel lexicons have more entries than the simplified Chinese kernel lexicons; third, motif analysis shows that there is no difference between the simplified Chinese network and the corresponding traditional Chinese network, which means that simplified and traditional Chinese are semantically consistent.
format Online
Article
Text
id pubmed-7728479
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-77284792020-12-17 A Study on Differences between Simplified and Traditional Chinese Based on Complex Network Analysis of the Word Co-Occurrence Networks Jiang, Zhongqiang Zhao, Dongmei Zheng, Jiangbin Chen, Yidong Comput Intell Neurosci Research Article Currently, most work on comparing differences between simplified and traditional Chinese only focuses on the character or lexical level, without taking the global differences into consideration. In order to solve this problem, this paper proposes to use complex network analysis of word co-occurrence networks, which have been successfully applied to the language analysis research and can tackle global characters and explore the differences between simplified and traditional Chinese. Specially, we first constructed a word co-occurrence network for simplified and traditional Chinese using selected news corpora. Then, the complex network analysis methods were performed, including network statistics analysis, kernel lexicon comparison, and motif analysis, to gain a global understanding of these networks. After that, the networks were compared based on the properties obtained. Through comparison, we can obtain three interesting results: first, the co-occurrence networks of simplified Chinese and traditional Chinese are both small-world and scale-free networks. However, given the same corpus size, the co-occurrence networks of traditional Chinese tend to have more nodes, which may be due to a large number of one-to-many character/word mappings from simplified Chinese to traditional Chinese; second, since traditional Chinese retains more ancient Chinese words and uses fewer weak verbs, the traditional Chinese kernel lexicons have more entries than the simplified Chinese kernel lexicons; third, motif analysis shows that there is no difference between the simplified Chinese network and the corresponding traditional Chinese network, which means that simplified and traditional Chinese are semantically consistent. Hindawi 2020-12-03 /pmc/articles/PMC7728479/ /pubmed/33343654 http://dx.doi.org/10.1155/2020/8863847 Text en Copyright © 2020 Zhongqiang Jiang et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Jiang, Zhongqiang
Zhao, Dongmei
Zheng, Jiangbin
Chen, Yidong
A Study on Differences between Simplified and Traditional Chinese Based on Complex Network Analysis of the Word Co-Occurrence Networks
title A Study on Differences between Simplified and Traditional Chinese Based on Complex Network Analysis of the Word Co-Occurrence Networks
title_full A Study on Differences between Simplified and Traditional Chinese Based on Complex Network Analysis of the Word Co-Occurrence Networks
title_fullStr A Study on Differences between Simplified and Traditional Chinese Based on Complex Network Analysis of the Word Co-Occurrence Networks
title_full_unstemmed A Study on Differences between Simplified and Traditional Chinese Based on Complex Network Analysis of the Word Co-Occurrence Networks
title_short A Study on Differences between Simplified and Traditional Chinese Based on Complex Network Analysis of the Word Co-Occurrence Networks
title_sort study on differences between simplified and traditional chinese based on complex network analysis of the word co-occurrence networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728479/
https://www.ncbi.nlm.nih.gov/pubmed/33343654
http://dx.doi.org/10.1155/2020/8863847
work_keys_str_mv AT jiangzhongqiang astudyondifferencesbetweensimplifiedandtraditionalchinesebasedoncomplexnetworkanalysisofthewordcooccurrencenetworks
AT zhaodongmei astudyondifferencesbetweensimplifiedandtraditionalchinesebasedoncomplexnetworkanalysisofthewordcooccurrencenetworks
AT zhengjiangbin astudyondifferencesbetweensimplifiedandtraditionalchinesebasedoncomplexnetworkanalysisofthewordcooccurrencenetworks
AT chenyidong astudyondifferencesbetweensimplifiedandtraditionalchinesebasedoncomplexnetworkanalysisofthewordcooccurrencenetworks
AT jiangzhongqiang studyondifferencesbetweensimplifiedandtraditionalchinesebasedoncomplexnetworkanalysisofthewordcooccurrencenetworks
AT zhaodongmei studyondifferencesbetweensimplifiedandtraditionalchinesebasedoncomplexnetworkanalysisofthewordcooccurrencenetworks
AT zhengjiangbin studyondifferencesbetweensimplifiedandtraditionalchinesebasedoncomplexnetworkanalysisofthewordcooccurrencenetworks
AT chenyidong studyondifferencesbetweensimplifiedandtraditionalchinesebasedoncomplexnetworkanalysisofthewordcooccurrencenetworks