Cargando…
The construction of Chinese microblog gender-specific thesauruses and user gender classification
Based on the statistical features, short text messages published by different gender users are different in terms of the words and semantics used. In this paper, two new features are constructed after constructing a gender-specific thesaurus. A new classification model is constructed by combining th...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6223889/ https://www.ncbi.nlm.nih.gov/pubmed/30465023 http://dx.doi.org/10.1007/s41109-018-0104-1 |
_version_ | 1783369490037211136 |
---|---|
author | Zhu, Zhiliang Ke, Zejun Cui, Jiayin Yu, Hai Liu, Guoqi |
author_facet | Zhu, Zhiliang Ke, Zejun Cui, Jiayin Yu, Hai Liu, Guoqi |
author_sort | Zhu, Zhiliang |
collection | PubMed |
description | Based on the statistical features, short text messages published by different gender users are different in terms of the words and semantics used. In this paper, two new features are constructed after constructing a gender-specific thesaurus. A new classification model is constructed by combining the traditional statistical features and the improved text implicitness feature. The experimental evaluation performed on the Sina Weibo dataset demonstrated the effectiveness of gender-specific thesaurus-based features, and the improved text implicitness feature improved the accuracy of gender classification to 84.7%. |
format | Online Article Text |
id | pubmed-6223889 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-62238892018-11-19 The construction of Chinese microblog gender-specific thesauruses and user gender classification Zhu, Zhiliang Ke, Zejun Cui, Jiayin Yu, Hai Liu, Guoqi Appl Netw Sci Research Based on the statistical features, short text messages published by different gender users are different in terms of the words and semantics used. In this paper, two new features are constructed after constructing a gender-specific thesaurus. A new classification model is constructed by combining the traditional statistical features and the improved text implicitness feature. The experimental evaluation performed on the Sina Weibo dataset demonstrated the effectiveness of gender-specific thesaurus-based features, and the improved text implicitness feature improved the accuracy of gender classification to 84.7%. Springer International Publishing 2018-11-08 2018 /pmc/articles/PMC6223889/ /pubmed/30465023 http://dx.doi.org/10.1007/s41109-018-0104-1 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. |
spellingShingle | Research Zhu, Zhiliang Ke, Zejun Cui, Jiayin Yu, Hai Liu, Guoqi The construction of Chinese microblog gender-specific thesauruses and user gender classification |
title | The construction of Chinese microblog gender-specific thesauruses and user gender classification |
title_full | The construction of Chinese microblog gender-specific thesauruses and user gender classification |
title_fullStr | The construction of Chinese microblog gender-specific thesauruses and user gender classification |
title_full_unstemmed | The construction of Chinese microblog gender-specific thesauruses and user gender classification |
title_short | The construction of Chinese microblog gender-specific thesauruses and user gender classification |
title_sort | construction of chinese microblog gender-specific thesauruses and user gender classification |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6223889/ https://www.ncbi.nlm.nih.gov/pubmed/30465023 http://dx.doi.org/10.1007/s41109-018-0104-1 |
work_keys_str_mv | AT zhuzhiliang theconstructionofchinesemicrobloggenderspecificthesaurusesandusergenderclassification AT kezejun theconstructionofchinesemicrobloggenderspecificthesaurusesandusergenderclassification AT cuijiayin theconstructionofchinesemicrobloggenderspecificthesaurusesandusergenderclassification AT yuhai theconstructionofchinesemicrobloggenderspecificthesaurusesandusergenderclassification AT liuguoqi theconstructionofchinesemicrobloggenderspecificthesaurusesandusergenderclassification AT zhuzhiliang constructionofchinesemicrobloggenderspecificthesaurusesandusergenderclassification AT kezejun constructionofchinesemicrobloggenderspecificthesaurusesandusergenderclassification AT cuijiayin constructionofchinesemicrobloggenderspecificthesaurusesandusergenderclassification AT yuhai constructionofchinesemicrobloggenderspecificthesaurusesandusergenderclassification AT liuguoqi constructionofchinesemicrobloggenderspecificthesaurusesandusergenderclassification |