Cargando…
Analysis of big data job requirements based on K-means text clustering in China
This paper aims to understand the characteristics of domestic big data jobs requirements through k-means text clustering, help enterprises, and employees to identify big data talents, and promote the further development of big data-related research. Firstly, the crawler software is used to crawl the...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8341572/ https://www.ncbi.nlm.nih.gov/pubmed/34351951 http://dx.doi.org/10.1371/journal.pone.0255419 |
_version_ | 1783733940988674048 |
---|---|
author | Debao, Dai Yinxia, Ma Min, Zhao |
author_facet | Debao, Dai Yinxia, Ma Min, Zhao |
author_sort | Debao, Dai |
collection | PubMed |
description | This paper aims to understand the characteristics of domestic big data jobs requirements through k-means text clustering, help enterprises, and employees to identify big data talents, and promote the further development of big data-related research. Firstly, the crawler software is used to crawl the recruitment information about "big data" on the zhaopin.com recruitment website. Then, Jieba word segmentation and K-means text clustering are used to cluster big data recruitment positions, and the number of clustering was determined by the average sum of squares within the group. Finally, big data jobs are divided into 10 categories, and the urban distribution, salary level, education requirements, and experience requirements of big data jobs are discussed and analyzed from the perspectives of the overall data set and clustering results, to clarify the characteristics of big data job demands. The analysis results show that the job demands of big data are mainly distributed in first-tier cities and new first-tier cities. Enterprises are more inclined to job seekers with a college degree or bachelor’s degree and more than one year’s relevant experience. There are wage differences among different types of jobs. The higher the position, the higher the requirement for education and experience will be. |
format | Online Article Text |
id | pubmed-8341572 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-83415722021-08-06 Analysis of big data job requirements based on K-means text clustering in China Debao, Dai Yinxia, Ma Min, Zhao PLoS One Research Article This paper aims to understand the characteristics of domestic big data jobs requirements through k-means text clustering, help enterprises, and employees to identify big data talents, and promote the further development of big data-related research. Firstly, the crawler software is used to crawl the recruitment information about "big data" on the zhaopin.com recruitment website. Then, Jieba word segmentation and K-means text clustering are used to cluster big data recruitment positions, and the number of clustering was determined by the average sum of squares within the group. Finally, big data jobs are divided into 10 categories, and the urban distribution, salary level, education requirements, and experience requirements of big data jobs are discussed and analyzed from the perspectives of the overall data set and clustering results, to clarify the characteristics of big data job demands. The analysis results show that the job demands of big data are mainly distributed in first-tier cities and new first-tier cities. Enterprises are more inclined to job seekers with a college degree or bachelor’s degree and more than one year’s relevant experience. There are wage differences among different types of jobs. The higher the position, the higher the requirement for education and experience will be. Public Library of Science 2021-08-05 /pmc/articles/PMC8341572/ /pubmed/34351951 http://dx.doi.org/10.1371/journal.pone.0255419 Text en © 2021 Debao et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Debao, Dai Yinxia, Ma Min, Zhao Analysis of big data job requirements based on K-means text clustering in China |
title | Analysis of big data job requirements based on K-means text clustering in China |
title_full | Analysis of big data job requirements based on K-means text clustering in China |
title_fullStr | Analysis of big data job requirements based on K-means text clustering in China |
title_full_unstemmed | Analysis of big data job requirements based on K-means text clustering in China |
title_short | Analysis of big data job requirements based on K-means text clustering in China |
title_sort | analysis of big data job requirements based on k-means text clustering in china |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8341572/ https://www.ncbi.nlm.nih.gov/pubmed/34351951 http://dx.doi.org/10.1371/journal.pone.0255419 |
work_keys_str_mv | AT debaodai analysisofbigdatajobrequirementsbasedonkmeanstextclusteringinchina AT yinxiama analysisofbigdatajobrequirementsbasedonkmeanstextclusteringinchina AT minzhao analysisofbigdatajobrequirementsbasedonkmeanstextclusteringinchina |