Cargando…

Analysis of big data job requirements based on K-means text clustering in China

This paper aims to understand the characteristics of domestic big data jobs requirements through k-means text clustering, help enterprises, and employees to identify big data talents, and promote the further development of big data-related research. Firstly, the crawler software is used to crawl the...

Descripción completa

Detalles Bibliográficos
Autores principales: Debao, Dai, Yinxia, Ma, Min, Zhao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8341572/
https://www.ncbi.nlm.nih.gov/pubmed/34351951
http://dx.doi.org/10.1371/journal.pone.0255419
_version_ 1783733940988674048
author Debao, Dai
Yinxia, Ma
Min, Zhao
author_facet Debao, Dai
Yinxia, Ma
Min, Zhao
author_sort Debao, Dai
collection PubMed
description This paper aims to understand the characteristics of domestic big data jobs requirements through k-means text clustering, help enterprises, and employees to identify big data talents, and promote the further development of big data-related research. Firstly, the crawler software is used to crawl the recruitment information about "big data" on the zhaopin.com recruitment website. Then, Jieba word segmentation and K-means text clustering are used to cluster big data recruitment positions, and the number of clustering was determined by the average sum of squares within the group. Finally, big data jobs are divided into 10 categories, and the urban distribution, salary level, education requirements, and experience requirements of big data jobs are discussed and analyzed from the perspectives of the overall data set and clustering results, to clarify the characteristics of big data job demands. The analysis results show that the job demands of big data are mainly distributed in first-tier cities and new first-tier cities. Enterprises are more inclined to job seekers with a college degree or bachelor’s degree and more than one year’s relevant experience. There are wage differences among different types of jobs. The higher the position, the higher the requirement for education and experience will be.
format Online
Article
Text
id pubmed-8341572
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-83415722021-08-06 Analysis of big data job requirements based on K-means text clustering in China Debao, Dai Yinxia, Ma Min, Zhao PLoS One Research Article This paper aims to understand the characteristics of domestic big data jobs requirements through k-means text clustering, help enterprises, and employees to identify big data talents, and promote the further development of big data-related research. Firstly, the crawler software is used to crawl the recruitment information about "big data" on the zhaopin.com recruitment website. Then, Jieba word segmentation and K-means text clustering are used to cluster big data recruitment positions, and the number of clustering was determined by the average sum of squares within the group. Finally, big data jobs are divided into 10 categories, and the urban distribution, salary level, education requirements, and experience requirements of big data jobs are discussed and analyzed from the perspectives of the overall data set and clustering results, to clarify the characteristics of big data job demands. The analysis results show that the job demands of big data are mainly distributed in first-tier cities and new first-tier cities. Enterprises are more inclined to job seekers with a college degree or bachelor’s degree and more than one year’s relevant experience. There are wage differences among different types of jobs. The higher the position, the higher the requirement for education and experience will be. Public Library of Science 2021-08-05 /pmc/articles/PMC8341572/ /pubmed/34351951 http://dx.doi.org/10.1371/journal.pone.0255419 Text en © 2021 Debao et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Debao, Dai
Yinxia, Ma
Min, Zhao
Analysis of big data job requirements based on K-means text clustering in China
title Analysis of big data job requirements based on K-means text clustering in China
title_full Analysis of big data job requirements based on K-means text clustering in China
title_fullStr Analysis of big data job requirements based on K-means text clustering in China
title_full_unstemmed Analysis of big data job requirements based on K-means text clustering in China
title_short Analysis of big data job requirements based on K-means text clustering in China
title_sort analysis of big data job requirements based on k-means text clustering in china
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8341572/
https://www.ncbi.nlm.nih.gov/pubmed/34351951
http://dx.doi.org/10.1371/journal.pone.0255419
work_keys_str_mv AT debaodai analysisofbigdatajobrequirementsbasedonkmeanstextclusteringinchina
AT yinxiama analysisofbigdatajobrequirementsbasedonkmeanstextclusteringinchina
AT minzhao analysisofbigdatajobrequirementsbasedonkmeanstextclusteringinchina