Cargando…

Sentiment classification for employees reviews using regression vector- stochastic gradient descent classifier (RV-SGDC)

The satisfaction of employees is very important for any organization to make sufficient progress in production and to achieve its goals. Organizations try to keep their employees satisfied by making their policies according to employees’ demands which help to create a good environment for the collec...

Descripción completa

Detalles Bibliográficos
Autores principales: Gaye, Babacar, Zhang, Dezheng, Wulamu, Aziguli
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8507482/
https://www.ncbi.nlm.nih.gov/pubmed/34712795
http://dx.doi.org/10.7717/peerj-cs.712
_version_ 1784581866152525824
author Gaye, Babacar
Zhang, Dezheng
Wulamu, Aziguli
author_facet Gaye, Babacar
Zhang, Dezheng
Wulamu, Aziguli
author_sort Gaye, Babacar
collection PubMed
description The satisfaction of employees is very important for any organization to make sufficient progress in production and to achieve its goals. Organizations try to keep their employees satisfied by making their policies according to employees’ demands which help to create a good environment for the collective. For this reason, it is beneficial for organizations to perform staff satisfaction surveys to be analyzed, allowing them to gauge the levels of satisfaction among employees. Sentiment analysis is an approach that can assist in this regard as it categorizes sentiments of reviews into positive and negative results. In this study, we perform experiments for the world’s big six companies and classify their employees’ reviews based on their sentiments. For this, we proposed an approach using lexicon-based and machine learning based techniques. Firstly, we extracted the sentiments of employees from text reviews and labeled the dataset as positive and negative using TextBlob. Then we proposed a hybrid/voting model named Regression Vector-Stochastic Gradient Descent Classifier (RV-SGDC) for sentiment classification. RV-SGDC is a combination of logistic regression, support vector machines, and stochastic gradient descent. We combined these models under a majority voting criteria. We also used other machine learning models in the performance comparison of RV-SGDC. Further, three feature extraction techniques: term frequency-inverse document frequency (TF-IDF), bag of words, and global vectors are used to train learning models. We evaluated the performance of all models in terms of accuracy, precision, recall, and F1 score. The results revealed that RV-SGDC outperforms with a 0.97 accuracy score using the TF-IDF feature due to its hybrid architecture.
format Online
Article
Text
id pubmed-8507482
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-85074822021-10-27 Sentiment classification for employees reviews using regression vector- stochastic gradient descent classifier (RV-SGDC) Gaye, Babacar Zhang, Dezheng Wulamu, Aziguli PeerJ Comput Sci Algorithms and Analysis of Algorithms The satisfaction of employees is very important for any organization to make sufficient progress in production and to achieve its goals. Organizations try to keep their employees satisfied by making their policies according to employees’ demands which help to create a good environment for the collective. For this reason, it is beneficial for organizations to perform staff satisfaction surveys to be analyzed, allowing them to gauge the levels of satisfaction among employees. Sentiment analysis is an approach that can assist in this regard as it categorizes sentiments of reviews into positive and negative results. In this study, we perform experiments for the world’s big six companies and classify their employees’ reviews based on their sentiments. For this, we proposed an approach using lexicon-based and machine learning based techniques. Firstly, we extracted the sentiments of employees from text reviews and labeled the dataset as positive and negative using TextBlob. Then we proposed a hybrid/voting model named Regression Vector-Stochastic Gradient Descent Classifier (RV-SGDC) for sentiment classification. RV-SGDC is a combination of logistic regression, support vector machines, and stochastic gradient descent. We combined these models under a majority voting criteria. We also used other machine learning models in the performance comparison of RV-SGDC. Further, three feature extraction techniques: term frequency-inverse document frequency (TF-IDF), bag of words, and global vectors are used to train learning models. We evaluated the performance of all models in terms of accuracy, precision, recall, and F1 score. The results revealed that RV-SGDC outperforms with a 0.97 accuracy score using the TF-IDF feature due to its hybrid architecture. PeerJ Inc. 2021-09-23 /pmc/articles/PMC8507482/ /pubmed/34712795 http://dx.doi.org/10.7717/peerj-cs.712 Text en ©2021 Gaye et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Algorithms and Analysis of Algorithms
Gaye, Babacar
Zhang, Dezheng
Wulamu, Aziguli
Sentiment classification for employees reviews using regression vector- stochastic gradient descent classifier (RV-SGDC)
title Sentiment classification for employees reviews using regression vector- stochastic gradient descent classifier (RV-SGDC)
title_full Sentiment classification for employees reviews using regression vector- stochastic gradient descent classifier (RV-SGDC)
title_fullStr Sentiment classification for employees reviews using regression vector- stochastic gradient descent classifier (RV-SGDC)
title_full_unstemmed Sentiment classification for employees reviews using regression vector- stochastic gradient descent classifier (RV-SGDC)
title_short Sentiment classification for employees reviews using regression vector- stochastic gradient descent classifier (RV-SGDC)
title_sort sentiment classification for employees reviews using regression vector- stochastic gradient descent classifier (rv-sgdc)
topic Algorithms and Analysis of Algorithms
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8507482/
https://www.ncbi.nlm.nih.gov/pubmed/34712795
http://dx.doi.org/10.7717/peerj-cs.712
work_keys_str_mv AT gayebabacar sentimentclassificationforemployeesreviewsusingregressionvectorstochasticgradientdescentclassifierrvsgdc
AT zhangdezheng sentimentclassificationforemployeesreviewsusingregressionvectorstochasticgradientdescentclassifierrvsgdc
AT wulamuaziguli sentimentclassificationforemployeesreviewsusingregressionvectorstochasticgradientdescentclassifierrvsgdc