Cargando…

Malay sentiment analysis based on combined classification approaches and Senti-lexicon algorithm

Sentiment analysis techniques are increasingly exploited to categorize the opinion text to one or more predefined sentiment classes for the creation and automated maintenance of review-aggregation websites. In this paper, a Malay sentiment analysis classification model is proposed to improve classif...

Descripción completa

Detalles Bibliográficos
Autores principales: Al-Saffar, Ahmed, Awang, Suryanti, Tao, Hai, Omar, Nazlia, Al-Saiagh, Wafaa, Al-bared, Mohammed
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5912726/
https://www.ncbi.nlm.nih.gov/pubmed/29684036
http://dx.doi.org/10.1371/journal.pone.0194852
_version_ 1783316411422081024
author Al-Saffar, Ahmed
Awang, Suryanti
Tao, Hai
Omar, Nazlia
Al-Saiagh, Wafaa
Al-bared, Mohammed
author_facet Al-Saffar, Ahmed
Awang, Suryanti
Tao, Hai
Omar, Nazlia
Al-Saiagh, Wafaa
Al-bared, Mohammed
author_sort Al-Saffar, Ahmed
collection PubMed
description Sentiment analysis techniques are increasingly exploited to categorize the opinion text to one or more predefined sentiment classes for the creation and automated maintenance of review-aggregation websites. In this paper, a Malay sentiment analysis classification model is proposed to improve classification performances based on the semantic orientation and machine learning approaches. First, a total of 2,478 Malay sentiment-lexicon phrases and words are assigned with a synonym and stored with the help of more than one Malay native speaker, and the polarity is manually allotted with a score. In addition, the supervised machine learning approaches and lexicon knowledge method are combined for Malay sentiment classification with evaluating thirteen features. Finally, three individual classifiers and a combined classifier are used to evaluate the classification accuracy. In experimental results, a wide-range of comparative experiments is conducted on a Malay Reviews Corpus (MRC), and it demonstrates that the feature extraction improves the performance of Malay sentiment analysis based on the combined classification. However, the results depend on three factors, the features, the number of features and the classification approach.
format Online
Article
Text
id pubmed-5912726
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-59127262018-05-05 Malay sentiment analysis based on combined classification approaches and Senti-lexicon algorithm Al-Saffar, Ahmed Awang, Suryanti Tao, Hai Omar, Nazlia Al-Saiagh, Wafaa Al-bared, Mohammed PLoS One Research Article Sentiment analysis techniques are increasingly exploited to categorize the opinion text to one or more predefined sentiment classes for the creation and automated maintenance of review-aggregation websites. In this paper, a Malay sentiment analysis classification model is proposed to improve classification performances based on the semantic orientation and machine learning approaches. First, a total of 2,478 Malay sentiment-lexicon phrases and words are assigned with a synonym and stored with the help of more than one Malay native speaker, and the polarity is manually allotted with a score. In addition, the supervised machine learning approaches and lexicon knowledge method are combined for Malay sentiment classification with evaluating thirteen features. Finally, three individual classifiers and a combined classifier are used to evaluate the classification accuracy. In experimental results, a wide-range of comparative experiments is conducted on a Malay Reviews Corpus (MRC), and it demonstrates that the feature extraction improves the performance of Malay sentiment analysis based on the combined classification. However, the results depend on three factors, the features, the number of features and the classification approach. Public Library of Science 2018-04-23 /pmc/articles/PMC5912726/ /pubmed/29684036 http://dx.doi.org/10.1371/journal.pone.0194852 Text en © 2018 Al-Saffar et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Al-Saffar, Ahmed
Awang, Suryanti
Tao, Hai
Omar, Nazlia
Al-Saiagh, Wafaa
Al-bared, Mohammed
Malay sentiment analysis based on combined classification approaches and Senti-lexicon algorithm
title Malay sentiment analysis based on combined classification approaches and Senti-lexicon algorithm
title_full Malay sentiment analysis based on combined classification approaches and Senti-lexicon algorithm
title_fullStr Malay sentiment analysis based on combined classification approaches and Senti-lexicon algorithm
title_full_unstemmed Malay sentiment analysis based on combined classification approaches and Senti-lexicon algorithm
title_short Malay sentiment analysis based on combined classification approaches and Senti-lexicon algorithm
title_sort malay sentiment analysis based on combined classification approaches and senti-lexicon algorithm
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5912726/
https://www.ncbi.nlm.nih.gov/pubmed/29684036
http://dx.doi.org/10.1371/journal.pone.0194852
work_keys_str_mv AT alsaffarahmed malaysentimentanalysisbasedoncombinedclassificationapproachesandsentilexiconalgorithm
AT awangsuryanti malaysentimentanalysisbasedoncombinedclassificationapproachesandsentilexiconalgorithm
AT taohai malaysentimentanalysisbasedoncombinedclassificationapproachesandsentilexiconalgorithm
AT omarnazlia malaysentimentanalysisbasedoncombinedclassificationapproachesandsentilexiconalgorithm
AT alsaiaghwafaa malaysentimentanalysisbasedoncombinedclassificationapproachesandsentilexiconalgorithm
AT albaredmohammed malaysentimentanalysisbasedoncombinedclassificationapproachesandsentilexiconalgorithm