Cargando…

Sentiment analysis in multilingual context: Comparative analysis of machine learning and hybrid deep learning models

This research paper investigates the efficacy of various machine learning models, including deep learning and hybrid models, for text classification in the English and Bangla languages. The study focuses on sentiment analysis of comments from a popular Bengali e-commerce site, "DARAZ," whi...

Descripción completa

Detalles Bibliográficos
Autores principales: Das, Rajesh Kumar, Islam, Mirajul, Hasan, Md Mahmudul, Razia, Sultana, Hassan, Mocksidul, Khushbu, Sharun Akter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10560063/
https://www.ncbi.nlm.nih.gov/pubmed/37809397
http://dx.doi.org/10.1016/j.heliyon.2023.e20281
_version_ 1785117649142808576
author Das, Rajesh Kumar
Islam, Mirajul
Hasan, Md Mahmudul
Razia, Sultana
Hassan, Mocksidul
Khushbu, Sharun Akter
author_facet Das, Rajesh Kumar
Islam, Mirajul
Hasan, Md Mahmudul
Razia, Sultana
Hassan, Mocksidul
Khushbu, Sharun Akter
author_sort Das, Rajesh Kumar
collection PubMed
description This research paper investigates the efficacy of various machine learning models, including deep learning and hybrid models, for text classification in the English and Bangla languages. The study focuses on sentiment analysis of comments from a popular Bengali e-commerce site, "DARAZ," which comprises both Bangla and translated English reviews. The primary objective of this study is to conduct a comparative analysis of various models, evaluating their efficacy in the domain of sentiment analysis. The research methodology includes implementing seven machine learning models and deep learning models, such as Long Short-Term Memory (LSTM), Bidirectional LSTM (Bi-LSTM), Convolutional 1D (Conv1D), and a combined Conv1D-LSTM. Preprocessing techniques are applied to a modified text set to enhance model accuracy. The major conclusion of the study is that Support Vector Machine (SVM) models exhibit superior performance compared to other models, achieving an accuracy of 82.56% for English text sentiment analysis and 86.43% for Bangla text sentiment analysis using the porter stemming algorithm. Additionally, the Bi-LSTM Based Model demonstrates the best performance among the deep learning models, achieving an accuracy of 78.10% for English text and 83.72% for Bangla text using porter stemming. This study signifies significant progress in natural language processing research, particularly for Bangla, by enhancing improved text classification models and methodologies. The results of this research make a significant contribution to the field of sentiment analysis and offer valuable insights for future research and practical applications.
format Online
Article
Text
id pubmed-10560063
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-105600632023-10-08 Sentiment analysis in multilingual context: Comparative analysis of machine learning and hybrid deep learning models Das, Rajesh Kumar Islam, Mirajul Hasan, Md Mahmudul Razia, Sultana Hassan, Mocksidul Khushbu, Sharun Akter Heliyon Research Article This research paper investigates the efficacy of various machine learning models, including deep learning and hybrid models, for text classification in the English and Bangla languages. The study focuses on sentiment analysis of comments from a popular Bengali e-commerce site, "DARAZ," which comprises both Bangla and translated English reviews. The primary objective of this study is to conduct a comparative analysis of various models, evaluating their efficacy in the domain of sentiment analysis. The research methodology includes implementing seven machine learning models and deep learning models, such as Long Short-Term Memory (LSTM), Bidirectional LSTM (Bi-LSTM), Convolutional 1D (Conv1D), and a combined Conv1D-LSTM. Preprocessing techniques are applied to a modified text set to enhance model accuracy. The major conclusion of the study is that Support Vector Machine (SVM) models exhibit superior performance compared to other models, achieving an accuracy of 82.56% for English text sentiment analysis and 86.43% for Bangla text sentiment analysis using the porter stemming algorithm. Additionally, the Bi-LSTM Based Model demonstrates the best performance among the deep learning models, achieving an accuracy of 78.10% for English text and 83.72% for Bangla text using porter stemming. This study signifies significant progress in natural language processing research, particularly for Bangla, by enhancing improved text classification models and methodologies. The results of this research make a significant contribution to the field of sentiment analysis and offer valuable insights for future research and practical applications. Elsevier 2023-09-19 /pmc/articles/PMC10560063/ /pubmed/37809397 http://dx.doi.org/10.1016/j.heliyon.2023.e20281 Text en © 2023 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Research Article
Das, Rajesh Kumar
Islam, Mirajul
Hasan, Md Mahmudul
Razia, Sultana
Hassan, Mocksidul
Khushbu, Sharun Akter
Sentiment analysis in multilingual context: Comparative analysis of machine learning and hybrid deep learning models
title Sentiment analysis in multilingual context: Comparative analysis of machine learning and hybrid deep learning models
title_full Sentiment analysis in multilingual context: Comparative analysis of machine learning and hybrid deep learning models
title_fullStr Sentiment analysis in multilingual context: Comparative analysis of machine learning and hybrid deep learning models
title_full_unstemmed Sentiment analysis in multilingual context: Comparative analysis of machine learning and hybrid deep learning models
title_short Sentiment analysis in multilingual context: Comparative analysis of machine learning and hybrid deep learning models
title_sort sentiment analysis in multilingual context: comparative analysis of machine learning and hybrid deep learning models
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10560063/
https://www.ncbi.nlm.nih.gov/pubmed/37809397
http://dx.doi.org/10.1016/j.heliyon.2023.e20281
work_keys_str_mv AT dasrajeshkumar sentimentanalysisinmultilingualcontextcomparativeanalysisofmachinelearningandhybriddeeplearningmodels
AT islammirajul sentimentanalysisinmultilingualcontextcomparativeanalysisofmachinelearningandhybriddeeplearningmodels
AT hasanmdmahmudul sentimentanalysisinmultilingualcontextcomparativeanalysisofmachinelearningandhybriddeeplearningmodels
AT raziasultana sentimentanalysisinmultilingualcontextcomparativeanalysisofmachinelearningandhybriddeeplearningmodels
AT hassanmocksidul sentimentanalysisinmultilingualcontextcomparativeanalysisofmachinelearningandhybriddeeplearningmodels
AT khushbusharunakter sentimentanalysisinmultilingualcontextcomparativeanalysisofmachinelearningandhybriddeeplearningmodels