Cargando…

Sentiment analysis techniques, challenges, and opportunities: Urdu language-based analytical study

Sentiment analysis in research involves the processing and analysis of sentiments from textual data. The sentiment analysis for high resource languages such as English and French has been carried out effectively in the past. However, its applications are comparatively few for resource-poor languages...

Descripción completa

Detalles Bibliográficos
Autores principales: Liaqat, Muhammad Irzam, Awais Hassan, Muhammad, Shoaib, Muhammad, Khurshid, Syed Khaldoon, Shamseldin, Mohamed A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9454799/
https://www.ncbi.nlm.nih.gov/pubmed/36091980
http://dx.doi.org/10.7717/peerj-cs.1032
_version_ 1784785436819849216
author Liaqat, Muhammad Irzam
Awais Hassan, Muhammad
Shoaib, Muhammad
Khurshid, Syed Khaldoon
Shamseldin, Mohamed A.
author_facet Liaqat, Muhammad Irzam
Awais Hassan, Muhammad
Shoaib, Muhammad
Khurshid, Syed Khaldoon
Shamseldin, Mohamed A.
author_sort Liaqat, Muhammad Irzam
collection PubMed
description Sentiment analysis in research involves the processing and analysis of sentiments from textual data. The sentiment analysis for high resource languages such as English and French has been carried out effectively in the past. However, its applications are comparatively few for resource-poor languages due to a lack of textual resources. This systematic literature explores different aspects of Urdu-based sentiment analysis, a classic case of poor resource language. While Urdu is a South Asian language understood by one hundred and sixty-nine million people across the planet. There are various shortcomings in the literature, including limitation of large corpora, language parsers, and lack of pre-trained machine learning models that result in poor performance. This article has analyzed and evaluated studies addressing machine learning-based Urdu sentiment analysis. After searching and filtering, forty articles have been inspected. Research objectives have been proposed that lead to research questions. Our searches were organized in digital repositories after selecting and screening relevant studies. Data was extracted from these studies. Our work on the existing literature reflects that sentiment classification performance can be improved by overcoming the challenges such as word sense disambiguation and massive datasets. Furthermore, Urdu-based language constructs, including language parsers and emoticons, context-level sentiment analysis techniques, pre-processing methods, and lexical resources, can also be improved.
format Online
Article
Text
id pubmed-9454799
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-94547992022-09-09 Sentiment analysis techniques, challenges, and opportunities: Urdu language-based analytical study Liaqat, Muhammad Irzam Awais Hassan, Muhammad Shoaib, Muhammad Khurshid, Syed Khaldoon Shamseldin, Mohamed A. PeerJ Comput Sci Natural Language and Speech Sentiment analysis in research involves the processing and analysis of sentiments from textual data. The sentiment analysis for high resource languages such as English and French has been carried out effectively in the past. However, its applications are comparatively few for resource-poor languages due to a lack of textual resources. This systematic literature explores different aspects of Urdu-based sentiment analysis, a classic case of poor resource language. While Urdu is a South Asian language understood by one hundred and sixty-nine million people across the planet. There are various shortcomings in the literature, including limitation of large corpora, language parsers, and lack of pre-trained machine learning models that result in poor performance. This article has analyzed and evaluated studies addressing machine learning-based Urdu sentiment analysis. After searching and filtering, forty articles have been inspected. Research objectives have been proposed that lead to research questions. Our searches were organized in digital repositories after selecting and screening relevant studies. Data was extracted from these studies. Our work on the existing literature reflects that sentiment classification performance can be improved by overcoming the challenges such as word sense disambiguation and massive datasets. Furthermore, Urdu-based language constructs, including language parsers and emoticons, context-level sentiment analysis techniques, pre-processing methods, and lexical resources, can also be improved. PeerJ Inc. 2022-08-31 /pmc/articles/PMC9454799/ /pubmed/36091980 http://dx.doi.org/10.7717/peerj-cs.1032 Text en ©2022 Liaqat et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Natural Language and Speech
Liaqat, Muhammad Irzam
Awais Hassan, Muhammad
Shoaib, Muhammad
Khurshid, Syed Khaldoon
Shamseldin, Mohamed A.
Sentiment analysis techniques, challenges, and opportunities: Urdu language-based analytical study
title Sentiment analysis techniques, challenges, and opportunities: Urdu language-based analytical study
title_full Sentiment analysis techniques, challenges, and opportunities: Urdu language-based analytical study
title_fullStr Sentiment analysis techniques, challenges, and opportunities: Urdu language-based analytical study
title_full_unstemmed Sentiment analysis techniques, challenges, and opportunities: Urdu language-based analytical study
title_short Sentiment analysis techniques, challenges, and opportunities: Urdu language-based analytical study
title_sort sentiment analysis techniques, challenges, and opportunities: urdu language-based analytical study
topic Natural Language and Speech
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9454799/
https://www.ncbi.nlm.nih.gov/pubmed/36091980
http://dx.doi.org/10.7717/peerj-cs.1032
work_keys_str_mv AT liaqatmuhammadirzam sentimentanalysistechniqueschallengesandopportunitiesurdulanguagebasedanalyticalstudy
AT awaishassanmuhammad sentimentanalysistechniqueschallengesandopportunitiesurdulanguagebasedanalyticalstudy
AT shoaibmuhammad sentimentanalysistechniqueschallengesandopportunitiesurdulanguagebasedanalyticalstudy
AT khurshidsyedkhaldoon sentimentanalysistechniqueschallengesandopportunitiesurdulanguagebasedanalyticalstudy
AT shamseldinmohameda sentimentanalysistechniqueschallengesandopportunitiesurdulanguagebasedanalyticalstudy