Cargando…
Sentiment analysis techniques, challenges, and opportunities: Urdu language-based analytical study
Sentiment analysis in research involves the processing and analysis of sentiments from textual data. The sentiment analysis for high resource languages such as English and French has been carried out effectively in the past. However, its applications are comparatively few for resource-poor languages...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9454799/ https://www.ncbi.nlm.nih.gov/pubmed/36091980 http://dx.doi.org/10.7717/peerj-cs.1032 |
_version_ | 1784785436819849216 |
---|---|
author | Liaqat, Muhammad Irzam Awais Hassan, Muhammad Shoaib, Muhammad Khurshid, Syed Khaldoon Shamseldin, Mohamed A. |
author_facet | Liaqat, Muhammad Irzam Awais Hassan, Muhammad Shoaib, Muhammad Khurshid, Syed Khaldoon Shamseldin, Mohamed A. |
author_sort | Liaqat, Muhammad Irzam |
collection | PubMed |
description | Sentiment analysis in research involves the processing and analysis of sentiments from textual data. The sentiment analysis for high resource languages such as English and French has been carried out effectively in the past. However, its applications are comparatively few for resource-poor languages due to a lack of textual resources. This systematic literature explores different aspects of Urdu-based sentiment analysis, a classic case of poor resource language. While Urdu is a South Asian language understood by one hundred and sixty-nine million people across the planet. There are various shortcomings in the literature, including limitation of large corpora, language parsers, and lack of pre-trained machine learning models that result in poor performance. This article has analyzed and evaluated studies addressing machine learning-based Urdu sentiment analysis. After searching and filtering, forty articles have been inspected. Research objectives have been proposed that lead to research questions. Our searches were organized in digital repositories after selecting and screening relevant studies. Data was extracted from these studies. Our work on the existing literature reflects that sentiment classification performance can be improved by overcoming the challenges such as word sense disambiguation and massive datasets. Furthermore, Urdu-based language constructs, including language parsers and emoticons, context-level sentiment analysis techniques, pre-processing methods, and lexical resources, can also be improved. |
format | Online Article Text |
id | pubmed-9454799 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-94547992022-09-09 Sentiment analysis techniques, challenges, and opportunities: Urdu language-based analytical study Liaqat, Muhammad Irzam Awais Hassan, Muhammad Shoaib, Muhammad Khurshid, Syed Khaldoon Shamseldin, Mohamed A. PeerJ Comput Sci Natural Language and Speech Sentiment analysis in research involves the processing and analysis of sentiments from textual data. The sentiment analysis for high resource languages such as English and French has been carried out effectively in the past. However, its applications are comparatively few for resource-poor languages due to a lack of textual resources. This systematic literature explores different aspects of Urdu-based sentiment analysis, a classic case of poor resource language. While Urdu is a South Asian language understood by one hundred and sixty-nine million people across the planet. There are various shortcomings in the literature, including limitation of large corpora, language parsers, and lack of pre-trained machine learning models that result in poor performance. This article has analyzed and evaluated studies addressing machine learning-based Urdu sentiment analysis. After searching and filtering, forty articles have been inspected. Research objectives have been proposed that lead to research questions. Our searches were organized in digital repositories after selecting and screening relevant studies. Data was extracted from these studies. Our work on the existing literature reflects that sentiment classification performance can be improved by overcoming the challenges such as word sense disambiguation and massive datasets. Furthermore, Urdu-based language constructs, including language parsers and emoticons, context-level sentiment analysis techniques, pre-processing methods, and lexical resources, can also be improved. PeerJ Inc. 2022-08-31 /pmc/articles/PMC9454799/ /pubmed/36091980 http://dx.doi.org/10.7717/peerj-cs.1032 Text en ©2022 Liaqat et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Natural Language and Speech Liaqat, Muhammad Irzam Awais Hassan, Muhammad Shoaib, Muhammad Khurshid, Syed Khaldoon Shamseldin, Mohamed A. Sentiment analysis techniques, challenges, and opportunities: Urdu language-based analytical study |
title | Sentiment analysis techniques, challenges, and opportunities: Urdu language-based analytical study |
title_full | Sentiment analysis techniques, challenges, and opportunities: Urdu language-based analytical study |
title_fullStr | Sentiment analysis techniques, challenges, and opportunities: Urdu language-based analytical study |
title_full_unstemmed | Sentiment analysis techniques, challenges, and opportunities: Urdu language-based analytical study |
title_short | Sentiment analysis techniques, challenges, and opportunities: Urdu language-based analytical study |
title_sort | sentiment analysis techniques, challenges, and opportunities: urdu language-based analytical study |
topic | Natural Language and Speech |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9454799/ https://www.ncbi.nlm.nih.gov/pubmed/36091980 http://dx.doi.org/10.7717/peerj-cs.1032 |
work_keys_str_mv | AT liaqatmuhammadirzam sentimentanalysistechniqueschallengesandopportunitiesurdulanguagebasedanalyticalstudy AT awaishassanmuhammad sentimentanalysistechniqueschallengesandopportunitiesurdulanguagebasedanalyticalstudy AT shoaibmuhammad sentimentanalysistechniqueschallengesandopportunitiesurdulanguagebasedanalyticalstudy AT khurshidsyedkhaldoon sentimentanalysistechniqueschallengesandopportunitiesurdulanguagebasedanalyticalstudy AT shamseldinmohameda sentimentanalysistechniqueschallengesandopportunitiesurdulanguagebasedanalyticalstudy |