
Character gated recurrent neural networks for Arabic sentiment analysis

Sentiment analysis is a Natural Language Processing (NLP) task concerned with opinions, attitudes, emotions, and feelings. It applies NLP techniques for identifying and detecting personal information from opinionated text. Sentiment analysis deduces the author's perspective regarding a topic an...


Bibliographic Details
Main Authors: Omara, Eslam; Mousa, Mervat; Ismail, Nabil
Format: Online Article Text
Language: English
Published: Nature Publishing Group UK 2022
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9192763/
https://www.ncbi.nlm.nih.gov/pubmed/35697814
http://dx.doi.org/10.1038/s41598-022-13153-w
_version_ 1784726314549248000
author Omara, Eslam
Mousa, Mervat
Ismail, Nabil
author_facet Omara, Eslam
Mousa, Mervat
Ismail, Nabil
author_sort Omara, Eslam
collection PubMed
description Sentiment analysis is a Natural Language Processing (NLP) task concerned with opinions, attitudes, emotions, and feelings. It applies NLP techniques to identify and detect subjective information in opinionated text. Sentiment analysis deduces the author's perspective on a topic and classifies the attitude polarity as positive, negative, or neutral. Deep architectures applied to NLP have reported a noticeable improvement in performance over traditional approaches, which is attributed to their capability to discover and discriminate features captured from large datasets. Recurrent neural networks (RNNs) and their variants, Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), Bi-directional Long Short-Term Memory (Bi-LSTM), and Bi-directional Gated Recurrent Unit (Bi-GRU), are robust at processing sequential data. The gated variants are commonly used for NLP applications because, unlike plain RNNs, they can combat vanishing and exploding gradients. Convolutional Neural Networks (CNNs) have also been applied efficiently to detect features implicitly in NLP tasks. In the proposed work, different deep learning architectures composed of LSTM, GRU, Bi-LSTM, and Bi-GRU are used and compared with the aim of improving Arabic sentiment analysis performance. The models are implemented and tested on a character-level representation of the opinion entries. Deep hybrid models that combine multiple CNN layers with LSTM, GRU, Bi-LSTM, and Bi-GRU are also tested. Two datasets are used to implement the models: the first is a hybrid combined dataset, and the second is the Book Review Arabic Dataset (BRAD). The results indicate that character representation can capture morphological and semantic features, and hence that it can be employed for text representation in different Arabic language understanding and processing tasks.
format Online
Article
Text
id pubmed-9192763
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-9192763 2022-06-15 Character gated recurrent neural networks for Arabic sentiment analysis Omara, Eslam Mousa, Mervat Ismail, Nabil Sci Rep Article Nature Publishing Group UK 2022-06-13 /pmc/articles/PMC9192763/ /pubmed/35697814 http://dx.doi.org/10.1038/s41598-022-13153-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/ Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
title Character gated recurrent neural networks for Arabic sentiment analysis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9192763/
https://www.ncbi.nlm.nih.gov/pubmed/35697814
http://dx.doi.org/10.1038/s41598-022-13153-w
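
The abstract states that the models operate on a character representation of each opinion entry. As a rough illustration only, the sketch below shows one common way to turn text into fixed-length sequences of character indices that an embedding layer followed by a recurrent layer could consume. The record does not describe the authors' actual preprocessing, so the function names (`build_char_vocab`, `encode`), the reserved `PAD`/`UNK` indices, the toy reviews, and the `max_len` value are all assumptions made for this example.

```python
from typing import Dict, List

# Reserved indices (a hypothetical convention, not taken from the article).
PAD, UNK = 0, 1


def build_char_vocab(texts: List[str]) -> Dict[str, int]:
    """Map every distinct character in the corpus to an integer index >= 2."""
    chars = sorted({ch for text in texts for ch in text})
    return {ch: i + 2 for i, ch in enumerate(chars)}


def encode(text: str, vocab: Dict[str, int], max_len: int) -> List[int]:
    """Convert text to exactly max_len character indices.

    Longer inputs are truncated, shorter ones are right-padded with PAD,
    and characters missing from the vocabulary map to UNK.
    """
    ids = [vocab.get(ch, UNK) for ch in text[:max_len]]
    return ids + [PAD] * (max_len - len(ids))


# Toy reviews, invented for illustration: "a great book", "a boring book".
reviews = ["كتاب رائع", "كتاب ممل"]
vocab = build_char_vocab(reviews)
encoded = [encode(r, vocab, max_len=12) for r in reviews]
```

Because the vocabulary is built from characters rather than words, inflected or misspelled forms that share a stem also share most of their index sequence, which is one intuition for why a character-level model might pick up morphological regularities.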