Cargando…

Cross Lingual Sentiment Analysis: A Clustering-Based Bee Colony Instance Selection and Target-Based Feature Weighting Approach

The lack of sentiment resources in poor resource languages poses challenges for the sentiment analysis in which machine learning is involved. Cross-lingual and semi-supervised learning approaches have been deployed to represent the most common ways that can overcome this issue. However, performance...

Descripción completa

Detalles Bibliográficos
Autores principales:	Mohammed Almansor, Mohammed Abbas, Zhang, Chongfu, Khan, Wasiq, Hussain, Abir, Alhusaini, Naji
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7570551/ https://www.ncbi.nlm.nih.gov/pubmed/32942721 http://dx.doi.org/10.3390/s20185276

_version_	1783596972888817664
author	Mohammed Almansor, Mohammed Abbas Zhang, Chongfu Khan, Wasiq Hussain, Abir Alhusaini, Naji
author_facet	Mohammed Almansor, Mohammed Abbas Zhang, Chongfu Khan, Wasiq Hussain, Abir Alhusaini, Naji
author_sort	Mohammed Almansor, Mohammed Abbas
collection	PubMed
description	The lack of sentiment resources in poor resource languages poses challenges for the sentiment analysis in which machine learning is involved. Cross-lingual and semi-supervised learning approaches have been deployed to represent the most common ways that can overcome this issue. However, performance of the existing methods degrades due to the poor quality of translated resources, data sparseness and more specifically, language divergence. An integrated learning model that uses a semi-supervised and an ensembled model while utilizing the available sentiment resources to tackle language divergence related issues is proposed. Additionally, to reduce the impact of translation errors and handle instance selection problem, we propose a clustering-based bee-colony-sample selection method for the optimal selection of most distinguishing features representing the target data. To evaluate the proposed model, various experiments are conducted employing an English-Arabic cross-lingual data set. Simulations results demonstrate that the proposed model outperforms the baseline approaches in terms of classification performances. Furthermore, the statistical outcomes indicate the advantages of the proposed training data sampling and target-based feature selection to reduce the negative effect of translation errors. These results highlight the fact that the proposed approach achieves a performance that is close to in-language supervised models.
format	Online Article Text
id	pubmed-7570551
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-75705512020-10-28 Cross Lingual Sentiment Analysis: A Clustering-Based Bee Colony Instance Selection and Target-Based Feature Weighting Approach Mohammed Almansor, Mohammed Abbas Zhang, Chongfu Khan, Wasiq Hussain, Abir Alhusaini, Naji Sensors (Basel) Article The lack of sentiment resources in poor resource languages poses challenges for the sentiment analysis in which machine learning is involved. Cross-lingual and semi-supervised learning approaches have been deployed to represent the most common ways that can overcome this issue. However, performance of the existing methods degrades due to the poor quality of translated resources, data sparseness and more specifically, language divergence. An integrated learning model that uses a semi-supervised and an ensembled model while utilizing the available sentiment resources to tackle language divergence related issues is proposed. Additionally, to reduce the impact of translation errors and handle instance selection problem, we propose a clustering-based bee-colony-sample selection method for the optimal selection of most distinguishing features representing the target data. To evaluate the proposed model, various experiments are conducted employing an English-Arabic cross-lingual data set. Simulations results demonstrate that the proposed model outperforms the baseline approaches in terms of classification performances. Furthermore, the statistical outcomes indicate the advantages of the proposed training data sampling and target-based feature selection to reduce the negative effect of translation errors. These results highlight the fact that the proposed approach achieves a performance that is close to in-language supervised models. MDPI 2020-09-15 /pmc/articles/PMC7570551/ /pubmed/32942721 http://dx.doi.org/10.3390/s20185276 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Mohammed Almansor, Mohammed Abbas Zhang, Chongfu Khan, Wasiq Hussain, Abir Alhusaini, Naji Cross Lingual Sentiment Analysis: A Clustering-Based Bee Colony Instance Selection and Target-Based Feature Weighting Approach
title	Cross Lingual Sentiment Analysis: A Clustering-Based Bee Colony Instance Selection and Target-Based Feature Weighting Approach
title_full	Cross Lingual Sentiment Analysis: A Clustering-Based Bee Colony Instance Selection and Target-Based Feature Weighting Approach
title_fullStr	Cross Lingual Sentiment Analysis: A Clustering-Based Bee Colony Instance Selection and Target-Based Feature Weighting Approach
title_full_unstemmed	Cross Lingual Sentiment Analysis: A Clustering-Based Bee Colony Instance Selection and Target-Based Feature Weighting Approach
title_short	Cross Lingual Sentiment Analysis: A Clustering-Based Bee Colony Instance Selection and Target-Based Feature Weighting Approach
title_sort	cross lingual sentiment analysis: a clustering-based bee colony instance selection and target-based feature weighting approach
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7570551/ https://www.ncbi.nlm.nih.gov/pubmed/32942721 http://dx.doi.org/10.3390/s20185276
work_keys_str_mv	AT mohammedalmansormohammedabbas crosslingualsentimentanalysisaclusteringbasedbeecolonyinstanceselectionandtargetbasedfeatureweightingapproach AT zhangchongfu crosslingualsentimentanalysisaclusteringbasedbeecolonyinstanceselectionandtargetbasedfeatureweightingapproach AT khanwasiq crosslingualsentimentanalysisaclusteringbasedbeecolonyinstanceselectionandtargetbasedfeatureweightingapproach AT hussainabir crosslingualsentimentanalysisaclusteringbasedbeecolonyinstanceselectionandtargetbasedfeatureweightingapproach AT alhusaininaji crosslingualsentimentanalysisaclusteringbasedbeecolonyinstanceselectionandtargetbasedfeatureweightingapproach

Cross Lingual Sentiment Analysis: A Clustering-Based Bee Colony Instance Selection and Target-Based Feature Weighting Approach

Ejemplares similares