Cargando…
Sentiment Classification of News Text Data Using Intelligent Model
Text sentiment classification is a fundamental sub-area in natural language processing. The sentiment classification algorithm is highly domain-dependent. For example, the phrase “traffic jam” expresses negative sentiment in the sentence “I was stuck in a traffic jam on the elevated for 2 h.” But in...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8509032/ https://www.ncbi.nlm.nih.gov/pubmed/34650498 http://dx.doi.org/10.3389/fpsyg.2021.758967 |
_version_ | 1784582238983159808 |
---|---|
author | Zhang, Shitao |
author_facet | Zhang, Shitao |
author_sort | Zhang, Shitao |
collection | PubMed |
description | Text sentiment classification is a fundamental sub-area in natural language processing. The sentiment classification algorithm is highly domain-dependent. For example, the phrase “traffic jam” expresses negative sentiment in the sentence “I was stuck in a traffic jam on the elevated for 2 h.” But in the domain of transportation, the phrase “traffic jam” in the sentence “Bread and water are essential terms in traffic jams” is without any sentiment. The most common method is to use the domain-specific data samples to classify the text in this domain. However, text sentiment analysis based on machine learning relies on sufficient labeled training data. Aiming at the problem of sentiment classification of news text data with insufficient label news data and the domain adaptation of text sentiment classifiers, an intelligent model, i.e., transfer learning discriminative dictionary learning algorithm (TLDDL) is proposed for cross-domain text sentiment classification. Based on the framework of dictionary learning, the samples from the different domains are projected into a subspace, and a domain-invariant dictionary is built to connect two different domains. To improve the discriminative performance of the proposed algorithm, the discrimination information preserved term and principal component analysis (PCA) term are combined into the objective function. The experiments are performed on three public text datasets. The experimental results show that the proposed algorithm improves the sentiment classification performance of texts in the target domain. |
format | Online Article Text |
id | pubmed-8509032 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-85090322021-10-13 Sentiment Classification of News Text Data Using Intelligent Model Zhang, Shitao Front Psychol Psychology Text sentiment classification is a fundamental sub-area in natural language processing. The sentiment classification algorithm is highly domain-dependent. For example, the phrase “traffic jam” expresses negative sentiment in the sentence “I was stuck in a traffic jam on the elevated for 2 h.” But in the domain of transportation, the phrase “traffic jam” in the sentence “Bread and water are essential terms in traffic jams” is without any sentiment. The most common method is to use the domain-specific data samples to classify the text in this domain. However, text sentiment analysis based on machine learning relies on sufficient labeled training data. Aiming at the problem of sentiment classification of news text data with insufficient label news data and the domain adaptation of text sentiment classifiers, an intelligent model, i.e., transfer learning discriminative dictionary learning algorithm (TLDDL) is proposed for cross-domain text sentiment classification. Based on the framework of dictionary learning, the samples from the different domains are projected into a subspace, and a domain-invariant dictionary is built to connect two different domains. To improve the discriminative performance of the proposed algorithm, the discrimination information preserved term and principal component analysis (PCA) term are combined into the objective function. The experiments are performed on three public text datasets. The experimental results show that the proposed algorithm improves the sentiment classification performance of texts in the target domain. Frontiers Media S.A. 2021-09-28 /pmc/articles/PMC8509032/ /pubmed/34650498 http://dx.doi.org/10.3389/fpsyg.2021.758967 Text en Copyright © 2021 Zhang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Psychology Zhang, Shitao Sentiment Classification of News Text Data Using Intelligent Model |
title | Sentiment Classification of News Text Data Using Intelligent Model |
title_full | Sentiment Classification of News Text Data Using Intelligent Model |
title_fullStr | Sentiment Classification of News Text Data Using Intelligent Model |
title_full_unstemmed | Sentiment Classification of News Text Data Using Intelligent Model |
title_short | Sentiment Classification of News Text Data Using Intelligent Model |
title_sort | sentiment classification of news text data using intelligent model |
topic | Psychology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8509032/ https://www.ncbi.nlm.nih.gov/pubmed/34650498 http://dx.doi.org/10.3389/fpsyg.2021.758967 |
work_keys_str_mv | AT zhangshitao sentimentclassificationofnewstextdatausingintelligentmodel |