Cargando…
A natural language processing based technique for sentiment analysis of college english corpus
The college English corpus can help us better master English, but how to obtain the desired information from a large number of English corpus has become the focus of information technology. Based on the natural language processing (NLP) technology, a sentiment analysis model is built in this article...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280647/ https://www.ncbi.nlm.nih.gov/pubmed/37346685 http://dx.doi.org/10.7717/peerj-cs.1235 |
_version_ | 1785060843475435520 |
---|---|
author | Xu, Jingjing |
author_facet | Xu, Jingjing |
author_sort | Xu, Jingjing |
collection | PubMed |
description | The college English corpus can help us better master English, but how to obtain the desired information from a large number of English corpus has become the focus of information technology. Based on the natural language processing (NLP) technology, a sentiment analysis model is built in this article. An improved term frequency–inverse document frequency (TF-IDF) algorithm is proposed in this article, where the weighted average method is used to determine the emotional value of each emotional word. The inspirational words are used to obtain the English corpus’s emotional tendency and emotional value. The results show that the model has high classification accuracy and operation efficiency when selecting feature words. Compared with the TF-IDF, the improved TF-IDF algorithm added the necessary information weight processing and word density weight processing to two new processing links, which can significantly improve the efficiency of college English learning. |
format | Online Article Text |
id | pubmed-10280647 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-102806472023-06-21 A natural language processing based technique for sentiment analysis of college english corpus Xu, Jingjing PeerJ Comput Sci Computational Linguistics The college English corpus can help us better master English, but how to obtain the desired information from a large number of English corpus has become the focus of information technology. Based on the natural language processing (NLP) technology, a sentiment analysis model is built in this article. An improved term frequency–inverse document frequency (TF-IDF) algorithm is proposed in this article, where the weighted average method is used to determine the emotional value of each emotional word. The inspirational words are used to obtain the English corpus’s emotional tendency and emotional value. The results show that the model has high classification accuracy and operation efficiency when selecting feature words. Compared with the TF-IDF, the improved TF-IDF algorithm added the necessary information weight processing and word density weight processing to two new processing links, which can significantly improve the efficiency of college English learning. PeerJ Inc. 2023-02-17 /pmc/articles/PMC10280647/ /pubmed/37346685 http://dx.doi.org/10.7717/peerj-cs.1235 Text en ©2023 Xu https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Computational Linguistics Xu, Jingjing A natural language processing based technique for sentiment analysis of college english corpus |
title | A natural language processing based technique for sentiment analysis of college english corpus |
title_full | A natural language processing based technique for sentiment analysis of college english corpus |
title_fullStr | A natural language processing based technique for sentiment analysis of college english corpus |
title_full_unstemmed | A natural language processing based technique for sentiment analysis of college english corpus |
title_short | A natural language processing based technique for sentiment analysis of college english corpus |
title_sort | natural language processing based technique for sentiment analysis of college english corpus |
topic | Computational Linguistics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280647/ https://www.ncbi.nlm.nih.gov/pubmed/37346685 http://dx.doi.org/10.7717/peerj-cs.1235 |
work_keys_str_mv | AT xujingjing anaturallanguageprocessingbasedtechniqueforsentimentanalysisofcollegeenglishcorpus AT xujingjing naturallanguageprocessingbasedtechniqueforsentimentanalysisofcollegeenglishcorpus |