Cargando…

Evaluation of transformer models for financial targeted sentiment analysis in Spanish

Nowadays, financial data from social media plays an important role to predict the stock market. However, the exponential growth of financial information and the different polarities of sentiment that other sectors or stakeholders may have on the same information has led to the need for new technolog...

Descripción completa

Detalles Bibliográficos
Autores principales: Pan, Ronghao, García-Díaz, José Antonio, Garcia-Sanchez, Francisco, Valencia-García, Rafael
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280559/
https://www.ncbi.nlm.nih.gov/pubmed/37346571
http://dx.doi.org/10.7717/peerj-cs.1377
_version_ 1785060822265888768
author Pan, Ronghao
García-Díaz, José Antonio
Garcia-Sanchez, Francisco
Valencia-García, Rafael
author_facet Pan, Ronghao
García-Díaz, José Antonio
Garcia-Sanchez, Francisco
Valencia-García, Rafael
author_sort Pan, Ronghao
collection PubMed
description Nowadays, financial data from social media plays an important role to predict the stock market. However, the exponential growth of financial information and the different polarities of sentiment that other sectors or stakeholders may have on the same information has led to the need for new technologies that automatically collect and classify large volumes of information quickly and easily for each stakeholder. In this scenario, we conduct a targeted sentiment analysis that can automatically extract the main economic target from financial texts and obtain the polarity of a text towards such main economic target, other companies and society in general. To this end, we have compiled a novel corpus of financial tweets and news headlines in Spanish, constituting a valuable resource for the Spanish-focused research community. In addition, we have carried out a performance comparison of different Spanish-specific large language models, with MarIA and BETO achieving the best results. Our best result has an overall performance of 76.04%, 74.16%, and 68.07% in macro F1-score for the sentiment classification towards the main economic target, society, and other companies, respectively, and an accuracy of 69.74% for target detection. We have also evaluated the performance of multi-label classification models in this context and obtained a performance of 71.13%.
format Online
Article
Text
id pubmed-10280559
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-102805592023-06-21 Evaluation of transformer models for financial targeted sentiment analysis in Spanish Pan, Ronghao García-Díaz, José Antonio Garcia-Sanchez, Francisco Valencia-García, Rafael PeerJ Comput Sci Artificial Intelligence Nowadays, financial data from social media plays an important role to predict the stock market. However, the exponential growth of financial information and the different polarities of sentiment that other sectors or stakeholders may have on the same information has led to the need for new technologies that automatically collect and classify large volumes of information quickly and easily for each stakeholder. In this scenario, we conduct a targeted sentiment analysis that can automatically extract the main economic target from financial texts and obtain the polarity of a text towards such main economic target, other companies and society in general. To this end, we have compiled a novel corpus of financial tweets and news headlines in Spanish, constituting a valuable resource for the Spanish-focused research community. In addition, we have carried out a performance comparison of different Spanish-specific large language models, with MarIA and BETO achieving the best results. Our best result has an overall performance of 76.04%, 74.16%, and 68.07% in macro F1-score for the sentiment classification towards the main economic target, society, and other companies, respectively, and an accuracy of 69.74% for target detection. We have also evaluated the performance of multi-label classification models in this context and obtained a performance of 71.13%. PeerJ Inc. 2023-05-09 /pmc/articles/PMC10280559/ /pubmed/37346571 http://dx.doi.org/10.7717/peerj-cs.1377 Text en © 2023 Pan et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Artificial Intelligence
Pan, Ronghao
García-Díaz, José Antonio
Garcia-Sanchez, Francisco
Valencia-García, Rafael
Evaluation of transformer models for financial targeted sentiment analysis in Spanish
title Evaluation of transformer models for financial targeted sentiment analysis in Spanish
title_full Evaluation of transformer models for financial targeted sentiment analysis in Spanish
title_fullStr Evaluation of transformer models for financial targeted sentiment analysis in Spanish
title_full_unstemmed Evaluation of transformer models for financial targeted sentiment analysis in Spanish
title_short Evaluation of transformer models for financial targeted sentiment analysis in Spanish
title_sort evaluation of transformer models for financial targeted sentiment analysis in spanish
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280559/
https://www.ncbi.nlm.nih.gov/pubmed/37346571
http://dx.doi.org/10.7717/peerj-cs.1377
work_keys_str_mv AT panronghao evaluationoftransformermodelsforfinancialtargetedsentimentanalysisinspanish
AT garciadiazjoseantonio evaluationoftransformermodelsforfinancialtargetedsentimentanalysisinspanish
AT garciasanchezfrancisco evaluationoftransformermodelsforfinancialtargetedsentimentanalysisinspanish
AT valenciagarciarafael evaluationoftransformermodelsforfinancialtargetedsentimentanalysisinspanish