Cargando…

Deep Learning Based on Hierarchical Self-Attention for Finance Distress Prediction Incorporating Text

To detect comprehensive clues and provide more accurate forecasting in the early stage of financial distress, in addition to financial indicators, digitalization of lengthy but indispensable textual disclosure, such as Management Discussion and Analysis (MD&A), has been emphasized by researchers...

Descripción completa

Detalles Bibliográficos
Autores principales: Ruan, Sumei, Sun, Xusheng, Yao, Ruanxingchen, Li, Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8683239/
https://www.ncbi.nlm.nih.gov/pubmed/34925482
http://dx.doi.org/10.1155/2021/1165296
_version_ 1784617371466465280
author Ruan, Sumei
Sun, Xusheng
Yao, Ruanxingchen
Li, Wei
author_facet Ruan, Sumei
Sun, Xusheng
Yao, Ruanxingchen
Li, Wei
author_sort Ruan, Sumei
collection PubMed
description To detect comprehensive clues and provide more accurate forecasting in the early stage of financial distress, in addition to financial indicators, digitalization of lengthy but indispensable textual disclosure, such as Management Discussion and Analysis (MD&A), has been emphasized by researchers. However, most studies divide the long text into words and count words to treat the text as word count vectors, bringing massive invalid information but ignoring meaningful contexts. Aiming to efficiently represent the text of large size, an end-to-end neural networks model based on hierarchical self-attention is proposed in this study after the state-of-the-art pretrained model is introduced for text embedding including contexts. The proposed model has two notable characteristics. First, the hierarchical self-attention only affords the essential content with high weights in word-level and sentence-level and automatically neglects lots of information that has no business with risk prediction, which is suitable for extracting effective parts of the large-scale text. Second, after fine-tuning, the word embedding adapts the specific contexts of samples and conveys the original text expression more accurately without excessive manual operations. Experiments confirm that the addition of text improves the accuracy of financial distress forecasting and the proposed model outperforms benchmark models better at AUC and F2-score. For visualization, the elements in the weight matrix of hierarchical self-attention act as scalers to estimate the importance of each word and sentence. In this way, the “red-flag” statement that implies financial risk is figured out and highlighted in the original text, providing effective references for decision-makers.
format Online
Article
Text
id pubmed-8683239
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-86832392021-12-18 Deep Learning Based on Hierarchical Self-Attention for Finance Distress Prediction Incorporating Text Ruan, Sumei Sun, Xusheng Yao, Ruanxingchen Li, Wei Comput Intell Neurosci Research Article To detect comprehensive clues and provide more accurate forecasting in the early stage of financial distress, in addition to financial indicators, digitalization of lengthy but indispensable textual disclosure, such as Management Discussion and Analysis (MD&A), has been emphasized by researchers. However, most studies divide the long text into words and count words to treat the text as word count vectors, bringing massive invalid information but ignoring meaningful contexts. Aiming to efficiently represent the text of large size, an end-to-end neural networks model based on hierarchical self-attention is proposed in this study after the state-of-the-art pretrained model is introduced for text embedding including contexts. The proposed model has two notable characteristics. First, the hierarchical self-attention only affords the essential content with high weights in word-level and sentence-level and automatically neglects lots of information that has no business with risk prediction, which is suitable for extracting effective parts of the large-scale text. Second, after fine-tuning, the word embedding adapts the specific contexts of samples and conveys the original text expression more accurately without excessive manual operations. Experiments confirm that the addition of text improves the accuracy of financial distress forecasting and the proposed model outperforms benchmark models better at AUC and F2-score. For visualization, the elements in the weight matrix of hierarchical self-attention act as scalers to estimate the importance of each word and sentence. In this way, the “red-flag” statement that implies financial risk is figured out and highlighted in the original text, providing effective references for decision-makers. Hindawi 2021-12-10 /pmc/articles/PMC8683239/ /pubmed/34925482 http://dx.doi.org/10.1155/2021/1165296 Text en Copyright © 2021 Sumei Ruan et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Ruan, Sumei
Sun, Xusheng
Yao, Ruanxingchen
Li, Wei
Deep Learning Based on Hierarchical Self-Attention for Finance Distress Prediction Incorporating Text
title Deep Learning Based on Hierarchical Self-Attention for Finance Distress Prediction Incorporating Text
title_full Deep Learning Based on Hierarchical Self-Attention for Finance Distress Prediction Incorporating Text
title_fullStr Deep Learning Based on Hierarchical Self-Attention for Finance Distress Prediction Incorporating Text
title_full_unstemmed Deep Learning Based on Hierarchical Self-Attention for Finance Distress Prediction Incorporating Text
title_short Deep Learning Based on Hierarchical Self-Attention for Finance Distress Prediction Incorporating Text
title_sort deep learning based on hierarchical self-attention for finance distress prediction incorporating text
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8683239/
https://www.ncbi.nlm.nih.gov/pubmed/34925482
http://dx.doi.org/10.1155/2021/1165296
work_keys_str_mv AT ruansumei deeplearningbasedonhierarchicalselfattentionforfinancedistresspredictionincorporatingtext
AT sunxusheng deeplearningbasedonhierarchicalselfattentionforfinancedistresspredictionincorporatingtext
AT yaoruanxingchen deeplearningbasedonhierarchicalselfattentionforfinancedistresspredictionincorporatingtext
AT liwei deeplearningbasedonhierarchicalselfattentionforfinancedistresspredictionincorporatingtext