Cargando…
Deep Learning Based on Hierarchical Self-Attention for Finance Distress Prediction Incorporating Text
To detect comprehensive clues and provide more accurate forecasting in the early stage of financial distress, in addition to financial indicators, digitalization of lengthy but indispensable textual disclosure, such as Management Discussion and Analysis (MD&A), has been emphasized by researchers...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8683239/ https://www.ncbi.nlm.nih.gov/pubmed/34925482 http://dx.doi.org/10.1155/2021/1165296 |
_version_ | 1784617371466465280 |
---|---|
author | Ruan, Sumei Sun, Xusheng Yao, Ruanxingchen Li, Wei |
author_facet | Ruan, Sumei Sun, Xusheng Yao, Ruanxingchen Li, Wei |
author_sort | Ruan, Sumei |
collection | PubMed |
description | To detect comprehensive clues and provide more accurate forecasting in the early stage of financial distress, in addition to financial indicators, digitalization of lengthy but indispensable textual disclosure, such as Management Discussion and Analysis (MD&A), has been emphasized by researchers. However, most studies divide the long text into words and count words to treat the text as word count vectors, bringing massive invalid information but ignoring meaningful contexts. Aiming to efficiently represent the text of large size, an end-to-end neural networks model based on hierarchical self-attention is proposed in this study after the state-of-the-art pretrained model is introduced for text embedding including contexts. The proposed model has two notable characteristics. First, the hierarchical self-attention only affords the essential content with high weights in word-level and sentence-level and automatically neglects lots of information that has no business with risk prediction, which is suitable for extracting effective parts of the large-scale text. Second, after fine-tuning, the word embedding adapts the specific contexts of samples and conveys the original text expression more accurately without excessive manual operations. Experiments confirm that the addition of text improves the accuracy of financial distress forecasting and the proposed model outperforms benchmark models better at AUC and F2-score. For visualization, the elements in the weight matrix of hierarchical self-attention act as scalers to estimate the importance of each word and sentence. In this way, the “red-flag” statement that implies financial risk is figured out and highlighted in the original text, providing effective references for decision-makers. |
format | Online Article Text |
id | pubmed-8683239 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-86832392021-12-18 Deep Learning Based on Hierarchical Self-Attention for Finance Distress Prediction Incorporating Text Ruan, Sumei Sun, Xusheng Yao, Ruanxingchen Li, Wei Comput Intell Neurosci Research Article To detect comprehensive clues and provide more accurate forecasting in the early stage of financial distress, in addition to financial indicators, digitalization of lengthy but indispensable textual disclosure, such as Management Discussion and Analysis (MD&A), has been emphasized by researchers. However, most studies divide the long text into words and count words to treat the text as word count vectors, bringing massive invalid information but ignoring meaningful contexts. Aiming to efficiently represent the text of large size, an end-to-end neural networks model based on hierarchical self-attention is proposed in this study after the state-of-the-art pretrained model is introduced for text embedding including contexts. The proposed model has two notable characteristics. First, the hierarchical self-attention only affords the essential content with high weights in word-level and sentence-level and automatically neglects lots of information that has no business with risk prediction, which is suitable for extracting effective parts of the large-scale text. Second, after fine-tuning, the word embedding adapts the specific contexts of samples and conveys the original text expression more accurately without excessive manual operations. Experiments confirm that the addition of text improves the accuracy of financial distress forecasting and the proposed model outperforms benchmark models better at AUC and F2-score. For visualization, the elements in the weight matrix of hierarchical self-attention act as scalers to estimate the importance of each word and sentence. In this way, the “red-flag” statement that implies financial risk is figured out and highlighted in the original text, providing effective references for decision-makers. Hindawi 2021-12-10 /pmc/articles/PMC8683239/ /pubmed/34925482 http://dx.doi.org/10.1155/2021/1165296 Text en Copyright © 2021 Sumei Ruan et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Ruan, Sumei Sun, Xusheng Yao, Ruanxingchen Li, Wei Deep Learning Based on Hierarchical Self-Attention for Finance Distress Prediction Incorporating Text |
title | Deep Learning Based on Hierarchical Self-Attention for Finance Distress Prediction Incorporating Text |
title_full | Deep Learning Based on Hierarchical Self-Attention for Finance Distress Prediction Incorporating Text |
title_fullStr | Deep Learning Based on Hierarchical Self-Attention for Finance Distress Prediction Incorporating Text |
title_full_unstemmed | Deep Learning Based on Hierarchical Self-Attention for Finance Distress Prediction Incorporating Text |
title_short | Deep Learning Based on Hierarchical Self-Attention for Finance Distress Prediction Incorporating Text |
title_sort | deep learning based on hierarchical self-attention for finance distress prediction incorporating text |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8683239/ https://www.ncbi.nlm.nih.gov/pubmed/34925482 http://dx.doi.org/10.1155/2021/1165296 |
work_keys_str_mv | AT ruansumei deeplearningbasedonhierarchicalselfattentionforfinancedistresspredictionincorporatingtext AT sunxusheng deeplearningbasedonhierarchicalselfattentionforfinancedistresspredictionincorporatingtext AT yaoruanxingchen deeplearningbasedonhierarchicalselfattentionforfinancedistresspredictionincorporatingtext AT liwei deeplearningbasedonhierarchicalselfattentionforfinancedistresspredictionincorporatingtext |