Cargando…
Semi-Supervised Clustering for Financial Risk Analysis
Many methods have been developed for financial risk analysis. In general, the conventional unsupervised approaches lack sufficient accuracy and semantics for the clustering, and the supervised approaches rely on large amount of training data for the classification. This paper explores the semi-super...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer US
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8223531/ https://www.ncbi.nlm.nih.gov/pubmed/34188603 http://dx.doi.org/10.1007/s11063-021-10564-0 |
_version_ | 1783711712960053248 |
---|---|
author | Han, Yihan Wang, Tao |
author_facet | Han, Yihan Wang, Tao |
author_sort | Han, Yihan |
collection | PubMed |
description | Many methods have been developed for financial risk analysis. In general, the conventional unsupervised approaches lack sufficient accuracy and semantics for the clustering, and the supervised approaches rely on large amount of training data for the classification. This paper explores the semi-supervised scheme for the financial data prediction, in which accurate predictions are expected with a small amount of labeled data. Due to lack of sufficient distinguishability in financial data, it is hard for the existing semi-supervised approaches to obtain satisfactory results. In order to improve the performance, we first convert the input labeled clues to the global prior probability, and propagate the’soft’ prior probability to learn the posterior probability instead of directly propagating the’hard’ labeled data. A label diffusion model is then constructed to adaptively fuse the information at feature space and label space, which makes the structures of data affinity and labeling more consistent. Experiments on two public real financial datasets validate the effectiveness of the proposed method. |
format | Online Article Text |
id | pubmed-8223531 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer US |
record_format | MEDLINE/PubMed |
spelling | pubmed-82235312021-06-25 Semi-Supervised Clustering for Financial Risk Analysis Han, Yihan Wang, Tao Neural Process Lett Article Many methods have been developed for financial risk analysis. In general, the conventional unsupervised approaches lack sufficient accuracy and semantics for the clustering, and the supervised approaches rely on large amount of training data for the classification. This paper explores the semi-supervised scheme for the financial data prediction, in which accurate predictions are expected with a small amount of labeled data. Due to lack of sufficient distinguishability in financial data, it is hard for the existing semi-supervised approaches to obtain satisfactory results. In order to improve the performance, we first convert the input labeled clues to the global prior probability, and propagate the’soft’ prior probability to learn the posterior probability instead of directly propagating the’hard’ labeled data. A label diffusion model is then constructed to adaptively fuse the information at feature space and label space, which makes the structures of data affinity and labeling more consistent. Experiments on two public real financial datasets validate the effectiveness of the proposed method. Springer US 2021-06-24 2021 /pmc/articles/PMC8223531/ /pubmed/34188603 http://dx.doi.org/10.1007/s11063-021-10564-0 Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Han, Yihan Wang, Tao Semi-Supervised Clustering for Financial Risk Analysis |
title | Semi-Supervised Clustering for Financial Risk Analysis |
title_full | Semi-Supervised Clustering for Financial Risk Analysis |
title_fullStr | Semi-Supervised Clustering for Financial Risk Analysis |
title_full_unstemmed | Semi-Supervised Clustering for Financial Risk Analysis |
title_short | Semi-Supervised Clustering for Financial Risk Analysis |
title_sort | semi-supervised clustering for financial risk analysis |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8223531/ https://www.ncbi.nlm.nih.gov/pubmed/34188603 http://dx.doi.org/10.1007/s11063-021-10564-0 |
work_keys_str_mv | AT hanyihan semisupervisedclusteringforfinancialriskanalysis AT wangtao semisupervisedclusteringforfinancialriskanalysis |