Cargando…

Semi-Supervised Clustering for Financial Risk Analysis

Many methods have been developed for financial risk analysis. In general, the conventional unsupervised approaches lack sufficient accuracy and semantics for the clustering, and the supervised approaches rely on large amount of training data for the classification. This paper explores the semi-super...

Descripción completa

Detalles Bibliográficos
Autores principales: Han, Yihan, Wang, Tao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8223531/
https://www.ncbi.nlm.nih.gov/pubmed/34188603
http://dx.doi.org/10.1007/s11063-021-10564-0
_version_ 1783711712960053248
author Han, Yihan
Wang, Tao
author_facet Han, Yihan
Wang, Tao
author_sort Han, Yihan
collection PubMed
description Many methods have been developed for financial risk analysis. In general, the conventional unsupervised approaches lack sufficient accuracy and semantics for the clustering, and the supervised approaches rely on large amount of training data for the classification. This paper explores the semi-supervised scheme for the financial data prediction, in which accurate predictions are expected with a small amount of labeled data. Due to lack of sufficient distinguishability in financial data, it is hard for the existing semi-supervised approaches to obtain satisfactory results. In order to improve the performance, we first convert the input labeled clues to the global prior probability, and propagate the’soft’ prior probability to learn the posterior probability instead of directly propagating the’hard’ labeled data. A label diffusion model is then constructed to adaptively fuse the information at feature space and label space, which makes the structures of data affinity and labeling more consistent. Experiments on two public real financial datasets validate the effectiveness of the proposed method.
format Online
Article
Text
id pubmed-8223531
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-82235312021-06-25 Semi-Supervised Clustering for Financial Risk Analysis Han, Yihan Wang, Tao Neural Process Lett Article Many methods have been developed for financial risk analysis. In general, the conventional unsupervised approaches lack sufficient accuracy and semantics for the clustering, and the supervised approaches rely on large amount of training data for the classification. This paper explores the semi-supervised scheme for the financial data prediction, in which accurate predictions are expected with a small amount of labeled data. Due to lack of sufficient distinguishability in financial data, it is hard for the existing semi-supervised approaches to obtain satisfactory results. In order to improve the performance, we first convert the input labeled clues to the global prior probability, and propagate the’soft’ prior probability to learn the posterior probability instead of directly propagating the’hard’ labeled data. A label diffusion model is then constructed to adaptively fuse the information at feature space and label space, which makes the structures of data affinity and labeling more consistent. Experiments on two public real financial datasets validate the effectiveness of the proposed method. Springer US 2021-06-24 2021 /pmc/articles/PMC8223531/ /pubmed/34188603 http://dx.doi.org/10.1007/s11063-021-10564-0 Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Han, Yihan
Wang, Tao
Semi-Supervised Clustering for Financial Risk Analysis
title Semi-Supervised Clustering for Financial Risk Analysis
title_full Semi-Supervised Clustering for Financial Risk Analysis
title_fullStr Semi-Supervised Clustering for Financial Risk Analysis
title_full_unstemmed Semi-Supervised Clustering for Financial Risk Analysis
title_short Semi-Supervised Clustering for Financial Risk Analysis
title_sort semi-supervised clustering for financial risk analysis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8223531/
https://www.ncbi.nlm.nih.gov/pubmed/34188603
http://dx.doi.org/10.1007/s11063-021-10564-0
work_keys_str_mv AT hanyihan semisupervisedclusteringforfinancialriskanalysis
AT wangtao semisupervisedclusteringforfinancialriskanalysis