Cargando…

Ensemble of diverse deep neural networks with pseudo-labels for repayment prediction in social lending

In peer-to-peer (P2P) social lending, it is important to predict the repayment of borrowers. P2P lending data are generated in real-time, but most of them are pending to decide the repayment because the deadline is not yet expired. Adding the unexpired data with appropriate labels into the training...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Ji-Yoon, Cho, Sung-Bae
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10450474/
https://www.ncbi.nlm.nih.gov/pubmed/36112937
http://dx.doi.org/10.1177/00368504221124004
_version_ 1785095205529059328
author Kim, Ji-Yoon
Cho, Sung-Bae
author_facet Kim, Ji-Yoon
Cho, Sung-Bae
author_sort Kim, Ji-Yoon
collection PubMed
description In peer-to-peer (P2P) social lending, it is important to predict the repayment of borrowers. P2P lending data are generated in real-time, but most of them are pending to decide the repayment because the deadline is not yet expired. Adding the unexpired data with appropriate labels into the training set could improve the performance of a prediction model, but the pseudo-labels cannot be certainly precise. In this paper, we propose an ensemble classifier composed of diverse convolutional neural networks (CNNs) of GoogLeNet, ResNet and DenseNet for the repayment prediction in social lending with the pseudo-labels approximated by an uncertainty handling scheme. The additional data labeled by Dempster-Shafer fusion of two semi-supervised learning methods boost up training of various models of CNNs, which are combined by weighted voting. A diversity measure is applied to constructing a pool of different models of CNNs that extract the effective features in the social lending data with labeling noise and predict the borrower's loan status. The experiment with the real dataset of 855,502 cases from Lending Club confirms that the diverse ensemble combined with weighted voting achieves the highest performance and outperforms conventional methods.
format Online
Article
Text
id pubmed-10450474
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-104504742023-08-26 Ensemble of diverse deep neural networks with pseudo-labels for repayment prediction in social lending Kim, Ji-Yoon Cho, Sung-Bae Sci Prog Applying Artificial Intelligence Techniques to Encourage Economic Growth and Maintain Sustainable Societies In peer-to-peer (P2P) social lending, it is important to predict the repayment of borrowers. P2P lending data are generated in real-time, but most of them are pending to decide the repayment because the deadline is not yet expired. Adding the unexpired data with appropriate labels into the training set could improve the performance of a prediction model, but the pseudo-labels cannot be certainly precise. In this paper, we propose an ensemble classifier composed of diverse convolutional neural networks (CNNs) of GoogLeNet, ResNet and DenseNet for the repayment prediction in social lending with the pseudo-labels approximated by an uncertainty handling scheme. The additional data labeled by Dempster-Shafer fusion of two semi-supervised learning methods boost up training of various models of CNNs, which are combined by weighted voting. A diversity measure is applied to constructing a pool of different models of CNNs that extract the effective features in the social lending data with labeling noise and predict the borrower's loan status. The experiment with the real dataset of 855,502 cases from Lending Club confirms that the diverse ensemble combined with weighted voting achieves the highest performance and outperforms conventional methods. SAGE Publications 2022-09-13 /pmc/articles/PMC10450474/ /pubmed/36112937 http://dx.doi.org/10.1177/00368504221124004 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Applying Artificial Intelligence Techniques to Encourage Economic Growth and Maintain Sustainable Societies
Kim, Ji-Yoon
Cho, Sung-Bae
Ensemble of diverse deep neural networks with pseudo-labels for repayment prediction in social lending
title Ensemble of diverse deep neural networks with pseudo-labels for repayment prediction in social lending
title_full Ensemble of diverse deep neural networks with pseudo-labels for repayment prediction in social lending
title_fullStr Ensemble of diverse deep neural networks with pseudo-labels for repayment prediction in social lending
title_full_unstemmed Ensemble of diverse deep neural networks with pseudo-labels for repayment prediction in social lending
title_short Ensemble of diverse deep neural networks with pseudo-labels for repayment prediction in social lending
title_sort ensemble of diverse deep neural networks with pseudo-labels for repayment prediction in social lending
topic Applying Artificial Intelligence Techniques to Encourage Economic Growth and Maintain Sustainable Societies
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10450474/
https://www.ncbi.nlm.nih.gov/pubmed/36112937
http://dx.doi.org/10.1177/00368504221124004
work_keys_str_mv AT kimjiyoon ensembleofdiversedeepneuralnetworkswithpseudolabelsforrepaymentpredictioninsociallending
AT chosungbae ensembleofdiversedeepneuralnetworkswithpseudolabelsforrepaymentpredictioninsociallending