Cargando…
Ensemble of diverse deep neural networks with pseudo-labels for repayment prediction in social lending
In peer-to-peer (P2P) social lending, it is important to predict the repayment of borrowers. P2P lending data are generated in real-time, but most of them are pending to decide the repayment because the deadline is not yet expired. Adding the unexpired data with appropriate labels into the training...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10450474/ https://www.ncbi.nlm.nih.gov/pubmed/36112937 http://dx.doi.org/10.1177/00368504221124004 |
_version_ | 1785095205529059328 |
---|---|
author | Kim, Ji-Yoon Cho, Sung-Bae |
author_facet | Kim, Ji-Yoon Cho, Sung-Bae |
author_sort | Kim, Ji-Yoon |
collection | PubMed |
description | In peer-to-peer (P2P) social lending, it is important to predict the repayment of borrowers. P2P lending data are generated in real-time, but most of them are pending to decide the repayment because the deadline is not yet expired. Adding the unexpired data with appropriate labels into the training set could improve the performance of a prediction model, but the pseudo-labels cannot be certainly precise. In this paper, we propose an ensemble classifier composed of diverse convolutional neural networks (CNNs) of GoogLeNet, ResNet and DenseNet for the repayment prediction in social lending with the pseudo-labels approximated by an uncertainty handling scheme. The additional data labeled by Dempster-Shafer fusion of two semi-supervised learning methods boost up training of various models of CNNs, which are combined by weighted voting. A diversity measure is applied to constructing a pool of different models of CNNs that extract the effective features in the social lending data with labeling noise and predict the borrower's loan status. The experiment with the real dataset of 855,502 cases from Lending Club confirms that the diverse ensemble combined with weighted voting achieves the highest performance and outperforms conventional methods. |
format | Online Article Text |
id | pubmed-10450474 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-104504742023-08-26 Ensemble of diverse deep neural networks with pseudo-labels for repayment prediction in social lending Kim, Ji-Yoon Cho, Sung-Bae Sci Prog Applying Artificial Intelligence Techniques to Encourage Economic Growth and Maintain Sustainable Societies In peer-to-peer (P2P) social lending, it is important to predict the repayment of borrowers. P2P lending data are generated in real-time, but most of them are pending to decide the repayment because the deadline is not yet expired. Adding the unexpired data with appropriate labels into the training set could improve the performance of a prediction model, but the pseudo-labels cannot be certainly precise. In this paper, we propose an ensemble classifier composed of diverse convolutional neural networks (CNNs) of GoogLeNet, ResNet and DenseNet for the repayment prediction in social lending with the pseudo-labels approximated by an uncertainty handling scheme. The additional data labeled by Dempster-Shafer fusion of two semi-supervised learning methods boost up training of various models of CNNs, which are combined by weighted voting. A diversity measure is applied to constructing a pool of different models of CNNs that extract the effective features in the social lending data with labeling noise and predict the borrower's loan status. The experiment with the real dataset of 855,502 cases from Lending Club confirms that the diverse ensemble combined with weighted voting achieves the highest performance and outperforms conventional methods. SAGE Publications 2022-09-13 /pmc/articles/PMC10450474/ /pubmed/36112937 http://dx.doi.org/10.1177/00368504221124004 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
spellingShingle | Applying Artificial Intelligence Techniques to Encourage Economic Growth and Maintain Sustainable Societies Kim, Ji-Yoon Cho, Sung-Bae Ensemble of diverse deep neural networks with pseudo-labels for repayment prediction in social lending |
title | Ensemble of diverse deep neural networks with pseudo-labels for repayment prediction in social lending |
title_full | Ensemble of diverse deep neural networks with pseudo-labels for repayment prediction in social lending |
title_fullStr | Ensemble of diverse deep neural networks with pseudo-labels for repayment prediction in social lending |
title_full_unstemmed | Ensemble of diverse deep neural networks with pseudo-labels for repayment prediction in social lending |
title_short | Ensemble of diverse deep neural networks with pseudo-labels for repayment prediction in social lending |
title_sort | ensemble of diverse deep neural networks with pseudo-labels for repayment prediction in social lending |
topic | Applying Artificial Intelligence Techniques to Encourage Economic Growth and Maintain Sustainable Societies |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10450474/ https://www.ncbi.nlm.nih.gov/pubmed/36112937 http://dx.doi.org/10.1177/00368504221124004 |
work_keys_str_mv | AT kimjiyoon ensembleofdiversedeepneuralnetworkswithpseudolabelsforrepaymentpredictioninsociallending AT chosungbae ensembleofdiversedeepneuralnetworkswithpseudolabelsforrepaymentpredictioninsociallending |