Cargando…

Loan default prediction of Chinese P2P market: a machine learning methodology

Repayment failures of borrowers have greatly affected the sustainable development of the peer-to-peer (P2P) lending industry. The latest literature reveals that existing risk evaluation systems may ignore important signals and risk factors affecting P2P repayment. In our study, we applied four machi...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Junhui, Lu, Zekai, Xie, Ying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8455619/
https://www.ncbi.nlm.nih.gov/pubmed/34548599
http://dx.doi.org/10.1038/s41598-021-98361-6
_version_ 1784570708020428800
author Xu, Junhui
Lu, Zekai
Xie, Ying
author_facet Xu, Junhui
Lu, Zekai
Xie, Ying
author_sort Xu, Junhui
collection PubMed
description Repayment failures of borrowers have greatly affected the sustainable development of the peer-to-peer (P2P) lending industry. The latest literature reveals that existing risk evaluation systems may ignore important signals and risk factors affecting P2P repayment. In our study, we applied four machine learning methods (random forest (RF), extreme gradient boosting tree (XGBT), gradient boosting model (GBM), and neural network (NN)) to predict important factors affecting repayment by utilizing data from Renrendai.com in China from Thursday, January 1, 2015, to Tuesday, June 30, 2015. The results showed that borrowers who have passed video, mobile phone, job, residence or education level verification are more likely to default on loan repayment, whereas those who have passed identity and asset certification are less likely to default on loans. The accuracy and kappa value of the four methods all exceed 90%, and RF is superior to the other classification models. Our findings demonstrate important techniques for borrower screening by P2P companies and risk regulation by regulatory agencies. Our methodology and findings will help regulators, banks and creditors combat current financial disasters caused by the coronavirus disease 2019 (COVID-19) pandemic by addressing various financial risks and translating credit scoring improvements.
format Online
Article
Text
id pubmed-8455619
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-84556192021-09-22 Loan default prediction of Chinese P2P market: a machine learning methodology Xu, Junhui Lu, Zekai Xie, Ying Sci Rep Article Repayment failures of borrowers have greatly affected the sustainable development of the peer-to-peer (P2P) lending industry. The latest literature reveals that existing risk evaluation systems may ignore important signals and risk factors affecting P2P repayment. In our study, we applied four machine learning methods (random forest (RF), extreme gradient boosting tree (XGBT), gradient boosting model (GBM), and neural network (NN)) to predict important factors affecting repayment by utilizing data from Renrendai.com in China from Thursday, January 1, 2015, to Tuesday, June 30, 2015. The results showed that borrowers who have passed video, mobile phone, job, residence or education level verification are more likely to default on loan repayment, whereas those who have passed identity and asset certification are less likely to default on loans. The accuracy and kappa value of the four methods all exceed 90%, and RF is superior to the other classification models. Our findings demonstrate important techniques for borrower screening by P2P companies and risk regulation by regulatory agencies. Our methodology and findings will help regulators, banks and creditors combat current financial disasters caused by the coronavirus disease 2019 (COVID-19) pandemic by addressing various financial risks and translating credit scoring improvements. Nature Publishing Group UK 2021-09-21 /pmc/articles/PMC8455619/ /pubmed/34548599 http://dx.doi.org/10.1038/s41598-021-98361-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Xu, Junhui
Lu, Zekai
Xie, Ying
Loan default prediction of Chinese P2P market: a machine learning methodology
title Loan default prediction of Chinese P2P market: a machine learning methodology
title_full Loan default prediction of Chinese P2P market: a machine learning methodology
title_fullStr Loan default prediction of Chinese P2P market: a machine learning methodology
title_full_unstemmed Loan default prediction of Chinese P2P market: a machine learning methodology
title_short Loan default prediction of Chinese P2P market: a machine learning methodology
title_sort loan default prediction of chinese p2p market: a machine learning methodology
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8455619/
https://www.ncbi.nlm.nih.gov/pubmed/34548599
http://dx.doi.org/10.1038/s41598-021-98361-6
work_keys_str_mv AT xujunhui loandefaultpredictionofchinesep2pmarketamachinelearningmethodology
AT luzekai loandefaultpredictionofchinesep2pmarketamachinelearningmethodology
AT xieying loandefaultpredictionofchinesep2pmarketamachinelearningmethodology