Cargando…

Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble

Recently, various ensemble learning methods with different base classifiers have been proposed for credit scoring problems. However, for various reasons, there has been little research using logistic regression as the base classifier. In this paper, given large unbalanced data, we consider the plaus...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Hong, Xu, Qingsong, Zhou, Lifeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4338292/
https://www.ncbi.nlm.nih.gov/pubmed/25706988
http://dx.doi.org/10.1371/journal.pone.0117844
_version_ 1782481185430044672
author Wang, Hong
Xu, Qingsong
Zhou, Lifeng
author_facet Wang, Hong
Xu, Qingsong
Zhou, Lifeng
author_sort Wang, Hong
collection PubMed
description Recently, various ensemble learning methods with different base classifiers have been proposed for credit scoring problems. However, for various reasons, there has been little research using logistic regression as the base classifier. In this paper, given large unbalanced data, we consider the plausibility of ensemble learning using regularized logistic regression as the base classifier to deal with credit scoring problems. In this research, the data is first balanced and diversified by clustering and bagging algorithms. Then we apply a Lasso-logistic regression learning ensemble to evaluate the credit risks. We show that the proposed algorithm outperforms popular credit scoring models such as decision tree, Lasso-logistic regression and random forests in terms of AUC and F-measure. We also provide two importance measures for the proposed model to identify important variables in the data.
format Online
Article
Text
id pubmed-4338292
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-43382922015-03-04 Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble Wang, Hong Xu, Qingsong Zhou, Lifeng PLoS One Research Article Recently, various ensemble learning methods with different base classifiers have been proposed for credit scoring problems. However, for various reasons, there has been little research using logistic regression as the base classifier. In this paper, given large unbalanced data, we consider the plausibility of ensemble learning using regularized logistic regression as the base classifier to deal with credit scoring problems. In this research, the data is first balanced and diversified by clustering and bagging algorithms. Then we apply a Lasso-logistic regression learning ensemble to evaluate the credit risks. We show that the proposed algorithm outperforms popular credit scoring models such as decision tree, Lasso-logistic regression and random forests in terms of AUC and F-measure. We also provide two importance measures for the proposed model to identify important variables in the data. Public Library of Science 2015-02-23 /pmc/articles/PMC4338292/ /pubmed/25706988 http://dx.doi.org/10.1371/journal.pone.0117844 Text en © 2015 Wang et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Wang, Hong
Xu, Qingsong
Zhou, Lifeng
Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble
title Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble
title_full Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble
title_fullStr Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble
title_full_unstemmed Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble
title_short Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble
title_sort large unbalanced credit scoring using lasso-logistic regression ensemble
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4338292/
https://www.ncbi.nlm.nih.gov/pubmed/25706988
http://dx.doi.org/10.1371/journal.pone.0117844
work_keys_str_mv AT wanghong largeunbalancedcreditscoringusinglassologisticregressionensemble
AT xuqingsong largeunbalancedcreditscoringusinglassologisticregressionensemble
AT zhoulifeng largeunbalancedcreditscoringusinglassologisticregressionensemble