Cargando…
Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble
Recently, various ensemble learning methods with different base classifiers have been proposed for credit scoring problems. However, for various reasons, there has been little research using logistic regression as the base classifier. In this paper, given large unbalanced data, we consider the plaus...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4338292/ https://www.ncbi.nlm.nih.gov/pubmed/25706988 http://dx.doi.org/10.1371/journal.pone.0117844 |
_version_ | 1782481185430044672 |
---|---|
author | Wang, Hong Xu, Qingsong Zhou, Lifeng |
author_facet | Wang, Hong Xu, Qingsong Zhou, Lifeng |
author_sort | Wang, Hong |
collection | PubMed |
description | Recently, various ensemble learning methods with different base classifiers have been proposed for credit scoring problems. However, for various reasons, there has been little research using logistic regression as the base classifier. In this paper, given large unbalanced data, we consider the plausibility of ensemble learning using regularized logistic regression as the base classifier to deal with credit scoring problems. In this research, the data is first balanced and diversified by clustering and bagging algorithms. Then we apply a Lasso-logistic regression learning ensemble to evaluate the credit risks. We show that the proposed algorithm outperforms popular credit scoring models such as decision tree, Lasso-logistic regression and random forests in terms of AUC and F-measure. We also provide two importance measures for the proposed model to identify important variables in the data. |
format | Online Article Text |
id | pubmed-4338292 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-43382922015-03-04 Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble Wang, Hong Xu, Qingsong Zhou, Lifeng PLoS One Research Article Recently, various ensemble learning methods with different base classifiers have been proposed for credit scoring problems. However, for various reasons, there has been little research using logistic regression as the base classifier. In this paper, given large unbalanced data, we consider the plausibility of ensemble learning using regularized logistic regression as the base classifier to deal with credit scoring problems. In this research, the data is first balanced and diversified by clustering and bagging algorithms. Then we apply a Lasso-logistic regression learning ensemble to evaluate the credit risks. We show that the proposed algorithm outperforms popular credit scoring models such as decision tree, Lasso-logistic regression and random forests in terms of AUC and F-measure. We also provide two importance measures for the proposed model to identify important variables in the data. Public Library of Science 2015-02-23 /pmc/articles/PMC4338292/ /pubmed/25706988 http://dx.doi.org/10.1371/journal.pone.0117844 Text en © 2015 Wang et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Wang, Hong Xu, Qingsong Zhou, Lifeng Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble |
title | Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble |
title_full | Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble |
title_fullStr | Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble |
title_full_unstemmed | Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble |
title_short | Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble |
title_sort | large unbalanced credit scoring using lasso-logistic regression ensemble |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4338292/ https://www.ncbi.nlm.nih.gov/pubmed/25706988 http://dx.doi.org/10.1371/journal.pone.0117844 |
work_keys_str_mv | AT wanghong largeunbalancedcreditscoringusinglassologisticregressionensemble AT xuqingsong largeunbalancedcreditscoringusinglassologisticregressionensemble AT zhoulifeng largeunbalancedcreditscoringusinglassologisticregressionensemble |