Cargando…

Facing the Challenges of Developing Fair Risk Scoring Models

Algorithmic scoring methods are widely used in the finance industry for several decades in order to prevent risk and to automate and optimize decisions. Regulatory requirements as given by the Basel Committee on Banking Supervision (BCBS) or the EU data protection regulations have led to an increasi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Szepannek, Gero, Lübke, Karsten
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2021
Materias:	Artificial Intelligence
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8552888/ https://www.ncbi.nlm.nih.gov/pubmed/34723172 http://dx.doi.org/10.3389/frai.2021.681915

_version_	1784591475239026688
author	Szepannek, Gero Lübke, Karsten
author_facet	Szepannek, Gero Lübke, Karsten
author_sort	Szepannek, Gero
collection	PubMed
description	Algorithmic scoring methods are widely used in the finance industry for several decades in order to prevent risk and to automate and optimize decisions. Regulatory requirements as given by the Basel Committee on Banking Supervision (BCBS) or the EU data protection regulations have led to an increasing interest and research activity on understanding black box machine learning models by means of explainable machine learning. Even though this is a step into a right direction, such methods are not able to guarantee for a fair scoring as machine learning models are not necessarily unbiased and may discriminate with respect to certain subpopulations such as a particular race, gender, or sexual orientation—even if the variable itself is not used for modeling. This is also true for white box methods like logistic regression. In this study, a framework is presented that allows analyzing and developing models with regard to fairness. The proposed methodology is based on techniques of causal inference and some of the methods can be linked to methods from explainable machine learning. A definition of counterfactual fairness is given together with an algorithm that results in a fair scoring model. The concepts are illustrated by means of a transparent simulation and a popular real-world example, the German Credit data using traditional scorecard models based on logistic regression and weight of evidence variable pre-transform. In contrast to previous studies in the field for our study, a corrected version of the data is presented and used. With the help of the simulation, the trade-off between fairness and predictive accuracy is analyzed. The results indicate that it is possible to remove unfairness without a strong performance decrease unless the correlation of the discriminative attributes on the other predictor variables in the model is not too strong. In addition, the challenge in explaining the resulting scoring model and the associated fairness implications to users is discussed.
format	Online Article Text
id	pubmed-8552888
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-85528882021-10-29 Facing the Challenges of Developing Fair Risk Scoring Models Szepannek, Gero Lübke, Karsten Front Artif Intell Artificial Intelligence Algorithmic scoring methods are widely used in the finance industry for several decades in order to prevent risk and to automate and optimize decisions. Regulatory requirements as given by the Basel Committee on Banking Supervision (BCBS) or the EU data protection regulations have led to an increasing interest and research activity on understanding black box machine learning models by means of explainable machine learning. Even though this is a step into a right direction, such methods are not able to guarantee for a fair scoring as machine learning models are not necessarily unbiased and may discriminate with respect to certain subpopulations such as a particular race, gender, or sexual orientation—even if the variable itself is not used for modeling. This is also true for white box methods like logistic regression. In this study, a framework is presented that allows analyzing and developing models with regard to fairness. The proposed methodology is based on techniques of causal inference and some of the methods can be linked to methods from explainable machine learning. A definition of counterfactual fairness is given together with an algorithm that results in a fair scoring model. The concepts are illustrated by means of a transparent simulation and a popular real-world example, the German Credit data using traditional scorecard models based on logistic regression and weight of evidence variable pre-transform. In contrast to previous studies in the field for our study, a corrected version of the data is presented and used. With the help of the simulation, the trade-off between fairness and predictive accuracy is analyzed. The results indicate that it is possible to remove unfairness without a strong performance decrease unless the correlation of the discriminative attributes on the other predictor variables in the model is not too strong. In addition, the challenge in explaining the resulting scoring model and the associated fairness implications to users is discussed. Frontiers Media S.A. 2021-10-14 /pmc/articles/PMC8552888/ /pubmed/34723172 http://dx.doi.org/10.3389/frai.2021.681915 Text en Copyright © 2021 Szepannek and Lübke. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Artificial Intelligence Szepannek, Gero Lübke, Karsten Facing the Challenges of Developing Fair Risk Scoring Models
title	Facing the Challenges of Developing Fair Risk Scoring Models
title_full	Facing the Challenges of Developing Fair Risk Scoring Models
title_fullStr	Facing the Challenges of Developing Fair Risk Scoring Models
title_full_unstemmed	Facing the Challenges of Developing Fair Risk Scoring Models
title_short	Facing the Challenges of Developing Fair Risk Scoring Models
title_sort	facing the challenges of developing fair risk scoring models
topic	Artificial Intelligence
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8552888/ https://www.ncbi.nlm.nih.gov/pubmed/34723172 http://dx.doi.org/10.3389/frai.2021.681915
work_keys_str_mv	AT szepannekgero facingthechallengesofdevelopingfairriskscoringmodels AT lubkekarsten facingthechallengesofdevelopingfairriskscoringmodels

Facing the Challenges of Developing Fair Risk Scoring Models

Ejemplares similares