Cargando…

A Benford's law based method for fraud detection using R Library

Benford Law (BL) states that the occurrence of significant digits in many natural and human phenomena data sets are not uniformly scattered, as one could naively expect, but follow a logarithmic-type distribution. Here, we present a method that consists of the use of BL analysis over first and first...

Descripción completa

Detalles Bibliográficos
Autores principales: Azevedo, Caio da Silva, Gonçalves, Rodrigo Franco, Gava, Vagner Luiz, Spinola, Mauro de Mesquita
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8720889/
https://www.ncbi.nlm.nih.gov/pubmed/35004209
http://dx.doi.org/10.1016/j.mex.2021.101575
_version_ 1784625222129811456
author Azevedo, Caio da Silva
Gonçalves, Rodrigo Franco
Gava, Vagner Luiz
Spinola, Mauro de Mesquita
author_facet Azevedo, Caio da Silva
Gonçalves, Rodrigo Franco
Gava, Vagner Luiz
Spinola, Mauro de Mesquita
author_sort Azevedo, Caio da Silva
collection PubMed
description Benford Law (BL) states that the occurrence of significant digits in many natural and human phenomena data sets are not uniformly scattered, as one could naively expect, but follow a logarithmic-type distribution. Here, we present a method that consists of the use of BL analysis over first and first-two digits, three statistical conformity tests – Z-statistics, Mean Absolute Deviation (MAD) and Chi-square (χ2) as well as the summation test which looks for excessively large numbers, having fraud detection as one of its application. We developed the method for fraud detection in the case of the Brazilian Bolsa Familia welfare program. In this case, we submitted four periods of Brazilian welfare program payments to the method with a dataset of 13,442,529 records. We provide a practical implementation of the method based on open-source R library released on a public repository. Furthermore, code implementation of the algorithm as well as datasets are freely available. Advantages of the algorithm are listed below: • The method was developed based on open source libraries • The technique is simple, rapid and ease of use • Easily applicable to other social welfare program auditing
format Online
Article
Text
id pubmed-8720889
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-87208892022-01-07 A Benford's law based method for fraud detection using R Library Azevedo, Caio da Silva Gonçalves, Rodrigo Franco Gava, Vagner Luiz Spinola, Mauro de Mesquita MethodsX Method Article Benford Law (BL) states that the occurrence of significant digits in many natural and human phenomena data sets are not uniformly scattered, as one could naively expect, but follow a logarithmic-type distribution. Here, we present a method that consists of the use of BL analysis over first and first-two digits, three statistical conformity tests – Z-statistics, Mean Absolute Deviation (MAD) and Chi-square (χ2) as well as the summation test which looks for excessively large numbers, having fraud detection as one of its application. We developed the method for fraud detection in the case of the Brazilian Bolsa Familia welfare program. In this case, we submitted four periods of Brazilian welfare program payments to the method with a dataset of 13,442,529 records. We provide a practical implementation of the method based on open-source R library released on a public repository. Furthermore, code implementation of the algorithm as well as datasets are freely available. Advantages of the algorithm are listed below: • The method was developed based on open source libraries • The technique is simple, rapid and ease of use • Easily applicable to other social welfare program auditing Elsevier 2021-11-11 /pmc/articles/PMC8720889/ /pubmed/35004209 http://dx.doi.org/10.1016/j.mex.2021.101575 Text en © 2021 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Method Article
Azevedo, Caio da Silva
Gonçalves, Rodrigo Franco
Gava, Vagner Luiz
Spinola, Mauro de Mesquita
A Benford's law based method for fraud detection using R Library
title A Benford's law based method for fraud detection using R Library
title_full A Benford's law based method for fraud detection using R Library
title_fullStr A Benford's law based method for fraud detection using R Library
title_full_unstemmed A Benford's law based method for fraud detection using R Library
title_short A Benford's law based method for fraud detection using R Library
title_sort benford's law based method for fraud detection using r library
topic Method Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8720889/
https://www.ncbi.nlm.nih.gov/pubmed/35004209
http://dx.doi.org/10.1016/j.mex.2021.101575
work_keys_str_mv AT azevedocaiodasilva abenfordslawbasedmethodforfrauddetectionusingrlibrary
AT goncalvesrodrigofranco abenfordslawbasedmethodforfrauddetectionusingrlibrary
AT gavavagnerluiz abenfordslawbasedmethodforfrauddetectionusingrlibrary
AT spinolamaurodemesquita abenfordslawbasedmethodforfrauddetectionusingrlibrary
AT azevedocaiodasilva benfordslawbasedmethodforfrauddetectionusingrlibrary
AT goncalvesrodrigofranco benfordslawbasedmethodforfrauddetectionusingrlibrary
AT gavavagnerluiz benfordslawbasedmethodforfrauddetectionusingrlibrary
AT spinolamaurodemesquita benfordslawbasedmethodforfrauddetectionusingrlibrary