Cargando…
A Benford's law based method for fraud detection using R Library
Benford Law (BL) states that the occurrence of significant digits in many natural and human phenomena data sets are not uniformly scattered, as one could naively expect, but follow a logarithmic-type distribution. Here, we present a method that consists of the use of BL analysis over first and first...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8720889/ https://www.ncbi.nlm.nih.gov/pubmed/35004209 http://dx.doi.org/10.1016/j.mex.2021.101575 |
_version_ | 1784625222129811456 |
---|---|
author | Azevedo, Caio da Silva Gonçalves, Rodrigo Franco Gava, Vagner Luiz Spinola, Mauro de Mesquita |
author_facet | Azevedo, Caio da Silva Gonçalves, Rodrigo Franco Gava, Vagner Luiz Spinola, Mauro de Mesquita |
author_sort | Azevedo, Caio da Silva |
collection | PubMed |
description | Benford Law (BL) states that the occurrence of significant digits in many natural and human phenomena data sets are not uniformly scattered, as one could naively expect, but follow a logarithmic-type distribution. Here, we present a method that consists of the use of BL analysis over first and first-two digits, three statistical conformity tests – Z-statistics, Mean Absolute Deviation (MAD) and Chi-square (χ2) as well as the summation test which looks for excessively large numbers, having fraud detection as one of its application. We developed the method for fraud detection in the case of the Brazilian Bolsa Familia welfare program. In this case, we submitted four periods of Brazilian welfare program payments to the method with a dataset of 13,442,529 records. We provide a practical implementation of the method based on open-source R library released on a public repository. Furthermore, code implementation of the algorithm as well as datasets are freely available. Advantages of the algorithm are listed below: • The method was developed based on open source libraries • The technique is simple, rapid and ease of use • Easily applicable to other social welfare program auditing |
format | Online Article Text |
id | pubmed-8720889 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-87208892022-01-07 A Benford's law based method for fraud detection using R Library Azevedo, Caio da Silva Gonçalves, Rodrigo Franco Gava, Vagner Luiz Spinola, Mauro de Mesquita MethodsX Method Article Benford Law (BL) states that the occurrence of significant digits in many natural and human phenomena data sets are not uniformly scattered, as one could naively expect, but follow a logarithmic-type distribution. Here, we present a method that consists of the use of BL analysis over first and first-two digits, three statistical conformity tests – Z-statistics, Mean Absolute Deviation (MAD) and Chi-square (χ2) as well as the summation test which looks for excessively large numbers, having fraud detection as one of its application. We developed the method for fraud detection in the case of the Brazilian Bolsa Familia welfare program. In this case, we submitted four periods of Brazilian welfare program payments to the method with a dataset of 13,442,529 records. We provide a practical implementation of the method based on open-source R library released on a public repository. Furthermore, code implementation of the algorithm as well as datasets are freely available. Advantages of the algorithm are listed below: • The method was developed based on open source libraries • The technique is simple, rapid and ease of use • Easily applicable to other social welfare program auditing Elsevier 2021-11-11 /pmc/articles/PMC8720889/ /pubmed/35004209 http://dx.doi.org/10.1016/j.mex.2021.101575 Text en © 2021 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Method Article Azevedo, Caio da Silva Gonçalves, Rodrigo Franco Gava, Vagner Luiz Spinola, Mauro de Mesquita A Benford's law based method for fraud detection using R Library |
title | A Benford's law based method for fraud detection using R Library |
title_full | A Benford's law based method for fraud detection using R Library |
title_fullStr | A Benford's law based method for fraud detection using R Library |
title_full_unstemmed | A Benford's law based method for fraud detection using R Library |
title_short | A Benford's law based method for fraud detection using R Library |
title_sort | benford's law based method for fraud detection using r library |
topic | Method Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8720889/ https://www.ncbi.nlm.nih.gov/pubmed/35004209 http://dx.doi.org/10.1016/j.mex.2021.101575 |
work_keys_str_mv | AT azevedocaiodasilva abenfordslawbasedmethodforfrauddetectionusingrlibrary AT goncalvesrodrigofranco abenfordslawbasedmethodforfrauddetectionusingrlibrary AT gavavagnerluiz abenfordslawbasedmethodforfrauddetectionusingrlibrary AT spinolamaurodemesquita abenfordslawbasedmethodforfrauddetectionusingrlibrary AT azevedocaiodasilva benfordslawbasedmethodforfrauddetectionusingrlibrary AT goncalvesrodrigofranco benfordslawbasedmethodforfrauddetectionusingrlibrary AT gavavagnerluiz benfordslawbasedmethodforfrauddetectionusingrlibrary AT spinolamaurodemesquita benfordslawbasedmethodforfrauddetectionusingrlibrary |