Cargando…
A numeric-based machine learning design for detecting organized retail fraud in digital marketplaces
Organized retail crime (ORC) is a significant issue for retailers, marketplace platforms, and consumers. Its prevalence and influence have increased fast in lockstep with the expansion of online commerce, digital devices, and communication platforms. Today, it is a costly affair, wreaking havoc on e...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10397305/ https://www.ncbi.nlm.nih.gov/pubmed/37532696 http://dx.doi.org/10.1038/s41598-023-38304-5 |
_version_ | 1785083886002241536 |
---|---|
author | Mutemi, Abed Bacao, Fernando |
author_facet | Mutemi, Abed Bacao, Fernando |
author_sort | Mutemi, Abed |
collection | PubMed |
description | Organized retail crime (ORC) is a significant issue for retailers, marketplace platforms, and consumers. Its prevalence and influence have increased fast in lockstep with the expansion of online commerce, digital devices, and communication platforms. Today, it is a costly affair, wreaking havoc on enterprises’ overall revenues and continually jeopardizing community security. These negative consequences are set to rocket to unprecedented heights as more people and devices connect to the Internet. Detecting and responding to these terrible acts as early as possible is critical for protecting consumers and businesses while also keeping an eye on rising patterns and fraud. The issue of detecting fraud in general has been studied widely, especially in financial services, but studies focusing on organized retail crimes are extremely rare in literature. To contribute to the knowledge base in this area, we present a scalable machine learning strategy for detecting and isolating ORC listings on a prominent marketplace platform by merchants committing organized retail crimes or fraud. We employ a supervised learning approach to classify postings as fraudulent or real based on past data from buyer and seller behaviors and transactions on the platform. The proposed framework combines bespoke data preprocessing procedures, feature selection methods, and state-of-the-art class asymmetry resolution techniques to search for aligned classification algorithms capable of discriminating between fraudulent and legitimate listings in this context. Our best detection model obtains a recall score of 0.97 on the holdout set and 0.94 on the out-of-sample testing data set. We achieve these results based on a select set of 45 features out of 58. |
format | Online Article Text |
id | pubmed-10397305 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-103973052023-08-04 A numeric-based machine learning design for detecting organized retail fraud in digital marketplaces Mutemi, Abed Bacao, Fernando Sci Rep Article Organized retail crime (ORC) is a significant issue for retailers, marketplace platforms, and consumers. Its prevalence and influence have increased fast in lockstep with the expansion of online commerce, digital devices, and communication platforms. Today, it is a costly affair, wreaking havoc on enterprises’ overall revenues and continually jeopardizing community security. These negative consequences are set to rocket to unprecedented heights as more people and devices connect to the Internet. Detecting and responding to these terrible acts as early as possible is critical for protecting consumers and businesses while also keeping an eye on rising patterns and fraud. The issue of detecting fraud in general has been studied widely, especially in financial services, but studies focusing on organized retail crimes are extremely rare in literature. To contribute to the knowledge base in this area, we present a scalable machine learning strategy for detecting and isolating ORC listings on a prominent marketplace platform by merchants committing organized retail crimes or fraud. We employ a supervised learning approach to classify postings as fraudulent or real based on past data from buyer and seller behaviors and transactions on the platform. The proposed framework combines bespoke data preprocessing procedures, feature selection methods, and state-of-the-art class asymmetry resolution techniques to search for aligned classification algorithms capable of discriminating between fraudulent and legitimate listings in this context. Our best detection model obtains a recall score of 0.97 on the holdout set and 0.94 on the out-of-sample testing data set. We achieve these results based on a select set of 45 features out of 58. Nature Publishing Group UK 2023-08-02 /pmc/articles/PMC10397305/ /pubmed/37532696 http://dx.doi.org/10.1038/s41598-023-38304-5 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Mutemi, Abed Bacao, Fernando A numeric-based machine learning design for detecting organized retail fraud in digital marketplaces |
title | A numeric-based machine learning design for detecting organized retail fraud in digital marketplaces |
title_full | A numeric-based machine learning design for detecting organized retail fraud in digital marketplaces |
title_fullStr | A numeric-based machine learning design for detecting organized retail fraud in digital marketplaces |
title_full_unstemmed | A numeric-based machine learning design for detecting organized retail fraud in digital marketplaces |
title_short | A numeric-based machine learning design for detecting organized retail fraud in digital marketplaces |
title_sort | numeric-based machine learning design for detecting organized retail fraud in digital marketplaces |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10397305/ https://www.ncbi.nlm.nih.gov/pubmed/37532696 http://dx.doi.org/10.1038/s41598-023-38304-5 |
work_keys_str_mv | AT mutemiabed anumericbasedmachinelearningdesignfordetectingorganizedretailfraudindigitalmarketplaces AT bacaofernando anumericbasedmachinelearningdesignfordetectingorganizedretailfraudindigitalmarketplaces AT mutemiabed numericbasedmachinelearningdesignfordetectingorganizedretailfraudindigitalmarketplaces AT bacaofernando numericbasedmachinelearningdesignfordetectingorganizedretailfraudindigitalmarketplaces |