Cargando…

A numeric-based machine learning design for detecting organized retail fraud in digital marketplaces

Organized retail crime (ORC) is a significant issue for retailers, marketplace platforms, and consumers. Its prevalence and influence have increased fast in lockstep with the expansion of online commerce, digital devices, and communication platforms. Today, it is a costly affair, wreaking havoc on e...

Descripción completa

Detalles Bibliográficos
Autores principales: Mutemi, Abed, Bacao, Fernando
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10397305/
https://www.ncbi.nlm.nih.gov/pubmed/37532696
http://dx.doi.org/10.1038/s41598-023-38304-5
_version_ 1785083886002241536
author Mutemi, Abed
Bacao, Fernando
author_facet Mutemi, Abed
Bacao, Fernando
author_sort Mutemi, Abed
collection PubMed
description Organized retail crime (ORC) is a significant issue for retailers, marketplace platforms, and consumers. Its prevalence and influence have increased fast in lockstep with the expansion of online commerce, digital devices, and communication platforms. Today, it is a costly affair, wreaking havoc on enterprises’ overall revenues and continually jeopardizing community security. These negative consequences are set to rocket to unprecedented heights as more people and devices connect to the Internet. Detecting and responding to these terrible acts as early as possible is critical for protecting consumers and businesses while also keeping an eye on rising patterns and fraud. The issue of detecting fraud in general has been studied widely, especially in financial services, but studies focusing on organized retail crimes are extremely rare in literature. To contribute to the knowledge base in this area, we present a scalable machine learning strategy for detecting and isolating ORC listings on a prominent marketplace platform by merchants committing organized retail crimes or fraud. We employ a supervised learning approach to classify postings as fraudulent or real based on past data from buyer and seller behaviors and transactions on the platform. The proposed framework combines bespoke data preprocessing procedures, feature selection methods, and state-of-the-art class asymmetry resolution techniques to search for aligned classification algorithms capable of discriminating between fraudulent and legitimate listings in this context. Our best detection model obtains a recall score of 0.97 on the holdout set and 0.94 on the out-of-sample testing data set. We achieve these results based on a select set of 45 features out of 58.
format Online
Article
Text
id pubmed-10397305
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-103973052023-08-04 A numeric-based machine learning design for detecting organized retail fraud in digital marketplaces Mutemi, Abed Bacao, Fernando Sci Rep Article Organized retail crime (ORC) is a significant issue for retailers, marketplace platforms, and consumers. Its prevalence and influence have increased fast in lockstep with the expansion of online commerce, digital devices, and communication platforms. Today, it is a costly affair, wreaking havoc on enterprises’ overall revenues and continually jeopardizing community security. These negative consequences are set to rocket to unprecedented heights as more people and devices connect to the Internet. Detecting and responding to these terrible acts as early as possible is critical for protecting consumers and businesses while also keeping an eye on rising patterns and fraud. The issue of detecting fraud in general has been studied widely, especially in financial services, but studies focusing on organized retail crimes are extremely rare in literature. To contribute to the knowledge base in this area, we present a scalable machine learning strategy for detecting and isolating ORC listings on a prominent marketplace platform by merchants committing organized retail crimes or fraud. We employ a supervised learning approach to classify postings as fraudulent or real based on past data from buyer and seller behaviors and transactions on the platform. The proposed framework combines bespoke data preprocessing procedures, feature selection methods, and state-of-the-art class asymmetry resolution techniques to search for aligned classification algorithms capable of discriminating between fraudulent and legitimate listings in this context. Our best detection model obtains a recall score of 0.97 on the holdout set and 0.94 on the out-of-sample testing data set. We achieve these results based on a select set of 45 features out of 58. Nature Publishing Group UK 2023-08-02 /pmc/articles/PMC10397305/ /pubmed/37532696 http://dx.doi.org/10.1038/s41598-023-38304-5 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Mutemi, Abed
Bacao, Fernando
A numeric-based machine learning design for detecting organized retail fraud in digital marketplaces
title A numeric-based machine learning design for detecting organized retail fraud in digital marketplaces
title_full A numeric-based machine learning design for detecting organized retail fraud in digital marketplaces
title_fullStr A numeric-based machine learning design for detecting organized retail fraud in digital marketplaces
title_full_unstemmed A numeric-based machine learning design for detecting organized retail fraud in digital marketplaces
title_short A numeric-based machine learning design for detecting organized retail fraud in digital marketplaces
title_sort numeric-based machine learning design for detecting organized retail fraud in digital marketplaces
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10397305/
https://www.ncbi.nlm.nih.gov/pubmed/37532696
http://dx.doi.org/10.1038/s41598-023-38304-5
work_keys_str_mv AT mutemiabed anumericbasedmachinelearningdesignfordetectingorganizedretailfraudindigitalmarketplaces
AT bacaofernando anumericbasedmachinelearningdesignfordetectingorganizedretailfraudindigitalmarketplaces
AT mutemiabed numericbasedmachinelearningdesignfordetectingorganizedretailfraudindigitalmarketplaces
AT bacaofernando numericbasedmachinelearningdesignfordetectingorganizedretailfraudindigitalmarketplaces