Cargando…

Feature generation and contribution comparison for electronic fraud detection

Modern money transfer services are convenient, attracting fraudulent actors to run scams in which victims are deceived into transferring funds to fraudulent accounts. Machine learning models are broadly applied due to the poor fraud detection performance of traditional rule-based approaches. Learnin...

Descripción completa

Detalles Bibliográficos
Autores principales: Ti, Yen-Wu, Hsin, Yu-Yen, Dai, Tian-Shyr, Huang, Ming-Chuan, Liu, Liang-Chih
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9613914/
https://www.ncbi.nlm.nih.gov/pubmed/36302818
http://dx.doi.org/10.1038/s41598-022-22130-2
_version_ 1784820075139694592
author Ti, Yen-Wu
Hsin, Yu-Yen
Dai, Tian-Shyr
Huang, Ming-Chuan
Liu, Liang-Chih
author_facet Ti, Yen-Wu
Hsin, Yu-Yen
Dai, Tian-Shyr
Huang, Ming-Chuan
Liu, Liang-Chih
author_sort Ti, Yen-Wu
collection PubMed
description Modern money transfer services are convenient, attracting fraudulent actors to run scams in which victims are deceived into transferring funds to fraudulent accounts. Machine learning models are broadly applied due to the poor fraud detection performance of traditional rule-based approaches. Learning directly from raw transaction data is impractical due to its high-dimensional nature; most studies construct features instead by extracting patterns from raw transaction data. Past literature categorizes these features into recency, frequency, monetary, and anomaly detection features. We use various machine learning algorithms to examine the performance of features in these four categories with real transaction data; we compare them with the performance of our feature generation guideline based on the statistical perspectives and characteristics of (non)-fraudulent accounts. The results show that except for the monetary category, other feature categories used in the literature perform poorly regardless of which machine learning algorithm is used; anomaly detection features perform the worst. We find that even statistical features generated based on financial knowledge yield limited performance on a real transaction dataset. Our atypical detection characteristic of normal accounts improves the ability to distinguish them from fraudulent accounts and hence improves the overall detection results, outperforming other existent methods.
format Online
Article
Text
id pubmed-9613914
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-96139142022-10-29 Feature generation and contribution comparison for electronic fraud detection Ti, Yen-Wu Hsin, Yu-Yen Dai, Tian-Shyr Huang, Ming-Chuan Liu, Liang-Chih Sci Rep Article Modern money transfer services are convenient, attracting fraudulent actors to run scams in which victims are deceived into transferring funds to fraudulent accounts. Machine learning models are broadly applied due to the poor fraud detection performance of traditional rule-based approaches. Learning directly from raw transaction data is impractical due to its high-dimensional nature; most studies construct features instead by extracting patterns from raw transaction data. Past literature categorizes these features into recency, frequency, monetary, and anomaly detection features. We use various machine learning algorithms to examine the performance of features in these four categories with real transaction data; we compare them with the performance of our feature generation guideline based on the statistical perspectives and characteristics of (non)-fraudulent accounts. The results show that except for the monetary category, other feature categories used in the literature perform poorly regardless of which machine learning algorithm is used; anomaly detection features perform the worst. We find that even statistical features generated based on financial knowledge yield limited performance on a real transaction dataset. Our atypical detection characteristic of normal accounts improves the ability to distinguish them from fraudulent accounts and hence improves the overall detection results, outperforming other existent methods. Nature Publishing Group UK 2022-10-27 /pmc/articles/PMC9613914/ /pubmed/36302818 http://dx.doi.org/10.1038/s41598-022-22130-2 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Ti, Yen-Wu
Hsin, Yu-Yen
Dai, Tian-Shyr
Huang, Ming-Chuan
Liu, Liang-Chih
Feature generation and contribution comparison for electronic fraud detection
title Feature generation and contribution comparison for electronic fraud detection
title_full Feature generation and contribution comparison for electronic fraud detection
title_fullStr Feature generation and contribution comparison for electronic fraud detection
title_full_unstemmed Feature generation and contribution comparison for electronic fraud detection
title_short Feature generation and contribution comparison for electronic fraud detection
title_sort feature generation and contribution comparison for electronic fraud detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9613914/
https://www.ncbi.nlm.nih.gov/pubmed/36302818
http://dx.doi.org/10.1038/s41598-022-22130-2
work_keys_str_mv AT tiyenwu featuregenerationandcontributioncomparisonforelectronicfrauddetection
AT hsinyuyen featuregenerationandcontributioncomparisonforelectronicfrauddetection
AT daitianshyr featuregenerationandcontributioncomparisonforelectronicfrauddetection
AT huangmingchuan featuregenerationandcontributioncomparisonforelectronicfrauddetection
AT liuliangchih featuregenerationandcontributioncomparisonforelectronicfrauddetection