Cargando…

A two-route CNN model for bank account classification with heterogeneous data

Classifying bank accounts by using transaction data is encouraging in cracking down on illegal financial activities. However, few research simultaneously use heterogenous features, which are embedded in the time series data. In this paper, a two route convolution neural network TRHD-CNN model, fed w...

Descripción completa

Detalles Bibliográficos
Autores principales: Lv, Fang, Huang, Junheng, Wang, Wei, Wei, Yuliang, Sun, Yunxiao, Wang, Bailing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6699796/
https://www.ncbi.nlm.nih.gov/pubmed/31425545
http://dx.doi.org/10.1371/journal.pone.0220631
_version_ 1783444770318712832
author Lv, Fang
Huang, Junheng
Wang, Wei
Wei, Yuliang
Sun, Yunxiao
Wang, Bailing
author_facet Lv, Fang
Huang, Junheng
Wang, Wei
Wei, Yuliang
Sun, Yunxiao
Wang, Bailing
author_sort Lv, Fang
collection PubMed
description Classifying bank accounts by using transaction data is encouraging in cracking down on illegal financial activities. However, few research simultaneously use heterogenous features, which are embedded in the time series data. In this paper, a two route convolution neural network TRHD-CNN model, fed with two types of heterogeneous feature matrices, is proposed for classifying the bank accounts. TRHD-CNN adopts divide and conquer strategy to extract characteristics from two types of data source independently. The strategy is proved able in mining complementary classification characteristics. We firstly transfer the original log data into a directed and dynamic transaction network. On the basis of that, two feature generation methods are devised for extracting information from local topological structure and time series transaction respectively. A DirectedWalk method is developed in this paper to learning the network vector of vertices used for embedding the neighbor relationship of bank account. The extensive experimental results, conducted on a real bank transaction dataset that contains illegal pyramid selling accounts, show the significant advantage of TRHD-CNN over the existing methods. TRHD-CNN can provide recall scores up to 5.15% higher than competing methods. In addition, the two-route architecture of TRHD-CNN is easy to extend to multi-route scenarios and other fields.
format Online
Article
Text
id pubmed-6699796
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-66997962019-09-04 A two-route CNN model for bank account classification with heterogeneous data Lv, Fang Huang, Junheng Wang, Wei Wei, Yuliang Sun, Yunxiao Wang, Bailing PLoS One Research Article Classifying bank accounts by using transaction data is encouraging in cracking down on illegal financial activities. However, few research simultaneously use heterogenous features, which are embedded in the time series data. In this paper, a two route convolution neural network TRHD-CNN model, fed with two types of heterogeneous feature matrices, is proposed for classifying the bank accounts. TRHD-CNN adopts divide and conquer strategy to extract characteristics from two types of data source independently. The strategy is proved able in mining complementary classification characteristics. We firstly transfer the original log data into a directed and dynamic transaction network. On the basis of that, two feature generation methods are devised for extracting information from local topological structure and time series transaction respectively. A DirectedWalk method is developed in this paper to learning the network vector of vertices used for embedding the neighbor relationship of bank account. The extensive experimental results, conducted on a real bank transaction dataset that contains illegal pyramid selling accounts, show the significant advantage of TRHD-CNN over the existing methods. TRHD-CNN can provide recall scores up to 5.15% higher than competing methods. In addition, the two-route architecture of TRHD-CNN is easy to extend to multi-route scenarios and other fields. Public Library of Science 2019-08-19 /pmc/articles/PMC6699796/ /pubmed/31425545 http://dx.doi.org/10.1371/journal.pone.0220631 Text en © 2019 Lv et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Lv, Fang
Huang, Junheng
Wang, Wei
Wei, Yuliang
Sun, Yunxiao
Wang, Bailing
A two-route CNN model for bank account classification with heterogeneous data
title A two-route CNN model for bank account classification with heterogeneous data
title_full A two-route CNN model for bank account classification with heterogeneous data
title_fullStr A two-route CNN model for bank account classification with heterogeneous data
title_full_unstemmed A two-route CNN model for bank account classification with heterogeneous data
title_short A two-route CNN model for bank account classification with heterogeneous data
title_sort two-route cnn model for bank account classification with heterogeneous data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6699796/
https://www.ncbi.nlm.nih.gov/pubmed/31425545
http://dx.doi.org/10.1371/journal.pone.0220631
work_keys_str_mv AT lvfang atworoutecnnmodelforbankaccountclassificationwithheterogeneousdata
AT huangjunheng atworoutecnnmodelforbankaccountclassificationwithheterogeneousdata
AT wangwei atworoutecnnmodelforbankaccountclassificationwithheterogeneousdata
AT weiyuliang atworoutecnnmodelforbankaccountclassificationwithheterogeneousdata
AT sunyunxiao atworoutecnnmodelforbankaccountclassificationwithheterogeneousdata
AT wangbailing atworoutecnnmodelforbankaccountclassificationwithheterogeneousdata
AT lvfang tworoutecnnmodelforbankaccountclassificationwithheterogeneousdata
AT huangjunheng tworoutecnnmodelforbankaccountclassificationwithheterogeneousdata
AT wangwei tworoutecnnmodelforbankaccountclassificationwithheterogeneousdata
AT weiyuliang tworoutecnnmodelforbankaccountclassificationwithheterogeneousdata
AT sunyunxiao tworoutecnnmodelforbankaccountclassificationwithheterogeneousdata
AT wangbailing tworoutecnnmodelforbankaccountclassificationwithheterogeneousdata