Cargando…
A two-route CNN model for bank account classification with heterogeneous data
Classifying bank accounts by using transaction data is encouraging in cracking down on illegal financial activities. However, few research simultaneously use heterogenous features, which are embedded in the time series data. In this paper, a two route convolution neural network TRHD-CNN model, fed w...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6699796/ https://www.ncbi.nlm.nih.gov/pubmed/31425545 http://dx.doi.org/10.1371/journal.pone.0220631 |
_version_ | 1783444770318712832 |
---|---|
author | Lv, Fang Huang, Junheng Wang, Wei Wei, Yuliang Sun, Yunxiao Wang, Bailing |
author_facet | Lv, Fang Huang, Junheng Wang, Wei Wei, Yuliang Sun, Yunxiao Wang, Bailing |
author_sort | Lv, Fang |
collection | PubMed |
description | Classifying bank accounts by using transaction data is encouraging in cracking down on illegal financial activities. However, few research simultaneously use heterogenous features, which are embedded in the time series data. In this paper, a two route convolution neural network TRHD-CNN model, fed with two types of heterogeneous feature matrices, is proposed for classifying the bank accounts. TRHD-CNN adopts divide and conquer strategy to extract characteristics from two types of data source independently. The strategy is proved able in mining complementary classification characteristics. We firstly transfer the original log data into a directed and dynamic transaction network. On the basis of that, two feature generation methods are devised for extracting information from local topological structure and time series transaction respectively. A DirectedWalk method is developed in this paper to learning the network vector of vertices used for embedding the neighbor relationship of bank account. The extensive experimental results, conducted on a real bank transaction dataset that contains illegal pyramid selling accounts, show the significant advantage of TRHD-CNN over the existing methods. TRHD-CNN can provide recall scores up to 5.15% higher than competing methods. In addition, the two-route architecture of TRHD-CNN is easy to extend to multi-route scenarios and other fields. |
format | Online Article Text |
id | pubmed-6699796 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-66997962019-09-04 A two-route CNN model for bank account classification with heterogeneous data Lv, Fang Huang, Junheng Wang, Wei Wei, Yuliang Sun, Yunxiao Wang, Bailing PLoS One Research Article Classifying bank accounts by using transaction data is encouraging in cracking down on illegal financial activities. However, few research simultaneously use heterogenous features, which are embedded in the time series data. In this paper, a two route convolution neural network TRHD-CNN model, fed with two types of heterogeneous feature matrices, is proposed for classifying the bank accounts. TRHD-CNN adopts divide and conquer strategy to extract characteristics from two types of data source independently. The strategy is proved able in mining complementary classification characteristics. We firstly transfer the original log data into a directed and dynamic transaction network. On the basis of that, two feature generation methods are devised for extracting information from local topological structure and time series transaction respectively. A DirectedWalk method is developed in this paper to learning the network vector of vertices used for embedding the neighbor relationship of bank account. The extensive experimental results, conducted on a real bank transaction dataset that contains illegal pyramid selling accounts, show the significant advantage of TRHD-CNN over the existing methods. TRHD-CNN can provide recall scores up to 5.15% higher than competing methods. In addition, the two-route architecture of TRHD-CNN is easy to extend to multi-route scenarios and other fields. Public Library of Science 2019-08-19 /pmc/articles/PMC6699796/ /pubmed/31425545 http://dx.doi.org/10.1371/journal.pone.0220631 Text en © 2019 Lv et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Lv, Fang Huang, Junheng Wang, Wei Wei, Yuliang Sun, Yunxiao Wang, Bailing A two-route CNN model for bank account classification with heterogeneous data |
title | A two-route CNN model for bank account classification with heterogeneous data |
title_full | A two-route CNN model for bank account classification with heterogeneous data |
title_fullStr | A two-route CNN model for bank account classification with heterogeneous data |
title_full_unstemmed | A two-route CNN model for bank account classification with heterogeneous data |
title_short | A two-route CNN model for bank account classification with heterogeneous data |
title_sort | two-route cnn model for bank account classification with heterogeneous data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6699796/ https://www.ncbi.nlm.nih.gov/pubmed/31425545 http://dx.doi.org/10.1371/journal.pone.0220631 |
work_keys_str_mv | AT lvfang atworoutecnnmodelforbankaccountclassificationwithheterogeneousdata AT huangjunheng atworoutecnnmodelforbankaccountclassificationwithheterogeneousdata AT wangwei atworoutecnnmodelforbankaccountclassificationwithheterogeneousdata AT weiyuliang atworoutecnnmodelforbankaccountclassificationwithheterogeneousdata AT sunyunxiao atworoutecnnmodelforbankaccountclassificationwithheterogeneousdata AT wangbailing atworoutecnnmodelforbankaccountclassificationwithheterogeneousdata AT lvfang tworoutecnnmodelforbankaccountclassificationwithheterogeneousdata AT huangjunheng tworoutecnnmodelforbankaccountclassificationwithheterogeneousdata AT wangwei tworoutecnnmodelforbankaccountclassificationwithheterogeneousdata AT weiyuliang tworoutecnnmodelforbankaccountclassificationwithheterogeneousdata AT sunyunxiao tworoutecnnmodelforbankaccountclassificationwithheterogeneousdata AT wangbailing tworoutecnnmodelforbankaccountclassificationwithheterogeneousdata |