Cargando…

CTD: Fast, accurate, and interpretable method for static and dynamic tensor decompositions

How can we find patterns and anomalies in a tensor, i.e., multi-dimensional array, in an efficient and directly interpretable way? How can we do this in an online environment, where a new tensor arrives at each time step? Finding patterns and anomalies in multi-dimensional data have many important a...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Jungwoo, Choi, Dongjin, Sael, Lee
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6059458/
https://www.ncbi.nlm.nih.gov/pubmed/30044837
http://dx.doi.org/10.1371/journal.pone.0200579
_version_ 1783341865832022016
author Lee, Jungwoo
Choi, Dongjin
Sael, Lee
author_facet Lee, Jungwoo
Choi, Dongjin
Sael, Lee
author_sort Lee, Jungwoo
collection PubMed
description How can we find patterns and anomalies in a tensor, i.e., multi-dimensional array, in an efficient and directly interpretable way? How can we do this in an online environment, where a new tensor arrives at each time step? Finding patterns and anomalies in multi-dimensional data have many important applications, including building safety monitoring, health monitoring, cyber security, terrorist detection, and fake user detection in social networks. Standard tensor decomposition results are not directly interpretable and few methods that propose to increase interpretability need to be made faster, more memory efficient, and more accurate for large and quickly generated data in the online environment. We propose two versions of a fast, accurate, and directly interpretable tensor decomposition method we call CTD that is based on efficient sampling method. First is the static version of CTD, i.e., CTD-S, that provably guarantees up to 11× higher accuracy than that of the state-of-the-art method. Also, CTD-S is made up to 2.3× faster and up to 24× more memory-efficient than the state-of-the-art method by removing redundancy. Second is the dynamic version of CTD, i.e. CTD-D, which is the first interpretable dynamic tensor decomposition method ever proposed. It is also made up to 82× faster than the already fast CTD-S by exploiting factors at previous time step and by reordering operations. With CTD, we demonstrate how the results can be effectively interpreted in online distributed denial of service (DDoS) attack detection and online troll detection.
format Online
Article
Text
id pubmed-6059458
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-60594582018-08-09 CTD: Fast, accurate, and interpretable method for static and dynamic tensor decompositions Lee, Jungwoo Choi, Dongjin Sael, Lee PLoS One Research Article How can we find patterns and anomalies in a tensor, i.e., multi-dimensional array, in an efficient and directly interpretable way? How can we do this in an online environment, where a new tensor arrives at each time step? Finding patterns and anomalies in multi-dimensional data have many important applications, including building safety monitoring, health monitoring, cyber security, terrorist detection, and fake user detection in social networks. Standard tensor decomposition results are not directly interpretable and few methods that propose to increase interpretability need to be made faster, more memory efficient, and more accurate for large and quickly generated data in the online environment. We propose two versions of a fast, accurate, and directly interpretable tensor decomposition method we call CTD that is based on efficient sampling method. First is the static version of CTD, i.e., CTD-S, that provably guarantees up to 11× higher accuracy than that of the state-of-the-art method. Also, CTD-S is made up to 2.3× faster and up to 24× more memory-efficient than the state-of-the-art method by removing redundancy. Second is the dynamic version of CTD, i.e. CTD-D, which is the first interpretable dynamic tensor decomposition method ever proposed. It is also made up to 82× faster than the already fast CTD-S by exploiting factors at previous time step and by reordering operations. With CTD, we demonstrate how the results can be effectively interpreted in online distributed denial of service (DDoS) attack detection and online troll detection. Public Library of Science 2018-07-25 /pmc/articles/PMC6059458/ /pubmed/30044837 http://dx.doi.org/10.1371/journal.pone.0200579 Text en © 2018 Lee et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Lee, Jungwoo
Choi, Dongjin
Sael, Lee
CTD: Fast, accurate, and interpretable method for static and dynamic tensor decompositions
title CTD: Fast, accurate, and interpretable method for static and dynamic tensor decompositions
title_full CTD: Fast, accurate, and interpretable method for static and dynamic tensor decompositions
title_fullStr CTD: Fast, accurate, and interpretable method for static and dynamic tensor decompositions
title_full_unstemmed CTD: Fast, accurate, and interpretable method for static and dynamic tensor decompositions
title_short CTD: Fast, accurate, and interpretable method for static and dynamic tensor decompositions
title_sort ctd: fast, accurate, and interpretable method for static and dynamic tensor decompositions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6059458/
https://www.ncbi.nlm.nih.gov/pubmed/30044837
http://dx.doi.org/10.1371/journal.pone.0200579
work_keys_str_mv AT leejungwoo ctdfastaccurateandinterpretablemethodforstaticanddynamictensordecompositions
AT choidongjin ctdfastaccurateandinterpretablemethodforstaticanddynamictensordecompositions
AT saellee ctdfastaccurateandinterpretablemethodforstaticanddynamictensordecompositions