Cargando…

S(3)CMTF: Fast, accurate, and scalable method for incomplete coupled matrix-tensor factorization

How can we extract hidden relations from a tensor and a matrix data simultaneously in a fast, accurate, and scalable way? Coupled matrix-tensor factorization (CMTF) is an important tool for this purpose. Designing an accurate and efficient CMTF method has become more crucial as the size and dimensio...

Descripción completa

Detalles Bibliográficos
Autores principales: Choi, Dongjin, Jang, Jun-Gi, Kang, U
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6599158/
https://www.ncbi.nlm.nih.gov/pubmed/31251750
http://dx.doi.org/10.1371/journal.pone.0217316
_version_ 1783430904979390464
author Choi, Dongjin
Jang, Jun-Gi
Kang, U
author_facet Choi, Dongjin
Jang, Jun-Gi
Kang, U
author_sort Choi, Dongjin
collection PubMed
description How can we extract hidden relations from a tensor and a matrix data simultaneously in a fast, accurate, and scalable way? Coupled matrix-tensor factorization (CMTF) is an important tool for this purpose. Designing an accurate and efficient CMTF method has become more crucial as the size and dimension of real-world data are growing explosively. However, existing methods for CMTF suffer from lack of accuracy, slow running time, and limited scalability. In this paper, we propose S(3)CMTF, a fast, accurate, and scalable CMTF method. In contrast to previous methods which do not handle large sparse tensors and are not parallelizable, S(3)CMTF provides parallel sparse CMTF by carefully deriving gradient update rules. S(3)CMTF asynchronously updates partial gradients without expensive locking. We show that our method is guaranteed to converge to a quality solution theoretically and empirically. S(3)CMTF further boosts the performance by carefully storing intermediate computation and reusing them. We theoretically and empirically show that S(3)CMTF is the fastest, outperforming existing methods. Experimental results show that S(3)CMTF is up to 930× faster than existing methods while providing the best accuracy. S(3)CMTF shows linear scalability on the number of data entries and the number of cores. In addition, we apply S(3)CMTF to Yelp rating tensor data coupled with 3 additional matrices to discover interesting patterns.
format Online
Article
Text
id pubmed-6599158
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-65991582019-07-12 S(3)CMTF: Fast, accurate, and scalable method for incomplete coupled matrix-tensor factorization Choi, Dongjin Jang, Jun-Gi Kang, U PLoS One Research Article How can we extract hidden relations from a tensor and a matrix data simultaneously in a fast, accurate, and scalable way? Coupled matrix-tensor factorization (CMTF) is an important tool for this purpose. Designing an accurate and efficient CMTF method has become more crucial as the size and dimension of real-world data are growing explosively. However, existing methods for CMTF suffer from lack of accuracy, slow running time, and limited scalability. In this paper, we propose S(3)CMTF, a fast, accurate, and scalable CMTF method. In contrast to previous methods which do not handle large sparse tensors and are not parallelizable, S(3)CMTF provides parallel sparse CMTF by carefully deriving gradient update rules. S(3)CMTF asynchronously updates partial gradients without expensive locking. We show that our method is guaranteed to converge to a quality solution theoretically and empirically. S(3)CMTF further boosts the performance by carefully storing intermediate computation and reusing them. We theoretically and empirically show that S(3)CMTF is the fastest, outperforming existing methods. Experimental results show that S(3)CMTF is up to 930× faster than existing methods while providing the best accuracy. S(3)CMTF shows linear scalability on the number of data entries and the number of cores. In addition, we apply S(3)CMTF to Yelp rating tensor data coupled with 3 additional matrices to discover interesting patterns. Public Library of Science 2019-06-28 /pmc/articles/PMC6599158/ /pubmed/31251750 http://dx.doi.org/10.1371/journal.pone.0217316 Text en © 2019 Choi et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Choi, Dongjin
Jang, Jun-Gi
Kang, U
S(3)CMTF: Fast, accurate, and scalable method for incomplete coupled matrix-tensor factorization
title S(3)CMTF: Fast, accurate, and scalable method for incomplete coupled matrix-tensor factorization
title_full S(3)CMTF: Fast, accurate, and scalable method for incomplete coupled matrix-tensor factorization
title_fullStr S(3)CMTF: Fast, accurate, and scalable method for incomplete coupled matrix-tensor factorization
title_full_unstemmed S(3)CMTF: Fast, accurate, and scalable method for incomplete coupled matrix-tensor factorization
title_short S(3)CMTF: Fast, accurate, and scalable method for incomplete coupled matrix-tensor factorization
title_sort s(3)cmtf: fast, accurate, and scalable method for incomplete coupled matrix-tensor factorization
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6599158/
https://www.ncbi.nlm.nih.gov/pubmed/31251750
http://dx.doi.org/10.1371/journal.pone.0217316
work_keys_str_mv AT choidongjin s3cmtffastaccurateandscalablemethodforincompletecoupledmatrixtensorfactorization
AT jangjungi s3cmtffastaccurateandscalablemethodforincompletecoupledmatrixtensorfactorization
AT kangu s3cmtffastaccurateandscalablemethodforincompletecoupledmatrixtensorfactorization