Cargando…

Provable Convex Co-clustering of Tensors

Cluster analysis is a fundamental tool for pattern discovery of complex heterogeneous data. Prevalent clustering methods mainly focus on vector or matrix-variate data and are not applicable to general-order tensors, which arise frequently in modern scientific and business applications. Moreover, the...

Descripción completa

Detalles Bibliográficos
Autores principales: Chi, Eric C., Gaines, Brian R., Sun, Will Wei, Zhou, Hua, Yang, Jian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7731944/
https://www.ncbi.nlm.nih.gov/pubmed/33312074
_version_ 1783621997118357504
author Chi, Eric C.
Gaines, Brian R.
Sun, Will Wei
Zhou, Hua
Yang, Jian
author_facet Chi, Eric C.
Gaines, Brian R.
Sun, Will Wei
Zhou, Hua
Yang, Jian
author_sort Chi, Eric C.
collection PubMed
description Cluster analysis is a fundamental tool for pattern discovery of complex heterogeneous data. Prevalent clustering methods mainly focus on vector or matrix-variate data and are not applicable to general-order tensors, which arise frequently in modern scientific and business applications. Moreover, there is a gap between statistical guarantees and computational efficiency for existing tensor clustering solutions due to the nature of their non-convex formulations. In this work, we bridge this gap by developing a provable convex formulation of tensor co-clustering. Our convex co-clustering (CoCo) estimator enjoys stability guarantees and its computational and storage costs are polynomial in the size of the data. We further establish a non-asymptotic error bound for the CoCo estimator, which reveals a surprising “blessing of dimensionality” phenomenon that does not exist in vector or matrix-variate cluster analysis. Our theoretical findings are supported by extensive simulated studies. Finally, we apply the CoCo estimator to the cluster analysis of advertisement click tensor data from a major online company. Our clustering results provide meaningful business insights to improve advertising effectiveness.
format Online
Article
Text
id pubmed-7731944
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-77319442020-12-11 Provable Convex Co-clustering of Tensors Chi, Eric C. Gaines, Brian R. Sun, Will Wei Zhou, Hua Yang, Jian J Mach Learn Res Article Cluster analysis is a fundamental tool for pattern discovery of complex heterogeneous data. Prevalent clustering methods mainly focus on vector or matrix-variate data and are not applicable to general-order tensors, which arise frequently in modern scientific and business applications. Moreover, there is a gap between statistical guarantees and computational efficiency for existing tensor clustering solutions due to the nature of their non-convex formulations. In this work, we bridge this gap by developing a provable convex formulation of tensor co-clustering. Our convex co-clustering (CoCo) estimator enjoys stability guarantees and its computational and storage costs are polynomial in the size of the data. We further establish a non-asymptotic error bound for the CoCo estimator, which reveals a surprising “blessing of dimensionality” phenomenon that does not exist in vector or matrix-variate cluster analysis. Our theoretical findings are supported by extensive simulated studies. Finally, we apply the CoCo estimator to the cluster analysis of advertisement click tensor data from a major online company. Our clustering results provide meaningful business insights to improve advertising effectiveness. 2020 /pmc/articles/PMC7731944/ /pubmed/33312074 Text en https://creativecommons.org/licenses/by/4.0/License: CC-BY 4.0, see https://creativecommons.org/licenses/by/4.0/. (https://creativecommons.org/licenses/by/4.0/) Attribution requirements are provided at http://jmlr.org/papers/v21/18-155.html. (http://jmlr.org/papers/v21/18-155.html)
spellingShingle Article
Chi, Eric C.
Gaines, Brian R.
Sun, Will Wei
Zhou, Hua
Yang, Jian
Provable Convex Co-clustering of Tensors
title Provable Convex Co-clustering of Tensors
title_full Provable Convex Co-clustering of Tensors
title_fullStr Provable Convex Co-clustering of Tensors
title_full_unstemmed Provable Convex Co-clustering of Tensors
title_short Provable Convex Co-clustering of Tensors
title_sort provable convex co-clustering of tensors
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7731944/
https://www.ncbi.nlm.nih.gov/pubmed/33312074
work_keys_str_mv AT chiericc provableconvexcoclusteringoftensors
AT gainesbrianr provableconvexcoclusteringoftensors
AT sunwillwei provableconvexcoclusteringoftensors
AT zhouhua provableconvexcoclusteringoftensors
AT yangjian provableconvexcoclusteringoftensors