Cargando…
Provable Convex Co-clustering of Tensors
Cluster analysis is a fundamental tool for pattern discovery of complex heterogeneous data. Prevalent clustering methods mainly focus on vector or matrix-variate data and are not applicable to general-order tensors, which arise frequently in modern scientific and business applications. Moreover, the...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7731944/ https://www.ncbi.nlm.nih.gov/pubmed/33312074 |
_version_ | 1783621997118357504 |
---|---|
author | Chi, Eric C. Gaines, Brian R. Sun, Will Wei Zhou, Hua Yang, Jian |
author_facet | Chi, Eric C. Gaines, Brian R. Sun, Will Wei Zhou, Hua Yang, Jian |
author_sort | Chi, Eric C. |
collection | PubMed |
description | Cluster analysis is a fundamental tool for pattern discovery of complex heterogeneous data. Prevalent clustering methods mainly focus on vector or matrix-variate data and are not applicable to general-order tensors, which arise frequently in modern scientific and business applications. Moreover, there is a gap between statistical guarantees and computational efficiency for existing tensor clustering solutions due to the nature of their non-convex formulations. In this work, we bridge this gap by developing a provable convex formulation of tensor co-clustering. Our convex co-clustering (CoCo) estimator enjoys stability guarantees and its computational and storage costs are polynomial in the size of the data. We further establish a non-asymptotic error bound for the CoCo estimator, which reveals a surprising “blessing of dimensionality” phenomenon that does not exist in vector or matrix-variate cluster analysis. Our theoretical findings are supported by extensive simulated studies. Finally, we apply the CoCo estimator to the cluster analysis of advertisement click tensor data from a major online company. Our clustering results provide meaningful business insights to improve advertising effectiveness. |
format | Online Article Text |
id | pubmed-7731944 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-77319442020-12-11 Provable Convex Co-clustering of Tensors Chi, Eric C. Gaines, Brian R. Sun, Will Wei Zhou, Hua Yang, Jian J Mach Learn Res Article Cluster analysis is a fundamental tool for pattern discovery of complex heterogeneous data. Prevalent clustering methods mainly focus on vector or matrix-variate data and are not applicable to general-order tensors, which arise frequently in modern scientific and business applications. Moreover, there is a gap between statistical guarantees and computational efficiency for existing tensor clustering solutions due to the nature of their non-convex formulations. In this work, we bridge this gap by developing a provable convex formulation of tensor co-clustering. Our convex co-clustering (CoCo) estimator enjoys stability guarantees and its computational and storage costs are polynomial in the size of the data. We further establish a non-asymptotic error bound for the CoCo estimator, which reveals a surprising “blessing of dimensionality” phenomenon that does not exist in vector or matrix-variate cluster analysis. Our theoretical findings are supported by extensive simulated studies. Finally, we apply the CoCo estimator to the cluster analysis of advertisement click tensor data from a major online company. Our clustering results provide meaningful business insights to improve advertising effectiveness. 2020 /pmc/articles/PMC7731944/ /pubmed/33312074 Text en https://creativecommons.org/licenses/by/4.0/License: CC-BY 4.0, see https://creativecommons.org/licenses/by/4.0/. (https://creativecommons.org/licenses/by/4.0/) Attribution requirements are provided at http://jmlr.org/papers/v21/18-155.html. (http://jmlr.org/papers/v21/18-155.html) |
spellingShingle | Article Chi, Eric C. Gaines, Brian R. Sun, Will Wei Zhou, Hua Yang, Jian Provable Convex Co-clustering of Tensors |
title | Provable Convex Co-clustering of Tensors |
title_full | Provable Convex Co-clustering of Tensors |
title_fullStr | Provable Convex Co-clustering of Tensors |
title_full_unstemmed | Provable Convex Co-clustering of Tensors |
title_short | Provable Convex Co-clustering of Tensors |
title_sort | provable convex co-clustering of tensors |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7731944/ https://www.ncbi.nlm.nih.gov/pubmed/33312074 |
work_keys_str_mv | AT chiericc provableconvexcoclusteringoftensors AT gainesbrianr provableconvexcoclusteringoftensors AT sunwillwei provableconvexcoclusteringoftensors AT zhouhua provableconvexcoclusteringoftensors AT yangjian provableconvexcoclusteringoftensors |