Cargando…

Model Selection for Non-Negative Tensor Factorization with Minimum Description Length

Non-negative tensor factorization (NTF) is a widely used multi-way analysis approach that factorizes a high-order non-negative data tensor into several non-negative factor matrices. In NTF, the non-negative rank has to be predetermined to specify the model and it greatly influences the factorized ma...

Descripción completa

Detalles Bibliográficos
Autores principales: Fu, Yunhui, Matsushima, Shin, Yamanishi, Kenji
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7515125/
https://www.ncbi.nlm.nih.gov/pubmed/33267345
http://dx.doi.org/10.3390/e21070632
_version_ 1783586746919813120
author Fu, Yunhui
Matsushima, Shin
Yamanishi, Kenji
author_facet Fu, Yunhui
Matsushima, Shin
Yamanishi, Kenji
author_sort Fu, Yunhui
collection PubMed
description Non-negative tensor factorization (NTF) is a widely used multi-way analysis approach that factorizes a high-order non-negative data tensor into several non-negative factor matrices. In NTF, the non-negative rank has to be predetermined to specify the model and it greatly influences the factorized matrices. However, its value is conventionally determined by specialists’ insights or trial and error. This paper proposes a novel rank selection criterion for NTF on the basis of the minimum description length (MDL) principle. Our methodology is unique in that (1) we apply the MDL principle on tensor slices to overcome a problem caused by the imbalance between the number of elements in a data tensor and that in factor matrices, and (2) we employ the normalized maximum likelihood (NML) code-length for histogram densities. We employ synthetic and real data to empirically demonstrate that our method outperforms other criteria in terms of accuracies for estimating true ranks and for completing missing values. We further show that our method can produce ranks suitable for knowledge discovery.
format Online
Article
Text
id pubmed-7515125
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75151252020-11-09 Model Selection for Non-Negative Tensor Factorization with Minimum Description Length Fu, Yunhui Matsushima, Shin Yamanishi, Kenji Entropy (Basel) Article Non-negative tensor factorization (NTF) is a widely used multi-way analysis approach that factorizes a high-order non-negative data tensor into several non-negative factor matrices. In NTF, the non-negative rank has to be predetermined to specify the model and it greatly influences the factorized matrices. However, its value is conventionally determined by specialists’ insights or trial and error. This paper proposes a novel rank selection criterion for NTF on the basis of the minimum description length (MDL) principle. Our methodology is unique in that (1) we apply the MDL principle on tensor slices to overcome a problem caused by the imbalance between the number of elements in a data tensor and that in factor matrices, and (2) we employ the normalized maximum likelihood (NML) code-length for histogram densities. We employ synthetic and real data to empirically demonstrate that our method outperforms other criteria in terms of accuracies for estimating true ranks and for completing missing values. We further show that our method can produce ranks suitable for knowledge discovery. MDPI 2019-06-27 /pmc/articles/PMC7515125/ /pubmed/33267345 http://dx.doi.org/10.3390/e21070632 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Fu, Yunhui
Matsushima, Shin
Yamanishi, Kenji
Model Selection for Non-Negative Tensor Factorization with Minimum Description Length
title Model Selection for Non-Negative Tensor Factorization with Minimum Description Length
title_full Model Selection for Non-Negative Tensor Factorization with Minimum Description Length
title_fullStr Model Selection for Non-Negative Tensor Factorization with Minimum Description Length
title_full_unstemmed Model Selection for Non-Negative Tensor Factorization with Minimum Description Length
title_short Model Selection for Non-Negative Tensor Factorization with Minimum Description Length
title_sort model selection for non-negative tensor factorization with minimum description length
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7515125/
https://www.ncbi.nlm.nih.gov/pubmed/33267345
http://dx.doi.org/10.3390/e21070632
work_keys_str_mv AT fuyunhui modelselectionfornonnegativetensorfactorizationwithminimumdescriptionlength
AT matsushimashin modelselectionfornonnegativetensorfactorizationwithminimumdescriptionlength
AT yamanishikenji modelselectionfornonnegativetensorfactorizationwithminimumdescriptionlength