Cargando…
Model Selection for Non-Negative Tensor Factorization with Minimum Description Length
Non-negative tensor factorization (NTF) is a widely used multi-way analysis approach that factorizes a high-order non-negative data tensor into several non-negative factor matrices. In NTF, the non-negative rank has to be predetermined to specify the model and it greatly influences the factorized ma...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7515125/ https://www.ncbi.nlm.nih.gov/pubmed/33267345 http://dx.doi.org/10.3390/e21070632 |
_version_ | 1783586746919813120 |
---|---|
author | Fu, Yunhui Matsushima, Shin Yamanishi, Kenji |
author_facet | Fu, Yunhui Matsushima, Shin Yamanishi, Kenji |
author_sort | Fu, Yunhui |
collection | PubMed |
description | Non-negative tensor factorization (NTF) is a widely used multi-way analysis approach that factorizes a high-order non-negative data tensor into several non-negative factor matrices. In NTF, the non-negative rank has to be predetermined to specify the model and it greatly influences the factorized matrices. However, its value is conventionally determined by specialists’ insights or trial and error. This paper proposes a novel rank selection criterion for NTF on the basis of the minimum description length (MDL) principle. Our methodology is unique in that (1) we apply the MDL principle on tensor slices to overcome a problem caused by the imbalance between the number of elements in a data tensor and that in factor matrices, and (2) we employ the normalized maximum likelihood (NML) code-length for histogram densities. We employ synthetic and real data to empirically demonstrate that our method outperforms other criteria in terms of accuracies for estimating true ranks and for completing missing values. We further show that our method can produce ranks suitable for knowledge discovery. |
format | Online Article Text |
id | pubmed-7515125 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-75151252020-11-09 Model Selection for Non-Negative Tensor Factorization with Minimum Description Length Fu, Yunhui Matsushima, Shin Yamanishi, Kenji Entropy (Basel) Article Non-negative tensor factorization (NTF) is a widely used multi-way analysis approach that factorizes a high-order non-negative data tensor into several non-negative factor matrices. In NTF, the non-negative rank has to be predetermined to specify the model and it greatly influences the factorized matrices. However, its value is conventionally determined by specialists’ insights or trial and error. This paper proposes a novel rank selection criterion for NTF on the basis of the minimum description length (MDL) principle. Our methodology is unique in that (1) we apply the MDL principle on tensor slices to overcome a problem caused by the imbalance between the number of elements in a data tensor and that in factor matrices, and (2) we employ the normalized maximum likelihood (NML) code-length for histogram densities. We employ synthetic and real data to empirically demonstrate that our method outperforms other criteria in terms of accuracies for estimating true ranks and for completing missing values. We further show that our method can produce ranks suitable for knowledge discovery. MDPI 2019-06-27 /pmc/articles/PMC7515125/ /pubmed/33267345 http://dx.doi.org/10.3390/e21070632 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Fu, Yunhui Matsushima, Shin Yamanishi, Kenji Model Selection for Non-Negative Tensor Factorization with Minimum Description Length |
title | Model Selection for Non-Negative Tensor Factorization with Minimum Description Length |
title_full | Model Selection for Non-Negative Tensor Factorization with Minimum Description Length |
title_fullStr | Model Selection for Non-Negative Tensor Factorization with Minimum Description Length |
title_full_unstemmed | Model Selection for Non-Negative Tensor Factorization with Minimum Description Length |
title_short | Model Selection for Non-Negative Tensor Factorization with Minimum Description Length |
title_sort | model selection for non-negative tensor factorization with minimum description length |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7515125/ https://www.ncbi.nlm.nih.gov/pubmed/33267345 http://dx.doi.org/10.3390/e21070632 |
work_keys_str_mv | AT fuyunhui modelselectionfornonnegativetensorfactorizationwithminimumdescriptionlength AT matsushimashin modelselectionfornonnegativetensorfactorizationwithminimumdescriptionlength AT yamanishikenji modelselectionfornonnegativetensorfactorizationwithminimumdescriptionlength |