Cargando…
Sparse Graph Regularization Non-Negative Matrix Factorization Based on Huber Loss Model for Cancer Data Analysis
Non-negative matrix factorization (NMF) is a matrix decomposition method based on the square loss function. To exploit cancer information, cancer gene expression data often uses the NMF method to reduce dimensionality. Gene expression data usually have some noise and outliers, while the original NMF...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6882287/ https://www.ncbi.nlm.nih.gov/pubmed/31824556 http://dx.doi.org/10.3389/fgene.2019.01054 |
_version_ | 1783474124548472832 |
---|---|
author | Wang, Chuan-Yuan Liu, Jin-Xing Yu, Na Zheng, Chun-Hou |
author_facet | Wang, Chuan-Yuan Liu, Jin-Xing Yu, Na Zheng, Chun-Hou |
author_sort | Wang, Chuan-Yuan |
collection | PubMed |
description | Non-negative matrix factorization (NMF) is a matrix decomposition method based on the square loss function. To exploit cancer information, cancer gene expression data often uses the NMF method to reduce dimensionality. Gene expression data usually have some noise and outliers, while the original NMF loss function is very sensitive to non-Gaussian noise. To improve the robustness and clustering performance of the algorithm, we propose a sparse graph regularization NMF based on Huber loss model for cancer data analysis (Huber-SGNMF). Huber loss is a function between L (1)-norm and L (2)-norm that can effectively handle non-Gaussian noise and outliers. Taking into account the sparsity matrix and data geometry information, sparse penalty and graph regularization terms are introduced into the model to enhance matrix sparsity and capture data manifold structure. Before the experiment, we first analyzed the robustness of Huber-SGNMF and other models. Experiments on The Cancer Genome Atlas (TCGA) data have shown that Huber-SGNMF performs better than other most advanced methods in sample clustering and differentially expressed gene selection. |
format | Online Article Text |
id | pubmed-6882287 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-68822872019-12-10 Sparse Graph Regularization Non-Negative Matrix Factorization Based on Huber Loss Model for Cancer Data Analysis Wang, Chuan-Yuan Liu, Jin-Xing Yu, Na Zheng, Chun-Hou Front Genet Genetics Non-negative matrix factorization (NMF) is a matrix decomposition method based on the square loss function. To exploit cancer information, cancer gene expression data often uses the NMF method to reduce dimensionality. Gene expression data usually have some noise and outliers, while the original NMF loss function is very sensitive to non-Gaussian noise. To improve the robustness and clustering performance of the algorithm, we propose a sparse graph regularization NMF based on Huber loss model for cancer data analysis (Huber-SGNMF). Huber loss is a function between L (1)-norm and L (2)-norm that can effectively handle non-Gaussian noise and outliers. Taking into account the sparsity matrix and data geometry information, sparse penalty and graph regularization terms are introduced into the model to enhance matrix sparsity and capture data manifold structure. Before the experiment, we first analyzed the robustness of Huber-SGNMF and other models. Experiments on The Cancer Genome Atlas (TCGA) data have shown that Huber-SGNMF performs better than other most advanced methods in sample clustering and differentially expressed gene selection. Frontiers Media S.A. 2019-11-20 /pmc/articles/PMC6882287/ /pubmed/31824556 http://dx.doi.org/10.3389/fgene.2019.01054 Text en Copyright © 2019 Wang, Liu, Yu and Zheng http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Wang, Chuan-Yuan Liu, Jin-Xing Yu, Na Zheng, Chun-Hou Sparse Graph Regularization Non-Negative Matrix Factorization Based on Huber Loss Model for Cancer Data Analysis |
title | Sparse Graph Regularization Non-Negative Matrix Factorization Based on Huber Loss Model for Cancer Data Analysis |
title_full | Sparse Graph Regularization Non-Negative Matrix Factorization Based on Huber Loss Model for Cancer Data Analysis |
title_fullStr | Sparse Graph Regularization Non-Negative Matrix Factorization Based on Huber Loss Model for Cancer Data Analysis |
title_full_unstemmed | Sparse Graph Regularization Non-Negative Matrix Factorization Based on Huber Loss Model for Cancer Data Analysis |
title_short | Sparse Graph Regularization Non-Negative Matrix Factorization Based on Huber Loss Model for Cancer Data Analysis |
title_sort | sparse graph regularization non-negative matrix factorization based on huber loss model for cancer data analysis |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6882287/ https://www.ncbi.nlm.nih.gov/pubmed/31824556 http://dx.doi.org/10.3389/fgene.2019.01054 |
work_keys_str_mv | AT wangchuanyuan sparsegraphregularizationnonnegativematrixfactorizationbasedonhuberlossmodelforcancerdataanalysis AT liujinxing sparsegraphregularizationnonnegativematrixfactorizationbasedonhuberlossmodelforcancerdataanalysis AT yuna sparsegraphregularizationnonnegativematrixfactorizationbasedonhuberlossmodelforcancerdataanalysis AT zhengchunhou sparsegraphregularizationnonnegativematrixfactorizationbasedonhuberlossmodelforcancerdataanalysis |