Cargando…

Sparse Graph Regularization Non-Negative Matrix Factorization Based on Huber Loss Model for Cancer Data Analysis

Non-negative matrix factorization (NMF) is a matrix decomposition method based on the square loss function. To exploit cancer information, cancer gene expression data often uses the NMF method to reduce dimensionality. Gene expression data usually have some noise and outliers, while the original NMF...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Chuan-Yuan, Liu, Jin-Xing, Yu, Na, Zheng, Chun-Hou
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6882287/
https://www.ncbi.nlm.nih.gov/pubmed/31824556
http://dx.doi.org/10.3389/fgene.2019.01054
_version_ 1783474124548472832
author Wang, Chuan-Yuan
Liu, Jin-Xing
Yu, Na
Zheng, Chun-Hou
author_facet Wang, Chuan-Yuan
Liu, Jin-Xing
Yu, Na
Zheng, Chun-Hou
author_sort Wang, Chuan-Yuan
collection PubMed
description Non-negative matrix factorization (NMF) is a matrix decomposition method based on the square loss function. To exploit cancer information, cancer gene expression data often uses the NMF method to reduce dimensionality. Gene expression data usually have some noise and outliers, while the original NMF loss function is very sensitive to non-Gaussian noise. To improve the robustness and clustering performance of the algorithm, we propose a sparse graph regularization NMF based on Huber loss model for cancer data analysis (Huber-SGNMF). Huber loss is a function between L (1)-norm and L (2)-norm that can effectively handle non-Gaussian noise and outliers. Taking into account the sparsity matrix and data geometry information, sparse penalty and graph regularization terms are introduced into the model to enhance matrix sparsity and capture data manifold structure. Before the experiment, we first analyzed the robustness of Huber-SGNMF and other models. Experiments on The Cancer Genome Atlas (TCGA) data have shown that Huber-SGNMF performs better than other most advanced methods in sample clustering and differentially expressed gene selection.
format Online
Article
Text
id pubmed-6882287
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-68822872019-12-10 Sparse Graph Regularization Non-Negative Matrix Factorization Based on Huber Loss Model for Cancer Data Analysis Wang, Chuan-Yuan Liu, Jin-Xing Yu, Na Zheng, Chun-Hou Front Genet Genetics Non-negative matrix factorization (NMF) is a matrix decomposition method based on the square loss function. To exploit cancer information, cancer gene expression data often uses the NMF method to reduce dimensionality. Gene expression data usually have some noise and outliers, while the original NMF loss function is very sensitive to non-Gaussian noise. To improve the robustness and clustering performance of the algorithm, we propose a sparse graph regularization NMF based on Huber loss model for cancer data analysis (Huber-SGNMF). Huber loss is a function between L (1)-norm and L (2)-norm that can effectively handle non-Gaussian noise and outliers. Taking into account the sparsity matrix and data geometry information, sparse penalty and graph regularization terms are introduced into the model to enhance matrix sparsity and capture data manifold structure. Before the experiment, we first analyzed the robustness of Huber-SGNMF and other models. Experiments on The Cancer Genome Atlas (TCGA) data have shown that Huber-SGNMF performs better than other most advanced methods in sample clustering and differentially expressed gene selection. Frontiers Media S.A. 2019-11-20 /pmc/articles/PMC6882287/ /pubmed/31824556 http://dx.doi.org/10.3389/fgene.2019.01054 Text en Copyright © 2019 Wang, Liu, Yu and Zheng http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Wang, Chuan-Yuan
Liu, Jin-Xing
Yu, Na
Zheng, Chun-Hou
Sparse Graph Regularization Non-Negative Matrix Factorization Based on Huber Loss Model for Cancer Data Analysis
title Sparse Graph Regularization Non-Negative Matrix Factorization Based on Huber Loss Model for Cancer Data Analysis
title_full Sparse Graph Regularization Non-Negative Matrix Factorization Based on Huber Loss Model for Cancer Data Analysis
title_fullStr Sparse Graph Regularization Non-Negative Matrix Factorization Based on Huber Loss Model for Cancer Data Analysis
title_full_unstemmed Sparse Graph Regularization Non-Negative Matrix Factorization Based on Huber Loss Model for Cancer Data Analysis
title_short Sparse Graph Regularization Non-Negative Matrix Factorization Based on Huber Loss Model for Cancer Data Analysis
title_sort sparse graph regularization non-negative matrix factorization based on huber loss model for cancer data analysis
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6882287/
https://www.ncbi.nlm.nih.gov/pubmed/31824556
http://dx.doi.org/10.3389/fgene.2019.01054
work_keys_str_mv AT wangchuanyuan sparsegraphregularizationnonnegativematrixfactorizationbasedonhuberlossmodelforcancerdataanalysis
AT liujinxing sparsegraphregularizationnonnegativematrixfactorizationbasedonhuberlossmodelforcancerdataanalysis
AT yuna sparsegraphregularizationnonnegativematrixfactorizationbasedonhuberlossmodelforcancerdataanalysis
AT zhengchunhou sparsegraphregularizationnonnegativematrixfactorizationbasedonhuberlossmodelforcancerdataanalysis