Cargando…

Non-negative matrix factorization by maximizing correntropy for cancer clustering

BACKGROUND: Non-negative matrix factorization (NMF) has been shown to be a powerful tool for clustering gene expression data, which are widely used to classify cancers. NMF aims to find two non-negative matrices whose product closely approximates the original matrix. Traditional NMF methods minimize...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Jim Jing-Yan, Wang, Xiaolei, Gao, Xin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3659102/
https://www.ncbi.nlm.nih.gov/pubmed/23522344
http://dx.doi.org/10.1186/1471-2105-14-107
_version_ 1782270400851345408
author Wang, Jim Jing-Yan
Wang, Xiaolei
Gao, Xin
author_facet Wang, Jim Jing-Yan
Wang, Xiaolei
Gao, Xin
author_sort Wang, Jim Jing-Yan
collection PubMed
description BACKGROUND: Non-negative matrix factorization (NMF) has been shown to be a powerful tool for clustering gene expression data, which are widely used to classify cancers. NMF aims to find two non-negative matrices whose product closely approximates the original matrix. Traditional NMF methods minimize either the l(2) norm or the Kullback-Leibler distance between the product of the two matrices and the original matrix. Correntropy was recently shown to be an effective similarity measurement due to its stability to outliers or noise. RESULTS: We propose a maximum correntropy criterion (MCC)-based NMF method (NMF-MCC) for gene expression data-based cancer clustering. Instead of minimizing the l(2) norm or the Kullback-Leibler distance, NMF-MCC maximizes the correntropy between the product of the two matrices and the original matrix. The optimization problem can be solved by an expectation conditional maximization algorithm. CONCLUSIONS: Extensive experiments on six cancer benchmark sets demonstrate that the proposed method is significantly more accurate than the state-of-the-art methods in cancer clustering.
format Online
Article
Text
id pubmed-3659102
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36591022013-05-23 Non-negative matrix factorization by maximizing correntropy for cancer clustering Wang, Jim Jing-Yan Wang, Xiaolei Gao, Xin BMC Bioinformatics Methodology Article BACKGROUND: Non-negative matrix factorization (NMF) has been shown to be a powerful tool for clustering gene expression data, which are widely used to classify cancers. NMF aims to find two non-negative matrices whose product closely approximates the original matrix. Traditional NMF methods minimize either the l(2) norm or the Kullback-Leibler distance between the product of the two matrices and the original matrix. Correntropy was recently shown to be an effective similarity measurement due to its stability to outliers or noise. RESULTS: We propose a maximum correntropy criterion (MCC)-based NMF method (NMF-MCC) for gene expression data-based cancer clustering. Instead of minimizing the l(2) norm or the Kullback-Leibler distance, NMF-MCC maximizes the correntropy between the product of the two matrices and the original matrix. The optimization problem can be solved by an expectation conditional maximization algorithm. CONCLUSIONS: Extensive experiments on six cancer benchmark sets demonstrate that the proposed method is significantly more accurate than the state-of-the-art methods in cancer clustering. BioMed Central 2013-03-24 /pmc/articles/PMC3659102/ /pubmed/23522344 http://dx.doi.org/10.1186/1471-2105-14-107 Text en Copyright © 2013 Wang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Wang, Jim Jing-Yan
Wang, Xiaolei
Gao, Xin
Non-negative matrix factorization by maximizing correntropy for cancer clustering
title Non-negative matrix factorization by maximizing correntropy for cancer clustering
title_full Non-negative matrix factorization by maximizing correntropy for cancer clustering
title_fullStr Non-negative matrix factorization by maximizing correntropy for cancer clustering
title_full_unstemmed Non-negative matrix factorization by maximizing correntropy for cancer clustering
title_short Non-negative matrix factorization by maximizing correntropy for cancer clustering
title_sort non-negative matrix factorization by maximizing correntropy for cancer clustering
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3659102/
https://www.ncbi.nlm.nih.gov/pubmed/23522344
http://dx.doi.org/10.1186/1471-2105-14-107
work_keys_str_mv AT wangjimjingyan nonnegativematrixfactorizationbymaximizingcorrentropyforcancerclustering
AT wangxiaolei nonnegativematrixfactorizationbymaximizingcorrentropyforcancerclustering
AT gaoxin nonnegativematrixfactorizationbymaximizingcorrentropyforcancerclustering