Cargando…

Gene Ranking of RNA-Seq Data via Discriminant Non-Negative Matrix Factorization

RNA-sequencing is rapidly becoming the method of choice for studying the full complexity of transcriptomes, however with increasing dimensionality, accurate gene ranking is becoming increasingly challenging. This paper proposes an accurate and sensitive gene ranking method that implements discrimina...

Descripción completa

Detalles Bibliográficos
Autores principales: Jia, Zhilong, Zhang, Xiang, Guan, Naiyang, Bo, Xiaochen, Barnes, Michael R., Luo, Zhigang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4562600/
https://www.ncbi.nlm.nih.gov/pubmed/26348772
http://dx.doi.org/10.1371/journal.pone.0137782
_version_ 1782389179073691648
author Jia, Zhilong
Zhang, Xiang
Guan, Naiyang
Bo, Xiaochen
Barnes, Michael R.
Luo, Zhigang
author_facet Jia, Zhilong
Zhang, Xiang
Guan, Naiyang
Bo, Xiaochen
Barnes, Michael R.
Luo, Zhigang
author_sort Jia, Zhilong
collection PubMed
description RNA-sequencing is rapidly becoming the method of choice for studying the full complexity of transcriptomes, however with increasing dimensionality, accurate gene ranking is becoming increasingly challenging. This paper proposes an accurate and sensitive gene ranking method that implements discriminant non-negative matrix factorization (DNMF) for RNA-seq data. To the best of our knowledge, this is the first work to explore the utility of DNMF for gene ranking. When incorporating Fisher’s discriminant criteria and setting the reduced dimension as two, DNMF learns two factors to approximate the original gene expression data, abstracting the up-regulated or down-regulated metagene by using the sample label information. The first factor denotes all the genes’ weights of two metagenes as the additive combination of all genes, while the second learned factor represents the expression values of two metagenes. In the gene ranking stage, all the genes are ranked as a descending sequence according to the differential values of the metagene weights. Leveraging the nature of NMF and Fisher’s criterion, DNMF can robustly boost the gene ranking performance. The Area Under the Curve analysis of differential expression analysis on two benchmarking tests of four RNA-seq data sets with similar phenotypes showed that our proposed DNMF-based gene ranking method outperforms other widely used methods. Moreover, the Gene Set Enrichment Analysis also showed DNMF outweighs others. DNMF is also computationally efficient, substantially outperforming all other benchmarked methods. Consequently, we suggest DNMF is an effective method for the analysis of differential gene expression and gene ranking for RNA-seq data.
format Online
Article
Text
id pubmed-4562600
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-45626002015-09-10 Gene Ranking of RNA-Seq Data via Discriminant Non-Negative Matrix Factorization Jia, Zhilong Zhang, Xiang Guan, Naiyang Bo, Xiaochen Barnes, Michael R. Luo, Zhigang PLoS One Research Article RNA-sequencing is rapidly becoming the method of choice for studying the full complexity of transcriptomes, however with increasing dimensionality, accurate gene ranking is becoming increasingly challenging. This paper proposes an accurate and sensitive gene ranking method that implements discriminant non-negative matrix factorization (DNMF) for RNA-seq data. To the best of our knowledge, this is the first work to explore the utility of DNMF for gene ranking. When incorporating Fisher’s discriminant criteria and setting the reduced dimension as two, DNMF learns two factors to approximate the original gene expression data, abstracting the up-regulated or down-regulated metagene by using the sample label information. The first factor denotes all the genes’ weights of two metagenes as the additive combination of all genes, while the second learned factor represents the expression values of two metagenes. In the gene ranking stage, all the genes are ranked as a descending sequence according to the differential values of the metagene weights. Leveraging the nature of NMF and Fisher’s criterion, DNMF can robustly boost the gene ranking performance. The Area Under the Curve analysis of differential expression analysis on two benchmarking tests of four RNA-seq data sets with similar phenotypes showed that our proposed DNMF-based gene ranking method outperforms other widely used methods. Moreover, the Gene Set Enrichment Analysis also showed DNMF outweighs others. DNMF is also computationally efficient, substantially outperforming all other benchmarked methods. Consequently, we suggest DNMF is an effective method for the analysis of differential gene expression and gene ranking for RNA-seq data. Public Library of Science 2015-09-08 /pmc/articles/PMC4562600/ /pubmed/26348772 http://dx.doi.org/10.1371/journal.pone.0137782 Text en © 2015 Jia et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Jia, Zhilong
Zhang, Xiang
Guan, Naiyang
Bo, Xiaochen
Barnes, Michael R.
Luo, Zhigang
Gene Ranking of RNA-Seq Data via Discriminant Non-Negative Matrix Factorization
title Gene Ranking of RNA-Seq Data via Discriminant Non-Negative Matrix Factorization
title_full Gene Ranking of RNA-Seq Data via Discriminant Non-Negative Matrix Factorization
title_fullStr Gene Ranking of RNA-Seq Data via Discriminant Non-Negative Matrix Factorization
title_full_unstemmed Gene Ranking of RNA-Seq Data via Discriminant Non-Negative Matrix Factorization
title_short Gene Ranking of RNA-Seq Data via Discriminant Non-Negative Matrix Factorization
title_sort gene ranking of rna-seq data via discriminant non-negative matrix factorization
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4562600/
https://www.ncbi.nlm.nih.gov/pubmed/26348772
http://dx.doi.org/10.1371/journal.pone.0137782
work_keys_str_mv AT jiazhilong generankingofrnaseqdataviadiscriminantnonnegativematrixfactorization
AT zhangxiang generankingofrnaseqdataviadiscriminantnonnegativematrixfactorization
AT guannaiyang generankingofrnaseqdataviadiscriminantnonnegativematrixfactorization
AT boxiaochen generankingofrnaseqdataviadiscriminantnonnegativematrixfactorization
AT barnesmichaelr generankingofrnaseqdataviadiscriminantnonnegativematrixfactorization
AT luozhigang generankingofrnaseqdataviadiscriminantnonnegativematrixfactorization