Cargando…
Gene Ranking of RNA-Seq Data via Discriminant Non-Negative Matrix Factorization
RNA-sequencing is rapidly becoming the method of choice for studying the full complexity of transcriptomes, however with increasing dimensionality, accurate gene ranking is becoming increasingly challenging. This paper proposes an accurate and sensitive gene ranking method that implements discrimina...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4562600/ https://www.ncbi.nlm.nih.gov/pubmed/26348772 http://dx.doi.org/10.1371/journal.pone.0137782 |
_version_ | 1782389179073691648 |
---|---|
author | Jia, Zhilong Zhang, Xiang Guan, Naiyang Bo, Xiaochen Barnes, Michael R. Luo, Zhigang |
author_facet | Jia, Zhilong Zhang, Xiang Guan, Naiyang Bo, Xiaochen Barnes, Michael R. Luo, Zhigang |
author_sort | Jia, Zhilong |
collection | PubMed |
description | RNA-sequencing is rapidly becoming the method of choice for studying the full complexity of transcriptomes, however with increasing dimensionality, accurate gene ranking is becoming increasingly challenging. This paper proposes an accurate and sensitive gene ranking method that implements discriminant non-negative matrix factorization (DNMF) for RNA-seq data. To the best of our knowledge, this is the first work to explore the utility of DNMF for gene ranking. When incorporating Fisher’s discriminant criteria and setting the reduced dimension as two, DNMF learns two factors to approximate the original gene expression data, abstracting the up-regulated or down-regulated metagene by using the sample label information. The first factor denotes all the genes’ weights of two metagenes as the additive combination of all genes, while the second learned factor represents the expression values of two metagenes. In the gene ranking stage, all the genes are ranked as a descending sequence according to the differential values of the metagene weights. Leveraging the nature of NMF and Fisher’s criterion, DNMF can robustly boost the gene ranking performance. The Area Under the Curve analysis of differential expression analysis on two benchmarking tests of four RNA-seq data sets with similar phenotypes showed that our proposed DNMF-based gene ranking method outperforms other widely used methods. Moreover, the Gene Set Enrichment Analysis also showed DNMF outweighs others. DNMF is also computationally efficient, substantially outperforming all other benchmarked methods. Consequently, we suggest DNMF is an effective method for the analysis of differential gene expression and gene ranking for RNA-seq data. |
format | Online Article Text |
id | pubmed-4562600 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-45626002015-09-10 Gene Ranking of RNA-Seq Data via Discriminant Non-Negative Matrix Factorization Jia, Zhilong Zhang, Xiang Guan, Naiyang Bo, Xiaochen Barnes, Michael R. Luo, Zhigang PLoS One Research Article RNA-sequencing is rapidly becoming the method of choice for studying the full complexity of transcriptomes, however with increasing dimensionality, accurate gene ranking is becoming increasingly challenging. This paper proposes an accurate and sensitive gene ranking method that implements discriminant non-negative matrix factorization (DNMF) for RNA-seq data. To the best of our knowledge, this is the first work to explore the utility of DNMF for gene ranking. When incorporating Fisher’s discriminant criteria and setting the reduced dimension as two, DNMF learns two factors to approximate the original gene expression data, abstracting the up-regulated or down-regulated metagene by using the sample label information. The first factor denotes all the genes’ weights of two metagenes as the additive combination of all genes, while the second learned factor represents the expression values of two metagenes. In the gene ranking stage, all the genes are ranked as a descending sequence according to the differential values of the metagene weights. Leveraging the nature of NMF and Fisher’s criterion, DNMF can robustly boost the gene ranking performance. The Area Under the Curve analysis of differential expression analysis on two benchmarking tests of four RNA-seq data sets with similar phenotypes showed that our proposed DNMF-based gene ranking method outperforms other widely used methods. Moreover, the Gene Set Enrichment Analysis also showed DNMF outweighs others. DNMF is also computationally efficient, substantially outperforming all other benchmarked methods. Consequently, we suggest DNMF is an effective method for the analysis of differential gene expression and gene ranking for RNA-seq data. Public Library of Science 2015-09-08 /pmc/articles/PMC4562600/ /pubmed/26348772 http://dx.doi.org/10.1371/journal.pone.0137782 Text en © 2015 Jia et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Jia, Zhilong Zhang, Xiang Guan, Naiyang Bo, Xiaochen Barnes, Michael R. Luo, Zhigang Gene Ranking of RNA-Seq Data via Discriminant Non-Negative Matrix Factorization |
title | Gene Ranking of RNA-Seq Data via Discriminant Non-Negative Matrix Factorization |
title_full | Gene Ranking of RNA-Seq Data via Discriminant Non-Negative Matrix Factorization |
title_fullStr | Gene Ranking of RNA-Seq Data via Discriminant Non-Negative Matrix Factorization |
title_full_unstemmed | Gene Ranking of RNA-Seq Data via Discriminant Non-Negative Matrix Factorization |
title_short | Gene Ranking of RNA-Seq Data via Discriminant Non-Negative Matrix Factorization |
title_sort | gene ranking of rna-seq data via discriminant non-negative matrix factorization |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4562600/ https://www.ncbi.nlm.nih.gov/pubmed/26348772 http://dx.doi.org/10.1371/journal.pone.0137782 |
work_keys_str_mv | AT jiazhilong generankingofrnaseqdataviadiscriminantnonnegativematrixfactorization AT zhangxiang generankingofrnaseqdataviadiscriminantnonnegativematrixfactorization AT guannaiyang generankingofrnaseqdataviadiscriminantnonnegativematrixfactorization AT boxiaochen generankingofrnaseqdataviadiscriminantnonnegativematrixfactorization AT barnesmichaelr generankingofrnaseqdataviadiscriminantnonnegativematrixfactorization AT luozhigang generankingofrnaseqdataviadiscriminantnonnegativematrixfactorization |