Cargando…
A Class-Information-Based Penalized Matrix Decomposition for Identifying Plants Core Genes Responding to Abiotic Stresses
In terms of making genes expression data more interpretable and comprehensible, there exists a significant superiority on sparse methods. Many sparse methods, such as penalized matrix decomposition (PMD) and sparse principal component analysis (SPCA), have been applied to extract plants core genes....
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4152128/ https://www.ncbi.nlm.nih.gov/pubmed/25180509 http://dx.doi.org/10.1371/journal.pone.0106097 |
_version_ | 1782333082940997632 |
---|---|
author | Liu, Jin-Xing Liu, Jian Gao, Ying-Lian Mi, Jian-Xun Ma, Chun-Xia Wang, Dong |
author_facet | Liu, Jin-Xing Liu, Jian Gao, Ying-Lian Mi, Jian-Xun Ma, Chun-Xia Wang, Dong |
author_sort | Liu, Jin-Xing |
collection | PubMed |
description | In terms of making genes expression data more interpretable and comprehensible, there exists a significant superiority on sparse methods. Many sparse methods, such as penalized matrix decomposition (PMD) and sparse principal component analysis (SPCA), have been applied to extract plants core genes. Supervised algorithms, especially the support vector machine-recursive feature elimination (SVM-RFE) method, always have good performance in gene selection. In this paper, we draw into class information via the total scatter matrix and put forward a class-information-based penalized matrix decomposition (CIPMD) method to improve the gene identification performance of PMD-based method. Firstly, the total scatter matrix is obtained based on different samples of the gene expression data. Secondly, a new data matrix is constructed by decomposing the total scatter matrix. Thirdly, the new data matrix is decomposed by PMD to obtain the sparse eigensamples. Finally, the core genes are identified according to the nonzero entries in eigensamples. The results on simulation data show that CIPMD method can reach higher identification accuracies than the conventional gene identification methods. Moreover, the results on real gene expression data demonstrate that CIPMD method can identify more core genes closely related to the abiotic stresses than the other methods. |
format | Online Article Text |
id | pubmed-4152128 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-41521282014-09-05 A Class-Information-Based Penalized Matrix Decomposition for Identifying Plants Core Genes Responding to Abiotic Stresses Liu, Jin-Xing Liu, Jian Gao, Ying-Lian Mi, Jian-Xun Ma, Chun-Xia Wang, Dong PLoS One Research Article In terms of making genes expression data more interpretable and comprehensible, there exists a significant superiority on sparse methods. Many sparse methods, such as penalized matrix decomposition (PMD) and sparse principal component analysis (SPCA), have been applied to extract plants core genes. Supervised algorithms, especially the support vector machine-recursive feature elimination (SVM-RFE) method, always have good performance in gene selection. In this paper, we draw into class information via the total scatter matrix and put forward a class-information-based penalized matrix decomposition (CIPMD) method to improve the gene identification performance of PMD-based method. Firstly, the total scatter matrix is obtained based on different samples of the gene expression data. Secondly, a new data matrix is constructed by decomposing the total scatter matrix. Thirdly, the new data matrix is decomposed by PMD to obtain the sparse eigensamples. Finally, the core genes are identified according to the nonzero entries in eigensamples. The results on simulation data show that CIPMD method can reach higher identification accuracies than the conventional gene identification methods. Moreover, the results on real gene expression data demonstrate that CIPMD method can identify more core genes closely related to the abiotic stresses than the other methods. Public Library of Science 2014-09-02 /pmc/articles/PMC4152128/ /pubmed/25180509 http://dx.doi.org/10.1371/journal.pone.0106097 Text en © 2014 Liu et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Liu, Jin-Xing Liu, Jian Gao, Ying-Lian Mi, Jian-Xun Ma, Chun-Xia Wang, Dong A Class-Information-Based Penalized Matrix Decomposition for Identifying Plants Core Genes Responding to Abiotic Stresses |
title | A Class-Information-Based Penalized Matrix Decomposition for Identifying Plants Core Genes Responding to Abiotic Stresses |
title_full | A Class-Information-Based Penalized Matrix Decomposition for Identifying Plants Core Genes Responding to Abiotic Stresses |
title_fullStr | A Class-Information-Based Penalized Matrix Decomposition for Identifying Plants Core Genes Responding to Abiotic Stresses |
title_full_unstemmed | A Class-Information-Based Penalized Matrix Decomposition for Identifying Plants Core Genes Responding to Abiotic Stresses |
title_short | A Class-Information-Based Penalized Matrix Decomposition for Identifying Plants Core Genes Responding to Abiotic Stresses |
title_sort | class-information-based penalized matrix decomposition for identifying plants core genes responding to abiotic stresses |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4152128/ https://www.ncbi.nlm.nih.gov/pubmed/25180509 http://dx.doi.org/10.1371/journal.pone.0106097 |
work_keys_str_mv | AT liujinxing aclassinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses AT liujian aclassinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses AT gaoyinglian aclassinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses AT mijianxun aclassinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses AT machunxia aclassinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses AT wangdong aclassinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses AT liujinxing classinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses AT liujian classinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses AT gaoyinglian classinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses AT mijianxun classinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses AT machunxia classinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses AT wangdong classinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses |