Cargando…

A Class-Information-Based Penalized Matrix Decomposition for Identifying Plants Core Genes Responding to Abiotic Stresses

In terms of making genes expression data more interpretable and comprehensible, there exists a significant superiority on sparse methods. Many sparse methods, such as penalized matrix decomposition (PMD) and sparse principal component analysis (SPCA), have been applied to extract plants core genes....

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Jin-Xing, Liu, Jian, Gao, Ying-Lian, Mi, Jian-Xun, Ma, Chun-Xia, Wang, Dong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4152128/
https://www.ncbi.nlm.nih.gov/pubmed/25180509
http://dx.doi.org/10.1371/journal.pone.0106097
_version_ 1782333082940997632
author Liu, Jin-Xing
Liu, Jian
Gao, Ying-Lian
Mi, Jian-Xun
Ma, Chun-Xia
Wang, Dong
author_facet Liu, Jin-Xing
Liu, Jian
Gao, Ying-Lian
Mi, Jian-Xun
Ma, Chun-Xia
Wang, Dong
author_sort Liu, Jin-Xing
collection PubMed
description In terms of making genes expression data more interpretable and comprehensible, there exists a significant superiority on sparse methods. Many sparse methods, such as penalized matrix decomposition (PMD) and sparse principal component analysis (SPCA), have been applied to extract plants core genes. Supervised algorithms, especially the support vector machine-recursive feature elimination (SVM-RFE) method, always have good performance in gene selection. In this paper, we draw into class information via the total scatter matrix and put forward a class-information-based penalized matrix decomposition (CIPMD) method to improve the gene identification performance of PMD-based method. Firstly, the total scatter matrix is obtained based on different samples of the gene expression data. Secondly, a new data matrix is constructed by decomposing the total scatter matrix. Thirdly, the new data matrix is decomposed by PMD to obtain the sparse eigensamples. Finally, the core genes are identified according to the nonzero entries in eigensamples. The results on simulation data show that CIPMD method can reach higher identification accuracies than the conventional gene identification methods. Moreover, the results on real gene expression data demonstrate that CIPMD method can identify more core genes closely related to the abiotic stresses than the other methods.
format Online
Article
Text
id pubmed-4152128
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-41521282014-09-05 A Class-Information-Based Penalized Matrix Decomposition for Identifying Plants Core Genes Responding to Abiotic Stresses Liu, Jin-Xing Liu, Jian Gao, Ying-Lian Mi, Jian-Xun Ma, Chun-Xia Wang, Dong PLoS One Research Article In terms of making genes expression data more interpretable and comprehensible, there exists a significant superiority on sparse methods. Many sparse methods, such as penalized matrix decomposition (PMD) and sparse principal component analysis (SPCA), have been applied to extract plants core genes. Supervised algorithms, especially the support vector machine-recursive feature elimination (SVM-RFE) method, always have good performance in gene selection. In this paper, we draw into class information via the total scatter matrix and put forward a class-information-based penalized matrix decomposition (CIPMD) method to improve the gene identification performance of PMD-based method. Firstly, the total scatter matrix is obtained based on different samples of the gene expression data. Secondly, a new data matrix is constructed by decomposing the total scatter matrix. Thirdly, the new data matrix is decomposed by PMD to obtain the sparse eigensamples. Finally, the core genes are identified according to the nonzero entries in eigensamples. The results on simulation data show that CIPMD method can reach higher identification accuracies than the conventional gene identification methods. Moreover, the results on real gene expression data demonstrate that CIPMD method can identify more core genes closely related to the abiotic stresses than the other methods. Public Library of Science 2014-09-02 /pmc/articles/PMC4152128/ /pubmed/25180509 http://dx.doi.org/10.1371/journal.pone.0106097 Text en © 2014 Liu et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Liu, Jin-Xing
Liu, Jian
Gao, Ying-Lian
Mi, Jian-Xun
Ma, Chun-Xia
Wang, Dong
A Class-Information-Based Penalized Matrix Decomposition for Identifying Plants Core Genes Responding to Abiotic Stresses
title A Class-Information-Based Penalized Matrix Decomposition for Identifying Plants Core Genes Responding to Abiotic Stresses
title_full A Class-Information-Based Penalized Matrix Decomposition for Identifying Plants Core Genes Responding to Abiotic Stresses
title_fullStr A Class-Information-Based Penalized Matrix Decomposition for Identifying Plants Core Genes Responding to Abiotic Stresses
title_full_unstemmed A Class-Information-Based Penalized Matrix Decomposition for Identifying Plants Core Genes Responding to Abiotic Stresses
title_short A Class-Information-Based Penalized Matrix Decomposition for Identifying Plants Core Genes Responding to Abiotic Stresses
title_sort class-information-based penalized matrix decomposition for identifying plants core genes responding to abiotic stresses
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4152128/
https://www.ncbi.nlm.nih.gov/pubmed/25180509
http://dx.doi.org/10.1371/journal.pone.0106097
work_keys_str_mv AT liujinxing aclassinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses
AT liujian aclassinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses
AT gaoyinglian aclassinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses
AT mijianxun aclassinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses
AT machunxia aclassinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses
AT wangdong aclassinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses
AT liujinxing classinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses
AT liujian classinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses
AT gaoyinglian classinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses
AT mijianxun classinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses
AT machunxia classinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses
AT wangdong classinformationbasedpenalizedmatrixdecompositionforidentifyingplantscoregenesrespondingtoabioticstresses