Cargando…
Identification of prognostic genes and gene sets for early-stage non-small cell lung cancer using bi-level selection methods
In contrast to feature selection and gene set analysis, bi-level selection is a process of selecting not only important gene sets but also important genes within those gene sets. Depending on the order of selections, a bi-level selection method can be classified into three categories – forward selec...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5384004/ https://www.ncbi.nlm.nih.gov/pubmed/28387364 http://dx.doi.org/10.1038/srep46164 |
_version_ | 1782520387191439360 |
---|---|
author | Tian, Suyan Wang, Chi Chang, Howard H. Sun, Jianguo |
author_facet | Tian, Suyan Wang, Chi Chang, Howard H. Sun, Jianguo |
author_sort | Tian, Suyan |
collection | PubMed |
description | In contrast to feature selection and gene set analysis, bi-level selection is a process of selecting not only important gene sets but also important genes within those gene sets. Depending on the order of selections, a bi-level selection method can be classified into three categories – forward selection, which first selects relevant gene sets followed by the selection of relevant individual genes; backward selection which takes the reversed order; and simultaneous selection, which performs the two tasks simultaneously usually with the aids of a penalized regression model. To test the existence of subtype-specific prognostic genes for non-small cell lung cancer (NSCLC), we had previously proposed the Cox-filter method that examines the association between patients’ survival time after diagnosis with one specific gene, the disease subtypes, and their interaction terms. In this study, we further extend it to carry out forward and backward bi-level selection. Using simulations and a NSCLC application, we demonstrate that the forward selection outperforms the backward selection and other relevant algorithms in our setting. Both proposed methods are readily understandable and interpretable. Therefore, they represent useful tools for the researchers who are interested in exploring the prognostic value of gene expression data for specific subtypes or stages of a disease. |
format | Online Article Text |
id | pubmed-5384004 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-53840042017-04-11 Identification of prognostic genes and gene sets for early-stage non-small cell lung cancer using bi-level selection methods Tian, Suyan Wang, Chi Chang, Howard H. Sun, Jianguo Sci Rep Article In contrast to feature selection and gene set analysis, bi-level selection is a process of selecting not only important gene sets but also important genes within those gene sets. Depending on the order of selections, a bi-level selection method can be classified into three categories – forward selection, which first selects relevant gene sets followed by the selection of relevant individual genes; backward selection which takes the reversed order; and simultaneous selection, which performs the two tasks simultaneously usually with the aids of a penalized regression model. To test the existence of subtype-specific prognostic genes for non-small cell lung cancer (NSCLC), we had previously proposed the Cox-filter method that examines the association between patients’ survival time after diagnosis with one specific gene, the disease subtypes, and their interaction terms. In this study, we further extend it to carry out forward and backward bi-level selection. Using simulations and a NSCLC application, we demonstrate that the forward selection outperforms the backward selection and other relevant algorithms in our setting. Both proposed methods are readily understandable and interpretable. Therefore, they represent useful tools for the researchers who are interested in exploring the prognostic value of gene expression data for specific subtypes or stages of a disease. Nature Publishing Group 2017-04-07 /pmc/articles/PMC5384004/ /pubmed/28387364 http://dx.doi.org/10.1038/srep46164 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Tian, Suyan Wang, Chi Chang, Howard H. Sun, Jianguo Identification of prognostic genes and gene sets for early-stage non-small cell lung cancer using bi-level selection methods |
title | Identification of prognostic genes and gene sets for early-stage non-small cell lung cancer using bi-level selection methods |
title_full | Identification of prognostic genes and gene sets for early-stage non-small cell lung cancer using bi-level selection methods |
title_fullStr | Identification of prognostic genes and gene sets for early-stage non-small cell lung cancer using bi-level selection methods |
title_full_unstemmed | Identification of prognostic genes and gene sets for early-stage non-small cell lung cancer using bi-level selection methods |
title_short | Identification of prognostic genes and gene sets for early-stage non-small cell lung cancer using bi-level selection methods |
title_sort | identification of prognostic genes and gene sets for early-stage non-small cell lung cancer using bi-level selection methods |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5384004/ https://www.ncbi.nlm.nih.gov/pubmed/28387364 http://dx.doi.org/10.1038/srep46164 |
work_keys_str_mv | AT tiansuyan identificationofprognosticgenesandgenesetsforearlystagenonsmallcelllungcancerusingbilevelselectionmethods AT wangchi identificationofprognosticgenesandgenesetsforearlystagenonsmallcelllungcancerusingbilevelselectionmethods AT changhowardh identificationofprognosticgenesandgenesetsforearlystagenonsmallcelllungcancerusingbilevelselectionmethods AT sunjianguo identificationofprognosticgenesandgenesetsforearlystagenonsmallcelllungcancerusingbilevelselectionmethods |