Cargando…

Identification of prognostic genes and gene sets for early-stage non-small cell lung cancer using bi-level selection methods

In contrast to feature selection and gene set analysis, bi-level selection is a process of selecting not only important gene sets but also important genes within those gene sets. Depending on the order of selections, a bi-level selection method can be classified into three categories – forward selec...

Descripción completa

Detalles Bibliográficos
Autores principales: Tian, Suyan, Wang, Chi, Chang, Howard H., Sun, Jianguo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5384004/
https://www.ncbi.nlm.nih.gov/pubmed/28387364
http://dx.doi.org/10.1038/srep46164
_version_ 1782520387191439360
author Tian, Suyan
Wang, Chi
Chang, Howard H.
Sun, Jianguo
author_facet Tian, Suyan
Wang, Chi
Chang, Howard H.
Sun, Jianguo
author_sort Tian, Suyan
collection PubMed
description In contrast to feature selection and gene set analysis, bi-level selection is a process of selecting not only important gene sets but also important genes within those gene sets. Depending on the order of selections, a bi-level selection method can be classified into three categories – forward selection, which first selects relevant gene sets followed by the selection of relevant individual genes; backward selection which takes the reversed order; and simultaneous selection, which performs the two tasks simultaneously usually with the aids of a penalized regression model. To test the existence of subtype-specific prognostic genes for non-small cell lung cancer (NSCLC), we had previously proposed the Cox-filter method that examines the association between patients’ survival time after diagnosis with one specific gene, the disease subtypes, and their interaction terms. In this study, we further extend it to carry out forward and backward bi-level selection. Using simulations and a NSCLC application, we demonstrate that the forward selection outperforms the backward selection and other relevant algorithms in our setting. Both proposed methods are readily understandable and interpretable. Therefore, they represent useful tools for the researchers who are interested in exploring the prognostic value of gene expression data for specific subtypes or stages of a disease.
format Online
Article
Text
id pubmed-5384004
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-53840042017-04-11 Identification of prognostic genes and gene sets for early-stage non-small cell lung cancer using bi-level selection methods Tian, Suyan Wang, Chi Chang, Howard H. Sun, Jianguo Sci Rep Article In contrast to feature selection and gene set analysis, bi-level selection is a process of selecting not only important gene sets but also important genes within those gene sets. Depending on the order of selections, a bi-level selection method can be classified into three categories – forward selection, which first selects relevant gene sets followed by the selection of relevant individual genes; backward selection which takes the reversed order; and simultaneous selection, which performs the two tasks simultaneously usually with the aids of a penalized regression model. To test the existence of subtype-specific prognostic genes for non-small cell lung cancer (NSCLC), we had previously proposed the Cox-filter method that examines the association between patients’ survival time after diagnosis with one specific gene, the disease subtypes, and their interaction terms. In this study, we further extend it to carry out forward and backward bi-level selection. Using simulations and a NSCLC application, we demonstrate that the forward selection outperforms the backward selection and other relevant algorithms in our setting. Both proposed methods are readily understandable and interpretable. Therefore, they represent useful tools for the researchers who are interested in exploring the prognostic value of gene expression data for specific subtypes or stages of a disease. Nature Publishing Group 2017-04-07 /pmc/articles/PMC5384004/ /pubmed/28387364 http://dx.doi.org/10.1038/srep46164 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Tian, Suyan
Wang, Chi
Chang, Howard H.
Sun, Jianguo
Identification of prognostic genes and gene sets for early-stage non-small cell lung cancer using bi-level selection methods
title Identification of prognostic genes and gene sets for early-stage non-small cell lung cancer using bi-level selection methods
title_full Identification of prognostic genes and gene sets for early-stage non-small cell lung cancer using bi-level selection methods
title_fullStr Identification of prognostic genes and gene sets for early-stage non-small cell lung cancer using bi-level selection methods
title_full_unstemmed Identification of prognostic genes and gene sets for early-stage non-small cell lung cancer using bi-level selection methods
title_short Identification of prognostic genes and gene sets for early-stage non-small cell lung cancer using bi-level selection methods
title_sort identification of prognostic genes and gene sets for early-stage non-small cell lung cancer using bi-level selection methods
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5384004/
https://www.ncbi.nlm.nih.gov/pubmed/28387364
http://dx.doi.org/10.1038/srep46164
work_keys_str_mv AT tiansuyan identificationofprognosticgenesandgenesetsforearlystagenonsmallcelllungcancerusingbilevelselectionmethods
AT wangchi identificationofprognosticgenesandgenesetsforearlystagenonsmallcelllungcancerusingbilevelselectionmethods
AT changhowardh identificationofprognosticgenesandgenesetsforearlystagenonsmallcelllungcancerusingbilevelselectionmethods
AT sunjianguo identificationofprognosticgenesandgenesetsforearlystagenonsmallcelllungcancerusingbilevelselectionmethods