Cargando…

A longitudinal feature selection method identifies relevant genes to distinguish complicated injury and uncomplicated injury over time

BACKGROUND: Feature selection and gene set analysis are of increasing interest in the field of bioinformatics. While these two approaches have been developed for different purposes, we describe how some gene set analysis methods can be utilized to conduct feature selection. METHODS: We adopted a gen...

Descripción completa

Detalles Bibliográficos
Autores principales: Tian, Suyan, Wang, Chi, Chang, Howard H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6284265/
https://www.ncbi.nlm.nih.gov/pubmed/30526581
http://dx.doi.org/10.1186/s12911-018-0685-8
_version_ 1783379303519485952
author Tian, Suyan
Wang, Chi
Chang, Howard H.
author_facet Tian, Suyan
Wang, Chi
Chang, Howard H.
author_sort Tian, Suyan
collection PubMed
description BACKGROUND: Feature selection and gene set analysis are of increasing interest in the field of bioinformatics. While these two approaches have been developed for different purposes, we describe how some gene set analysis methods can be utilized to conduct feature selection. METHODS: We adopted a gene set analysis method, the significance analysis of microarray gene set reduction (SAMGSR) algorithm, to carry out feature selection for longitudinal gene expression data. RESULTS: Using a real-world application and simulated data, it is demonstrated that the proposed SAMGSR extension outperforms other relevant methods. In this study, we illustrate that a gene’s expression profiles over time can be regarded as a gene set and then a suitable gene set analysis method can be utilized directly to select relevant genes associated with the phenotype of interest over time. CONCLUSIONS: We believe this work will motivate more research to bridge feature selection and gene set analysis, with the development of novel algorithms capable of carrying out feature selection for longitudinal gene expression data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12911-018-0685-8) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6284265
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-62842652018-12-14 A longitudinal feature selection method identifies relevant genes to distinguish complicated injury and uncomplicated injury over time Tian, Suyan Wang, Chi Chang, Howard H. BMC Med Inform Decis Mak Research BACKGROUND: Feature selection and gene set analysis are of increasing interest in the field of bioinformatics. While these two approaches have been developed for different purposes, we describe how some gene set analysis methods can be utilized to conduct feature selection. METHODS: We adopted a gene set analysis method, the significance analysis of microarray gene set reduction (SAMGSR) algorithm, to carry out feature selection for longitudinal gene expression data. RESULTS: Using a real-world application and simulated data, it is demonstrated that the proposed SAMGSR extension outperforms other relevant methods. In this study, we illustrate that a gene’s expression profiles over time can be regarded as a gene set and then a suitable gene set analysis method can be utilized directly to select relevant genes associated with the phenotype of interest over time. CONCLUSIONS: We believe this work will motivate more research to bridge feature selection and gene set analysis, with the development of novel algorithms capable of carrying out feature selection for longitudinal gene expression data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12911-018-0685-8) contains supplementary material, which is available to authorized users. BioMed Central 2018-12-07 /pmc/articles/PMC6284265/ /pubmed/30526581 http://dx.doi.org/10.1186/s12911-018-0685-8 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Tian, Suyan
Wang, Chi
Chang, Howard H.
A longitudinal feature selection method identifies relevant genes to distinguish complicated injury and uncomplicated injury over time
title A longitudinal feature selection method identifies relevant genes to distinguish complicated injury and uncomplicated injury over time
title_full A longitudinal feature selection method identifies relevant genes to distinguish complicated injury and uncomplicated injury over time
title_fullStr A longitudinal feature selection method identifies relevant genes to distinguish complicated injury and uncomplicated injury over time
title_full_unstemmed A longitudinal feature selection method identifies relevant genes to distinguish complicated injury and uncomplicated injury over time
title_short A longitudinal feature selection method identifies relevant genes to distinguish complicated injury and uncomplicated injury over time
title_sort longitudinal feature selection method identifies relevant genes to distinguish complicated injury and uncomplicated injury over time
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6284265/
https://www.ncbi.nlm.nih.gov/pubmed/30526581
http://dx.doi.org/10.1186/s12911-018-0685-8
work_keys_str_mv AT tiansuyan alongitudinalfeatureselectionmethodidentifiesrelevantgenestodistinguishcomplicatedinjuryanduncomplicatedinjuryovertime
AT wangchi alongitudinalfeatureselectionmethodidentifiesrelevantgenestodistinguishcomplicatedinjuryanduncomplicatedinjuryovertime
AT changhowardh alongitudinalfeatureselectionmethodidentifiesrelevantgenestodistinguishcomplicatedinjuryanduncomplicatedinjuryovertime
AT tiansuyan longitudinalfeatureselectionmethodidentifiesrelevantgenestodistinguishcomplicatedinjuryanduncomplicatedinjuryovertime
AT wangchi longitudinalfeatureselectionmethodidentifiesrelevantgenestodistinguishcomplicatedinjuryanduncomplicatedinjuryovertime
AT changhowardh longitudinalfeatureselectionmethodidentifiesrelevantgenestodistinguishcomplicatedinjuryanduncomplicatedinjuryovertime