Cargando…

To select relevant features for longitudinal gene expression data by extending a pathway analysis method

The emerging field of pathway-based feature selection that incorporates biological information conveyed by gene sets/pathways to guide the selection of relevant genes has become increasingly popular and widespread. In this study, we adapt a gene set analysis method – the significance analysis of mic...

Descripción completa

Detalles Bibliográficos
Autores principales: Tian, Suyan, Wang, Chi, Chang, Howard H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6124382/
https://www.ncbi.nlm.nih.gov/pubmed/30271585
http://dx.doi.org/10.12688/f1000research.15357.1
_version_ 1783353024577536000
author Tian, Suyan
Wang, Chi
Chang, Howard H.
author_facet Tian, Suyan
Wang, Chi
Chang, Howard H.
author_sort Tian, Suyan
collection PubMed
description The emerging field of pathway-based feature selection that incorporates biological information conveyed by gene sets/pathways to guide the selection of relevant genes has become increasingly popular and widespread. In this study, we adapt a gene set analysis method – the significance analysis of microarray gene set reduction (SAMGSR) algorithm to carry out feature selection for longitudinal microarray data, and propose a pathway-based feature selection algorithm – the two-level SAMGSR method. By using simulated data and a real-world application, we demonstrate that a gene’s expression profiles over time can be considered as a gene set. Thus a suitable gene set analysis method can be utilized or modified to execute the selection of relevant genes for longitudinal omics data. We believe this work paves the way for more research to bridge feature selection and gene set analysis with the development of novel pathway-based feature selection algorithms.
format Online
Article
Text
id pubmed-6124382
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-61243822018-09-28 To select relevant features for longitudinal gene expression data by extending a pathway analysis method Tian, Suyan Wang, Chi Chang, Howard H. F1000Res Method Article The emerging field of pathway-based feature selection that incorporates biological information conveyed by gene sets/pathways to guide the selection of relevant genes has become increasingly popular and widespread. In this study, we adapt a gene set analysis method – the significance analysis of microarray gene set reduction (SAMGSR) algorithm to carry out feature selection for longitudinal microarray data, and propose a pathway-based feature selection algorithm – the two-level SAMGSR method. By using simulated data and a real-world application, we demonstrate that a gene’s expression profiles over time can be considered as a gene set. Thus a suitable gene set analysis method can be utilized or modified to execute the selection of relevant genes for longitudinal omics data. We believe this work paves the way for more research to bridge feature selection and gene set analysis with the development of novel pathway-based feature selection algorithms. F1000 Research Limited 2018-07-31 /pmc/articles/PMC6124382/ /pubmed/30271585 http://dx.doi.org/10.12688/f1000research.15357.1 Text en Copyright: © 2018 Tian S et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Method Article
Tian, Suyan
Wang, Chi
Chang, Howard H.
To select relevant features for longitudinal gene expression data by extending a pathway analysis method
title To select relevant features for longitudinal gene expression data by extending a pathway analysis method
title_full To select relevant features for longitudinal gene expression data by extending a pathway analysis method
title_fullStr To select relevant features for longitudinal gene expression data by extending a pathway analysis method
title_full_unstemmed To select relevant features for longitudinal gene expression data by extending a pathway analysis method
title_short To select relevant features for longitudinal gene expression data by extending a pathway analysis method
title_sort to select relevant features for longitudinal gene expression data by extending a pathway analysis method
topic Method Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6124382/
https://www.ncbi.nlm.nih.gov/pubmed/30271585
http://dx.doi.org/10.12688/f1000research.15357.1
work_keys_str_mv AT tiansuyan toselectrelevantfeaturesforlongitudinalgeneexpressiondatabyextendingapathwayanalysismethod
AT wangchi toselectrelevantfeaturesforlongitudinalgeneexpressiondatabyextendingapathwayanalysismethod
AT changhowardh toselectrelevantfeaturesforlongitudinalgeneexpressiondatabyextendingapathwayanalysismethod