Cargando…
To select relevant features for longitudinal gene expression data by extending a pathway analysis method
The emerging field of pathway-based feature selection that incorporates biological information conveyed by gene sets/pathways to guide the selection of relevant genes has become increasingly popular and widespread. In this study, we adapt a gene set analysis method – the significance analysis of mic...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
F1000 Research Limited
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6124382/ https://www.ncbi.nlm.nih.gov/pubmed/30271585 http://dx.doi.org/10.12688/f1000research.15357.1 |
_version_ | 1783353024577536000 |
---|---|
author | Tian, Suyan Wang, Chi Chang, Howard H. |
author_facet | Tian, Suyan Wang, Chi Chang, Howard H. |
author_sort | Tian, Suyan |
collection | PubMed |
description | The emerging field of pathway-based feature selection that incorporates biological information conveyed by gene sets/pathways to guide the selection of relevant genes has become increasingly popular and widespread. In this study, we adapt a gene set analysis method – the significance analysis of microarray gene set reduction (SAMGSR) algorithm to carry out feature selection for longitudinal microarray data, and propose a pathway-based feature selection algorithm – the two-level SAMGSR method. By using simulated data and a real-world application, we demonstrate that a gene’s expression profiles over time can be considered as a gene set. Thus a suitable gene set analysis method can be utilized or modified to execute the selection of relevant genes for longitudinal omics data. We believe this work paves the way for more research to bridge feature selection and gene set analysis with the development of novel pathway-based feature selection algorithms. |
format | Online Article Text |
id | pubmed-6124382 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | F1000 Research Limited |
record_format | MEDLINE/PubMed |
spelling | pubmed-61243822018-09-28 To select relevant features for longitudinal gene expression data by extending a pathway analysis method Tian, Suyan Wang, Chi Chang, Howard H. F1000Res Method Article The emerging field of pathway-based feature selection that incorporates biological information conveyed by gene sets/pathways to guide the selection of relevant genes has become increasingly popular and widespread. In this study, we adapt a gene set analysis method – the significance analysis of microarray gene set reduction (SAMGSR) algorithm to carry out feature selection for longitudinal microarray data, and propose a pathway-based feature selection algorithm – the two-level SAMGSR method. By using simulated data and a real-world application, we demonstrate that a gene’s expression profiles over time can be considered as a gene set. Thus a suitable gene set analysis method can be utilized or modified to execute the selection of relevant genes for longitudinal omics data. We believe this work paves the way for more research to bridge feature selection and gene set analysis with the development of novel pathway-based feature selection algorithms. F1000 Research Limited 2018-07-31 /pmc/articles/PMC6124382/ /pubmed/30271585 http://dx.doi.org/10.12688/f1000research.15357.1 Text en Copyright: © 2018 Tian S et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Method Article Tian, Suyan Wang, Chi Chang, Howard H. To select relevant features for longitudinal gene expression data by extending a pathway analysis method |
title | To select relevant features for longitudinal gene expression data by extending a pathway analysis method |
title_full | To select relevant features for longitudinal gene expression data by extending a pathway analysis method |
title_fullStr | To select relevant features for longitudinal gene expression data by extending a pathway analysis method |
title_full_unstemmed | To select relevant features for longitudinal gene expression data by extending a pathway analysis method |
title_short | To select relevant features for longitudinal gene expression data by extending a pathway analysis method |
title_sort | to select relevant features for longitudinal gene expression data by extending a pathway analysis method |
topic | Method Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6124382/ https://www.ncbi.nlm.nih.gov/pubmed/30271585 http://dx.doi.org/10.12688/f1000research.15357.1 |
work_keys_str_mv | AT tiansuyan toselectrelevantfeaturesforlongitudinalgeneexpressiondatabyextendingapathwayanalysismethod AT wangchi toselectrelevantfeaturesforlongitudinalgeneexpressiondatabyextendingapathwayanalysismethod AT changhowardh toselectrelevantfeaturesforlongitudinalgeneexpressiondatabyextendingapathwayanalysismethod |