Cargando…

A Robust Rerank Approach for Feature Selection and Its Application to Pooling-Based GWA Studies

Large-p-small-n datasets are commonly encountered in modern biomedical studies. To detect the difference between two groups, conventional methods would fail to apply due to the instability in estimating variances in t-test and a high proportion of tied values in AUC (area under the receiver operatin...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Jia-Rou, Kuo, Po-Hsiu, Hung, Hung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3638651/
https://www.ncbi.nlm.nih.gov/pubmed/23653667
http://dx.doi.org/10.1155/2013/860673
_version_ 1782475861480439808
author Liu, Jia-Rou
Kuo, Po-Hsiu
Hung, Hung
author_facet Liu, Jia-Rou
Kuo, Po-Hsiu
Hung, Hung
author_sort Liu, Jia-Rou
collection PubMed
description Large-p-small-n datasets are commonly encountered in modern biomedical studies. To detect the difference between two groups, conventional methods would fail to apply due to the instability in estimating variances in t-test and a high proportion of tied values in AUC (area under the receiver operating characteristic curve) estimates. The significance analysis of microarrays (SAM) may also not be satisfactory, since its performance is sensitive to the tuning parameter, and its selection is not straightforward. In this work, we propose a robust rerank approach to overcome the above-mentioned diffculties. In particular, we obtain a rank-based statistic for each feature based on the concept of “rank-over-variable.” Techniques of “random subset” and “rerank” are then iteratively applied to rank features, and the leading features will be selected for further studies. The proposed re-rank approach is especially applicable for large-p-small-n datasets. Moreover, it is insensitive to the selection of tuning parameters, which is an appealing property for practical implementation. Simulation studies and real data analysis of pooling-based genome wide association (GWA) studies demonstrate the usefulness of our method.
format Online
Article
Text
id pubmed-3638651
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-36386512013-05-07 A Robust Rerank Approach for Feature Selection and Its Application to Pooling-Based GWA Studies Liu, Jia-Rou Kuo, Po-Hsiu Hung, Hung Comput Math Methods Med Research Article Large-p-small-n datasets are commonly encountered in modern biomedical studies. To detect the difference between two groups, conventional methods would fail to apply due to the instability in estimating variances in t-test and a high proportion of tied values in AUC (area under the receiver operating characteristic curve) estimates. The significance analysis of microarrays (SAM) may also not be satisfactory, since its performance is sensitive to the tuning parameter, and its selection is not straightforward. In this work, we propose a robust rerank approach to overcome the above-mentioned diffculties. In particular, we obtain a rank-based statistic for each feature based on the concept of “rank-over-variable.” Techniques of “random subset” and “rerank” are then iteratively applied to rank features, and the leading features will be selected for further studies. The proposed re-rank approach is especially applicable for large-p-small-n datasets. Moreover, it is insensitive to the selection of tuning parameters, which is an appealing property for practical implementation. Simulation studies and real data analysis of pooling-based genome wide association (GWA) studies demonstrate the usefulness of our method. Hindawi Publishing Corporation 2013 2013-04-04 /pmc/articles/PMC3638651/ /pubmed/23653667 http://dx.doi.org/10.1155/2013/860673 Text en Copyright © 2013 Jia-Rou Liu et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Liu, Jia-Rou
Kuo, Po-Hsiu
Hung, Hung
A Robust Rerank Approach for Feature Selection and Its Application to Pooling-Based GWA Studies
title A Robust Rerank Approach for Feature Selection and Its Application to Pooling-Based GWA Studies
title_full A Robust Rerank Approach for Feature Selection and Its Application to Pooling-Based GWA Studies
title_fullStr A Robust Rerank Approach for Feature Selection and Its Application to Pooling-Based GWA Studies
title_full_unstemmed A Robust Rerank Approach for Feature Selection and Its Application to Pooling-Based GWA Studies
title_short A Robust Rerank Approach for Feature Selection and Its Application to Pooling-Based GWA Studies
title_sort robust rerank approach for feature selection and its application to pooling-based gwa studies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3638651/
https://www.ncbi.nlm.nih.gov/pubmed/23653667
http://dx.doi.org/10.1155/2013/860673
work_keys_str_mv AT liujiarou arobustrerankapproachforfeatureselectionanditsapplicationtopoolingbasedgwastudies
AT kuopohsiu arobustrerankapproachforfeatureselectionanditsapplicationtopoolingbasedgwastudies
AT hunghung arobustrerankapproachforfeatureselectionanditsapplicationtopoolingbasedgwastudies
AT liujiarou robustrerankapproachforfeatureselectionanditsapplicationtopoolingbasedgwastudies
AT kuopohsiu robustrerankapproachforfeatureselectionanditsapplicationtopoolingbasedgwastudies
AT hunghung robustrerankapproachforfeatureselectionanditsapplicationtopoolingbasedgwastudies