Cargando…

The projack: a resampling approach to correct for ranking bias in high-throughput studies

The problem of ranked inference arises in a number of settings, for which the investigator wishes to perform parameter inference after ordering a set of [Formula: see text] statistics. In contrast to inference for a single hypothesis, the ranking procedure introduces considerable bias, a problem kno...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Yi-Hui, Wright, Fred A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4679068/
https://www.ncbi.nlm.nih.gov/pubmed/26040912
http://dx.doi.org/10.1093/biostatistics/kxv022
_version_ 1782405540973903872
author Zhou, Yi-Hui
Wright, Fred A.
author_facet Zhou, Yi-Hui
Wright, Fred A.
author_sort Zhou, Yi-Hui
collection PubMed
description The problem of ranked inference arises in a number of settings, for which the investigator wishes to perform parameter inference after ordering a set of [Formula: see text] statistics. In contrast to inference for a single hypothesis, the ranking procedure introduces considerable bias, a problem known as the “winner's curse” in genetic association. We introduce the projack (for Prediction by Re- Ordered Jackknife and Cross-Validation, [Formula: see text]-fold). The projack is a resampling-based procedure that provides low-bias estimates of the expected ranked effect size parameter for a set of possibly correlated [Formula: see text] statistics. The approach is flexible, and has wide applicability to high-dimensional datasets, including those arising from genomics platforms. Initially, motivated for the setting where original data are available for resampling, the projack can be extended to the situation where only the vector of [Formula: see text] values is available. We illustrate the projack for correction of the winner's curse in genetic association, although it can be used much more generally.
format Online
Article
Text
id pubmed-4679068
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-46790682015-12-16 The projack: a resampling approach to correct for ranking bias in high-throughput studies Zhou, Yi-Hui Wright, Fred A. Biostatistics Articles The problem of ranked inference arises in a number of settings, for which the investigator wishes to perform parameter inference after ordering a set of [Formula: see text] statistics. In contrast to inference for a single hypothesis, the ranking procedure introduces considerable bias, a problem known as the “winner's curse” in genetic association. We introduce the projack (for Prediction by Re- Ordered Jackknife and Cross-Validation, [Formula: see text]-fold). The projack is a resampling-based procedure that provides low-bias estimates of the expected ranked effect size parameter for a set of possibly correlated [Formula: see text] statistics. The approach is flexible, and has wide applicability to high-dimensional datasets, including those arising from genomics platforms. Initially, motivated for the setting where original data are available for resampling, the projack can be extended to the situation where only the vector of [Formula: see text] values is available. We illustrate the projack for correction of the winner's curse in genetic association, although it can be used much more generally. Oxford University Press 2016-01 2015-06-03 /pmc/articles/PMC4679068/ /pubmed/26040912 http://dx.doi.org/10.1093/biostatistics/kxv022 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Zhou, Yi-Hui
Wright, Fred A.
The projack: a resampling approach to correct for ranking bias in high-throughput studies
title The projack: a resampling approach to correct for ranking bias in high-throughput studies
title_full The projack: a resampling approach to correct for ranking bias in high-throughput studies
title_fullStr The projack: a resampling approach to correct for ranking bias in high-throughput studies
title_full_unstemmed The projack: a resampling approach to correct for ranking bias in high-throughput studies
title_short The projack: a resampling approach to correct for ranking bias in high-throughput studies
title_sort projack: a resampling approach to correct for ranking bias in high-throughput studies
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4679068/
https://www.ncbi.nlm.nih.gov/pubmed/26040912
http://dx.doi.org/10.1093/biostatistics/kxv022
work_keys_str_mv AT zhouyihui theprojackaresamplingapproachtocorrectforrankingbiasinhighthroughputstudies
AT wrightfreda theprojackaresamplingapproachtocorrectforrankingbiasinhighthroughputstudies
AT zhouyihui projackaresamplingapproachtocorrectforrankingbiasinhighthroughputstudies
AT wrightfreda projackaresamplingapproachtocorrectforrankingbiasinhighthroughputstudies