Cargando…
The projack: a resampling approach to correct for ranking bias in high-throughput studies
The problem of ranked inference arises in a number of settings, for which the investigator wishes to perform parameter inference after ordering a set of [Formula: see text] statistics. In contrast to inference for a single hypothesis, the ranking procedure introduces considerable bias, a problem kno...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4679068/ https://www.ncbi.nlm.nih.gov/pubmed/26040912 http://dx.doi.org/10.1093/biostatistics/kxv022 |
_version_ | 1782405540973903872 |
---|---|
author | Zhou, Yi-Hui Wright, Fred A. |
author_facet | Zhou, Yi-Hui Wright, Fred A. |
author_sort | Zhou, Yi-Hui |
collection | PubMed |
description | The problem of ranked inference arises in a number of settings, for which the investigator wishes to perform parameter inference after ordering a set of [Formula: see text] statistics. In contrast to inference for a single hypothesis, the ranking procedure introduces considerable bias, a problem known as the “winner's curse” in genetic association. We introduce the projack (for Prediction by Re- Ordered Jackknife and Cross-Validation, [Formula: see text]-fold). The projack is a resampling-based procedure that provides low-bias estimates of the expected ranked effect size parameter for a set of possibly correlated [Formula: see text] statistics. The approach is flexible, and has wide applicability to high-dimensional datasets, including those arising from genomics platforms. Initially, motivated for the setting where original data are available for resampling, the projack can be extended to the situation where only the vector of [Formula: see text] values is available. We illustrate the projack for correction of the winner's curse in genetic association, although it can be used much more generally. |
format | Online Article Text |
id | pubmed-4679068 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-46790682015-12-16 The projack: a resampling approach to correct for ranking bias in high-throughput studies Zhou, Yi-Hui Wright, Fred A. Biostatistics Articles The problem of ranked inference arises in a number of settings, for which the investigator wishes to perform parameter inference after ordering a set of [Formula: see text] statistics. In contrast to inference for a single hypothesis, the ranking procedure introduces considerable bias, a problem known as the “winner's curse” in genetic association. We introduce the projack (for Prediction by Re- Ordered Jackknife and Cross-Validation, [Formula: see text]-fold). The projack is a resampling-based procedure that provides low-bias estimates of the expected ranked effect size parameter for a set of possibly correlated [Formula: see text] statistics. The approach is flexible, and has wide applicability to high-dimensional datasets, including those arising from genomics platforms. Initially, motivated for the setting where original data are available for resampling, the projack can be extended to the situation where only the vector of [Formula: see text] values is available. We illustrate the projack for correction of the winner's curse in genetic association, although it can be used much more generally. Oxford University Press 2016-01 2015-06-03 /pmc/articles/PMC4679068/ /pubmed/26040912 http://dx.doi.org/10.1093/biostatistics/kxv022 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Articles Zhou, Yi-Hui Wright, Fred A. The projack: a resampling approach to correct for ranking bias in high-throughput studies |
title | The projack: a resampling approach to correct for ranking bias in high-throughput studies |
title_full | The projack: a resampling approach to correct for ranking bias in high-throughput studies |
title_fullStr | The projack: a resampling approach to correct for ranking bias in high-throughput studies |
title_full_unstemmed | The projack: a resampling approach to correct for ranking bias in high-throughput studies |
title_short | The projack: a resampling approach to correct for ranking bias in high-throughput studies |
title_sort | projack: a resampling approach to correct for ranking bias in high-throughput studies |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4679068/ https://www.ncbi.nlm.nih.gov/pubmed/26040912 http://dx.doi.org/10.1093/biostatistics/kxv022 |
work_keys_str_mv | AT zhouyihui theprojackaresamplingapproachtocorrectforrankingbiasinhighthroughputstudies AT wrightfreda theprojackaresamplingapproachtocorrectforrankingbiasinhighthroughputstudies AT zhouyihui projackaresamplingapproachtocorrectforrankingbiasinhighthroughputstudies AT wrightfreda projackaresamplingapproachtocorrectforrankingbiasinhighthroughputstudies |