Cargando…

Null-free False Discovery Rate Control Using Decoy Permutations

The traditional approaches to false discovery rate (FDR) control in multiple hypothesis testing are usually based on the null distribution of a test statistic. However, all types of null distributions, including the theoretical, permutation-based and empirical ones, have some inherent drawbacks. For...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Kun, Li, Meng-jie, Fu, Yan, Gong, Fu-zhou, Sun, Xiao-ming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8994022/
https://www.ncbi.nlm.nih.gov/pubmed/35431377
http://dx.doi.org/10.1007/s10255-022-1077-5
_version_ 1784684021483044864
author He, Kun
Li, Meng-jie
Fu, Yan
Gong, Fu-zhou
Sun, Xiao-ming
author_facet He, Kun
Li, Meng-jie
Fu, Yan
Gong, Fu-zhou
Sun, Xiao-ming
author_sort He, Kun
collection PubMed
description The traditional approaches to false discovery rate (FDR) control in multiple hypothesis testing are usually based on the null distribution of a test statistic. However, all types of null distributions, including the theoretical, permutation-based and empirical ones, have some inherent drawbacks. For example, the theoretical null might fail because of improper assumptions on the sample distribution. Here, we propose a null distribution-free approach to FDR control for multiple hypothesis testing in the case-control study. This approach, named target-decoy procedure, simply builds on the ordering of tests by some statistic or score, the null distribution of which is not required to be known. Competitive decoy tests are constructed from permutations of original samples and are used to estimate the false target discoveries. We prove that this approach controls the FDR when the score function is symmetric and the scores are independent between different tests. Simulation demonstrates that it is more stable and powerful than two popular traditional approaches, even in the existence of dependency. Evaluation is also made on two real datasets, including an arabidopsis genomics dataset and a COVID-19 proteomics dataset.
format Online
Article
Text
id pubmed-8994022
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-89940222022-04-11 Null-free False Discovery Rate Control Using Decoy Permutations He, Kun Li, Meng-jie Fu, Yan Gong, Fu-zhou Sun, Xiao-ming Acta Math Appl Sin Article The traditional approaches to false discovery rate (FDR) control in multiple hypothesis testing are usually based on the null distribution of a test statistic. However, all types of null distributions, including the theoretical, permutation-based and empirical ones, have some inherent drawbacks. For example, the theoretical null might fail because of improper assumptions on the sample distribution. Here, we propose a null distribution-free approach to FDR control for multiple hypothesis testing in the case-control study. This approach, named target-decoy procedure, simply builds on the ordering of tests by some statistic or score, the null distribution of which is not required to be known. Competitive decoy tests are constructed from permutations of original samples and are used to estimate the false target discoveries. We prove that this approach controls the FDR when the score function is symmetric and the scores are independent between different tests. Simulation demonstrates that it is more stable and powerful than two popular traditional approaches, even in the existence of dependency. Evaluation is also made on two real datasets, including an arabidopsis genomics dataset and a COVID-19 proteomics dataset. Springer Berlin Heidelberg 2022-04-09 2022 /pmc/articles/PMC8994022/ /pubmed/35431377 http://dx.doi.org/10.1007/s10255-022-1077-5 Text en © The Editorial Office of AMAS & Springer-Verlag GmbH Germany 2022 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
He, Kun
Li, Meng-jie
Fu, Yan
Gong, Fu-zhou
Sun, Xiao-ming
Null-free False Discovery Rate Control Using Decoy Permutations
title Null-free False Discovery Rate Control Using Decoy Permutations
title_full Null-free False Discovery Rate Control Using Decoy Permutations
title_fullStr Null-free False Discovery Rate Control Using Decoy Permutations
title_full_unstemmed Null-free False Discovery Rate Control Using Decoy Permutations
title_short Null-free False Discovery Rate Control Using Decoy Permutations
title_sort null-free false discovery rate control using decoy permutations
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8994022/
https://www.ncbi.nlm.nih.gov/pubmed/35431377
http://dx.doi.org/10.1007/s10255-022-1077-5
work_keys_str_mv AT hekun nullfreefalsediscoveryratecontrolusingdecoypermutations
AT limengjie nullfreefalsediscoveryratecontrolusingdecoypermutations
AT fuyan nullfreefalsediscoveryratecontrolusingdecoypermutations
AT gongfuzhou nullfreefalsediscoveryratecontrolusingdecoypermutations
AT sunxiaoming nullfreefalsediscoveryratecontrolusingdecoypermutations