Inflated false discovery rate due to volcano plots: problem and solutions
MOTIVATION: Volcano plots are used to select the most interesting discoveries when too many discoveries remain after application of Benjamini–Hochberg’s procedure (BH). The volcano plot suggests a double filtering procedure that selects features with both small adjusted p-value and...
Main authors: | Ebrahimpoor, Mitra; Goeman, Jelle J |
---|---|
Format: | Online Article Text |
Language: | English |
Published: |
Oxford University Press
2021
|
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8425469/ https://www.ncbi.nlm.nih.gov/pubmed/33758907 http://dx.doi.org/10.1093/bib/bbab053 |
_version_ | 1783749852286418944 |
---|---|
author | Ebrahimpoor, Mitra Goeman, Jelle J |
author_sort | Ebrahimpoor, Mitra |
collection | PubMed |
description | MOTIVATION: Volcano plots are used to select the most interesting discoveries when too many discoveries remain after application of Benjamini–Hochberg’s procedure (BH). The volcano plot suggests a double filtering procedure that selects features with both small adjusted p-value and large estimated effect size. Despite its popularity, this type of selection overlooks the fact that BH does not guarantee error control over filtered subsets of discoveries. Therefore the selected subset of features may include an inflated number of false discoveries. RESULTS: In this paper, we illustrate the substantially inflated type I error rate of volcano plot selection with simulation experiments and RNA-seq data. In particular, we show that the feature with the largest estimated effect is a very likely false positive result. Next, we investigate two alternative approaches for multiple testing with double filtering that do not inflate the false discovery rate. Our procedure is implemented in an interactive web application and is publicly available. |
format | Online Article Text |
id | pubmed-8425469 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-8425469 2021-09-09 Inflated false discovery rate due to volcano plots: problem and solutions Ebrahimpoor, Mitra Goeman, Jelle J Brief Bioinform Method Review MOTIVATION: Volcano plots are used to select the most interesting discoveries when too many discoveries remain after application of Benjamini–Hochberg’s procedure (BH). The volcano plot suggests a double filtering procedure that selects features with both small adjusted p-value and large estimated effect size. Despite its popularity, this type of selection overlooks the fact that BH does not guarantee error control over filtered subsets of discoveries. Therefore the selected subset of features may include an inflated number of false discoveries. RESULTS: In this paper, we illustrate the substantially inflated type I error rate of volcano plot selection with simulation experiments and RNA-seq data. In particular, we show that the feature with the largest estimated effect is a very likely false positive result. Next, we investigate two alternative approaches for multiple testing with double filtering that do not inflate the false discovery rate. Our procedure is implemented in an interactive web application and is publicly available. Oxford University Press 2021-03-24 /pmc/articles/PMC8425469/ /pubmed/33758907 http://dx.doi.org/10.1093/bib/bbab053 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Method Review Ebrahimpoor, Mitra Goeman, Jelle J Inflated false discovery rate due to volcano plots: problem and solutions |
title | Inflated false discovery rate due to volcano plots: problem and solutions |
topic | Method Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8425469/ https://www.ncbi.nlm.nih.gov/pubmed/33758907 http://dx.doi.org/10.1093/bib/bbab053 |
work_keys_str_mv | AT ebrahimpoormitra inflatedfalsediscoveryrateduetovolcanoplotsproblemandsolutions AT goemanjellej inflatedfalsediscoveryrateduetovolcanoplotsproblemandsolutions |
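
The double filtering described in the abstract (keep features with a small BH-adjusted p-value and a large estimated effect) can be sketched as a small simulation. This is a minimal illustration under stated assumptions, not the authors' simulation design or their web application: the function names `bh_adjust` and `simulate`, all parameter values, the known-variance z-test, and the log-normal spread of standard deviations are hypothetical choices made only for this example.

```python
import math
import random


def bh_adjust(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def simulate(m=2000, frac_true=0.1, delta=1.0, n=5, alpha=0.05,
             effect_cutoff=1.5, seed=1):
    """Compare the false discovery proportion of BH alone with that of
    BH followed by an effect-size filter (volcano-style selection).

    All defaults are illustrative assumptions, not values from the paper.
    """
    rng = random.Random(seed)
    is_null, effects, pvalues = [], [], []
    for _ in range(m):
        null = rng.random() >= frac_true
        # Assumption: per-feature standard deviation is known and varies.
        sigma = math.exp(rng.gauss(0.0, 0.5))
        se = sigma * math.sqrt(2.0 / n)  # standard error of a mean difference
        estimate = rng.gauss(0.0 if null else delta, se)
        z = estimate / se
        p = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided normal p-value
        is_null.append(null)
        effects.append(estimate)
        pvalues.append(p)

    adjusted = bh_adjust(pvalues)
    bh_set = [i for i in range(m) if adjusted[i] <= alpha]
    # Second filter: keep only BH discoveries with a large estimated effect.
    volcano_set = [i for i in bh_set if abs(effects[i]) >= effect_cutoff]

    def fdp(selected):
        if not selected:
            return 0.0
        return sum(is_null[i] for i in selected) / len(selected)

    return {
        "bh_selected": len(bh_set),
        "bh_fdp": fdp(bh_set),
        "volcano_selected": len(volcano_set),
        "volcano_fdp": fdp(volcano_set),
    }


if __name__ == "__main__":
    print(simulate())
```

Averaging `bh_fdp` and `volcano_fdp` over many seeds gives rough estimates of the corresponding false discovery rates. Whether, and by how much, the filtered subset exceeds the nominal level depends on the assumed effect sizes, variances, and cutoff, so the numbers from this sketch should not be read as reproducing the paper's results.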