Cargando…

Inflated false discovery rate due to volcano plots: problem and solutions

MOTIVATION: Volcano plots are used to select the most interesting discoveries when too many discoveries remain after application of Benjamini–Hochberg’s procedure (BH). The volcano plot suggests a double filtering procedure that selects features with both small adjusted [Formula: see text]-value and...

Descripción completa

Detalles Bibliográficos
Autores principales: Ebrahimpoor, Mitra, Goeman, Jelle J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8425469/
https://www.ncbi.nlm.nih.gov/pubmed/33758907
http://dx.doi.org/10.1093/bib/bbab053
_version_ 1783749852286418944
author Ebrahimpoor, Mitra
Goeman, Jelle J
author_facet Ebrahimpoor, Mitra
Goeman, Jelle J
author_sort Ebrahimpoor, Mitra
collection PubMed
description MOTIVATION: Volcano plots are used to select the most interesting discoveries when too many discoveries remain after application of Benjamini–Hochberg’s procedure (BH). The volcano plot suggests a double filtering procedure that selects features with both small adjusted [Formula: see text]-value and large estimated effect size. Despite its popularity, this type of selection overlooks the fact that BH does not guarantee error control over filtered subsets of discoveries. Therefore the selected subset of features may include an inflated number of false discoveries. RESULTS: In this paper, we illustrate the substantially inflated type I error rate of volcano plot selection with simulation experiments and RNA-seq data. In particular, we show that the feature with the largest estimated effect is a very likely false positive result. Next, we investigate two alternative approaches for multiple testing with double filtering that do not inflate the false discovery rate. Our procedure is implemented in an interactive web application and is publicly available.
format Online
Article
Text
id pubmed-8425469
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-84254692021-09-09 Inflated false discovery rate due to volcano plots: problem and solutions Ebrahimpoor, Mitra Goeman, Jelle J Brief Bioinform Method Review MOTIVATION: Volcano plots are used to select the most interesting discoveries when too many discoveries remain after application of Benjamini–Hochberg’s procedure (BH). The volcano plot suggests a double filtering procedure that selects features with both small adjusted [Formula: see text]-value and large estimated effect size. Despite its popularity, this type of selection overlooks the fact that BH does not guarantee error control over filtered subsets of discoveries. Therefore the selected subset of features may include an inflated number of false discoveries. RESULTS: In this paper, we illustrate the substantially inflated type I error rate of volcano plot selection with simulation experiments and RNA-seq data. In particular, we show that the feature with the largest estimated effect is a very likely false positive result. Next, we investigate two alternative approaches for multiple testing with double filtering that do not inflate the false discovery rate. Our procedure is implemented in an interactive web application and is publicly available. Oxford University Press 2021-03-24 /pmc/articles/PMC8425469/ /pubmed/33758907 http://dx.doi.org/10.1093/bib/bbab053 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Method Review
Ebrahimpoor, Mitra
Goeman, Jelle J
Inflated false discovery rate due to volcano plots: problem and solutions
title Inflated false discovery rate due to volcano plots: problem and solutions
title_full Inflated false discovery rate due to volcano plots: problem and solutions
title_fullStr Inflated false discovery rate due to volcano plots: problem and solutions
title_full_unstemmed Inflated false discovery rate due to volcano plots: problem and solutions
title_short Inflated false discovery rate due to volcano plots: problem and solutions
title_sort inflated false discovery rate due to volcano plots: problem and solutions
topic Method Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8425469/
https://www.ncbi.nlm.nih.gov/pubmed/33758907
http://dx.doi.org/10.1093/bib/bbab053
work_keys_str_mv AT ebrahimpoormitra inflatedfalsediscoveryrateduetovolcanoplotsproblemandsolutions
AT goemanjellej inflatedfalsediscoveryrateduetovolcanoplotsproblemandsolutions