Cargando…

False discovery rate control in two-stage designs

BACKGROUND: For gene expression or gene association studies with a large number of hypotheses the number of measurements per marker in a conventional single-stage design is often low due to limited resources. Two-stage designs have been proposed where in a first stage promising hypotheses are identi...

Descripción completa

Detalles Bibliográficos
Autores principales: Zehetmayer, Sonja, Posch, Martin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3496575/
https://www.ncbi.nlm.nih.gov/pubmed/22559038
http://dx.doi.org/10.1186/1471-2105-13-81
_version_ 1782249640196833280
author Zehetmayer, Sonja
Posch, Martin
author_facet Zehetmayer, Sonja
Posch, Martin
author_sort Zehetmayer, Sonja
collection PubMed
description BACKGROUND: For gene expression or gene association studies with a large number of hypotheses the number of measurements per marker in a conventional single-stage design is often low due to limited resources. Two-stage designs have been proposed where in a first stage promising hypotheses are identified and further investigated in the second stage with larger sample sizes. For two types of two-stage designs proposed in the literature we derive multiple testing procedures controlling the False Discovery Rate (FDR) demonstrating FDR control by simulations: designs where a fixed number of top-ranked hypotheses are selected and designs where the selection in the interim analysis is based on an FDR threshold. In contrast to earlier approaches which use only the second-stage data in the hypothesis tests (pilot approach), the proposed testing procedures are based on the pooled data from both stages (integrated approach). RESULTS: For both selection rules the multiple testing procedures control the FDR in the considered simulation scenarios. This holds for the case of independent observations across hypotheses as well as for certain correlation structures. Additionally, we show that in scenarios with small effect sizes the testing procedures based on the pooled data from both stages can give a considerable improvement in power compared to tests based on the second-stage data only. CONCLUSION: The proposed hypothesis tests provide a tool for FDR control for the considered two-stage designs. Comparing the integrated approaches for both selection rules with the corresponding pilot approaches showed an advantage of the integrated approach in many simulation scenarios.
format Online
Article
Text
id pubmed-3496575
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34965752012-11-19 False discovery rate control in two-stage designs Zehetmayer, Sonja Posch, Martin BMC Bioinformatics Methodology Article BACKGROUND: For gene expression or gene association studies with a large number of hypotheses the number of measurements per marker in a conventional single-stage design is often low due to limited resources. Two-stage designs have been proposed where in a first stage promising hypotheses are identified and further investigated in the second stage with larger sample sizes. For two types of two-stage designs proposed in the literature we derive multiple testing procedures controlling the False Discovery Rate (FDR) demonstrating FDR control by simulations: designs where a fixed number of top-ranked hypotheses are selected and designs where the selection in the interim analysis is based on an FDR threshold. In contrast to earlier approaches which use only the second-stage data in the hypothesis tests (pilot approach), the proposed testing procedures are based on the pooled data from both stages (integrated approach). RESULTS: For both selection rules the multiple testing procedures control the FDR in the considered simulation scenarios. This holds for the case of independent observations across hypotheses as well as for certain correlation structures. Additionally, we show that in scenarios with small effect sizes the testing procedures based on the pooled data from both stages can give a considerable improvement in power compared to tests based on the second-stage data only. CONCLUSION: The proposed hypothesis tests provide a tool for FDR control for the considered two-stage designs. Comparing the integrated approaches for both selection rules with the corresponding pilot approaches showed an advantage of the integrated approach in many simulation scenarios. BioMed Central 2012-05-06 /pmc/articles/PMC3496575/ /pubmed/22559038 http://dx.doi.org/10.1186/1471-2105-13-81 Text en Copyright ©2012 Zehetmayer and Posch; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Zehetmayer, Sonja
Posch, Martin
False discovery rate control in two-stage designs
title False discovery rate control in two-stage designs
title_full False discovery rate control in two-stage designs
title_fullStr False discovery rate control in two-stage designs
title_full_unstemmed False discovery rate control in two-stage designs
title_short False discovery rate control in two-stage designs
title_sort false discovery rate control in two-stage designs
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3496575/
https://www.ncbi.nlm.nih.gov/pubmed/22559038
http://dx.doi.org/10.1186/1471-2105-13-81
work_keys_str_mv AT zehetmayersonja falsediscoveryratecontrolintwostagedesigns
AT poschmartin falsediscoveryratecontrolintwostagedesigns