Cargando…

False discovery rate control in two-stage designs

BACKGROUND: For gene expression or gene association studies with a large number of hypotheses the number of measurements per marker in a conventional single-stage design is often low due to limited resources. Two-stage designs have been proposed where in a first stage promising hypotheses are identi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zehetmayer, Sonja, Posch, Martin
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2012
Materias:	Methodology Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3496575/ https://www.ncbi.nlm.nih.gov/pubmed/22559038 http://dx.doi.org/10.1186/1471-2105-13-81

_version_	1782249640196833280
author	Zehetmayer, Sonja Posch, Martin
author_facet	Zehetmayer, Sonja Posch, Martin
author_sort	Zehetmayer, Sonja
collection	PubMed
description	BACKGROUND: For gene expression or gene association studies with a large number of hypotheses the number of measurements per marker in a conventional single-stage design is often low due to limited resources. Two-stage designs have been proposed where in a first stage promising hypotheses are identified and further investigated in the second stage with larger sample sizes. For two types of two-stage designs proposed in the literature we derive multiple testing procedures controlling the False Discovery Rate (FDR) demonstrating FDR control by simulations: designs where a fixed number of top-ranked hypotheses are selected and designs where the selection in the interim analysis is based on an FDR threshold. In contrast to earlier approaches which use only the second-stage data in the hypothesis tests (pilot approach), the proposed testing procedures are based on the pooled data from both stages (integrated approach). RESULTS: For both selection rules the multiple testing procedures control the FDR in the considered simulation scenarios. This holds for the case of independent observations across hypotheses as well as for certain correlation structures. Additionally, we show that in scenarios with small effect sizes the testing procedures based on the pooled data from both stages can give a considerable improvement in power compared to tests based on the second-stage data only. CONCLUSION: The proposed hypothesis tests provide a tool for FDR control for the considered two-stage designs. Comparing the integrated approaches for both selection rules with the corresponding pilot approaches showed an advantage of the integrated approach in many simulation scenarios.
format	Online Article Text
id	pubmed-3496575
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-34965752012-11-19 False discovery rate control in two-stage designs Zehetmayer, Sonja Posch, Martin BMC Bioinformatics Methodology Article BACKGROUND: For gene expression or gene association studies with a large number of hypotheses the number of measurements per marker in a conventional single-stage design is often low due to limited resources. Two-stage designs have been proposed where in a first stage promising hypotheses are identified and further investigated in the second stage with larger sample sizes. For two types of two-stage designs proposed in the literature we derive multiple testing procedures controlling the False Discovery Rate (FDR) demonstrating FDR control by simulations: designs where a fixed number of top-ranked hypotheses are selected and designs where the selection in the interim analysis is based on an FDR threshold. In contrast to earlier approaches which use only the second-stage data in the hypothesis tests (pilot approach), the proposed testing procedures are based on the pooled data from both stages (integrated approach). RESULTS: For both selection rules the multiple testing procedures control the FDR in the considered simulation scenarios. This holds for the case of independent observations across hypotheses as well as for certain correlation structures. Additionally, we show that in scenarios with small effect sizes the testing procedures based on the pooled data from both stages can give a considerable improvement in power compared to tests based on the second-stage data only. CONCLUSION: The proposed hypothesis tests provide a tool for FDR control for the considered two-stage designs. Comparing the integrated approaches for both selection rules with the corresponding pilot approaches showed an advantage of the integrated approach in many simulation scenarios. BioMed Central 2012-05-06 /pmc/articles/PMC3496575/ /pubmed/22559038 http://dx.doi.org/10.1186/1471-2105-13-81 Text en Copyright ©2012 Zehetmayer and Posch; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Methodology Article Zehetmayer, Sonja Posch, Martin False discovery rate control in two-stage designs
title	False discovery rate control in two-stage designs
title_full	False discovery rate control in two-stage designs
title_fullStr	False discovery rate control in two-stage designs
title_full_unstemmed	False discovery rate control in two-stage designs
title_short	False discovery rate control in two-stage designs
title_sort	false discovery rate control in two-stage designs
topic	Methodology Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3496575/ https://www.ncbi.nlm.nih.gov/pubmed/22559038 http://dx.doi.org/10.1186/1471-2105-13-81
work_keys_str_mv	AT zehetmayersonja falsediscoveryratecontrolintwostagedesigns AT poschmartin falsediscoveryratecontrolintwostagedesigns

False discovery rate control in two-stage designs

Ejemplares similares