Cargando…
False discovery rate control in two-stage designs
BACKGROUND: For gene expression or gene association studies with a large number of hypotheses the number of measurements per marker in a conventional single-stage design is often low due to limited resources. Two-stage designs have been proposed where in a first stage promising hypotheses are identi...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3496575/ https://www.ncbi.nlm.nih.gov/pubmed/22559038 http://dx.doi.org/10.1186/1471-2105-13-81 |
_version_ | 1782249640196833280 |
---|---|
author | Zehetmayer, Sonja Posch, Martin |
author_facet | Zehetmayer, Sonja Posch, Martin |
author_sort | Zehetmayer, Sonja |
collection | PubMed |
description | BACKGROUND: For gene expression or gene association studies with a large number of hypotheses the number of measurements per marker in a conventional single-stage design is often low due to limited resources. Two-stage designs have been proposed where in a first stage promising hypotheses are identified and further investigated in the second stage with larger sample sizes. For two types of two-stage designs proposed in the literature we derive multiple testing procedures controlling the False Discovery Rate (FDR) demonstrating FDR control by simulations: designs where a fixed number of top-ranked hypotheses are selected and designs where the selection in the interim analysis is based on an FDR threshold. In contrast to earlier approaches which use only the second-stage data in the hypothesis tests (pilot approach), the proposed testing procedures are based on the pooled data from both stages (integrated approach). RESULTS: For both selection rules the multiple testing procedures control the FDR in the considered simulation scenarios. This holds for the case of independent observations across hypotheses as well as for certain correlation structures. Additionally, we show that in scenarios with small effect sizes the testing procedures based on the pooled data from both stages can give a considerable improvement in power compared to tests based on the second-stage data only. CONCLUSION: The proposed hypothesis tests provide a tool for FDR control for the considered two-stage designs. Comparing the integrated approaches for both selection rules with the corresponding pilot approaches showed an advantage of the integrated approach in many simulation scenarios. |
format | Online Article Text |
id | pubmed-3496575 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-34965752012-11-19 False discovery rate control in two-stage designs Zehetmayer, Sonja Posch, Martin BMC Bioinformatics Methodology Article BACKGROUND: For gene expression or gene association studies with a large number of hypotheses the number of measurements per marker in a conventional single-stage design is often low due to limited resources. Two-stage designs have been proposed where in a first stage promising hypotheses are identified and further investigated in the second stage with larger sample sizes. For two types of two-stage designs proposed in the literature we derive multiple testing procedures controlling the False Discovery Rate (FDR) demonstrating FDR control by simulations: designs where a fixed number of top-ranked hypotheses are selected and designs where the selection in the interim analysis is based on an FDR threshold. In contrast to earlier approaches which use only the second-stage data in the hypothesis tests (pilot approach), the proposed testing procedures are based on the pooled data from both stages (integrated approach). RESULTS: For both selection rules the multiple testing procedures control the FDR in the considered simulation scenarios. This holds for the case of independent observations across hypotheses as well as for certain correlation structures. Additionally, we show that in scenarios with small effect sizes the testing procedures based on the pooled data from both stages can give a considerable improvement in power compared to tests based on the second-stage data only. CONCLUSION: The proposed hypothesis tests provide a tool for FDR control for the considered two-stage designs. Comparing the integrated approaches for both selection rules with the corresponding pilot approaches showed an advantage of the integrated approach in many simulation scenarios. BioMed Central 2012-05-06 /pmc/articles/PMC3496575/ /pubmed/22559038 http://dx.doi.org/10.1186/1471-2105-13-81 Text en Copyright ©2012 Zehetmayer and Posch; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Zehetmayer, Sonja Posch, Martin False discovery rate control in two-stage designs |
title | False discovery rate control in two-stage designs |
title_full | False discovery rate control in two-stage designs |
title_fullStr | False discovery rate control in two-stage designs |
title_full_unstemmed | False discovery rate control in two-stage designs |
title_short | False discovery rate control in two-stage designs |
title_sort | false discovery rate control in two-stage designs |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3496575/ https://www.ncbi.nlm.nih.gov/pubmed/22559038 http://dx.doi.org/10.1186/1471-2105-13-81 |
work_keys_str_mv | AT zehetmayersonja falsediscoveryratecontrolintwostagedesigns AT poschmartin falsediscoveryratecontrolintwostagedesigns |