Cargando…
Many Phenotypes Without Many False Discoveries: Error Controlling Strategies for Multitrait Association Studies
The genetic basis of multiple phenotypes such as gene expression, metabolite levels, or imaging features is often investigated by testing a large collection of hypotheses, probing the existence of association between each of the traits and hundreds of thousands of genotyped variants. Appropriate mul...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4738479/ https://www.ncbi.nlm.nih.gov/pubmed/26626037 http://dx.doi.org/10.1002/gepi.21942 |
_version_ | 1782413620081065984 |
---|---|
author | Peterson, Christine B. Bogomolov, Marina Benjamini, Yoav Sabatti, Chiara |
author_facet | Peterson, Christine B. Bogomolov, Marina Benjamini, Yoav Sabatti, Chiara |
author_sort | Peterson, Christine B. |
collection | PubMed |
description | The genetic basis of multiple phenotypes such as gene expression, metabolite levels, or imaging features is often investigated by testing a large collection of hypotheses, probing the existence of association between each of the traits and hundreds of thousands of genotyped variants. Appropriate multiplicity adjustment is crucial to guarantee replicability of findings, and the false discovery rate (FDR) is frequently adopted as a measure of global error. In the interest of interpretability, results are often summarized so that reporting focuses on variants discovered to be associated to some phenotypes. We show that applying FDR‐controlling procedures on the entire collection of hypotheses fails to control the rate of false discovery of associated variants as well as the expected value of the average proportion of false discovery of phenotypes influenced by such variants. We propose a simple hierarchical testing procedure that allows control of both these error rates and provides a more reliable basis for the identification of variants with functional effects. We demonstrate the utility of this approach through simulation studies comparing various error rates and measures of power for genetic association studies of multiple traits. Finally, we apply the proposed method to identify genetic variants that impact flowering phenotypes in Arabidopsis thaliana, expanding the set of discoveries. |
format | Online Article Text |
id | pubmed-4738479 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-47384792016-02-12 Many Phenotypes Without Many False Discoveries: Error Controlling Strategies for Multitrait Association Studies Peterson, Christine B. Bogomolov, Marina Benjamini, Yoav Sabatti, Chiara Genet Epidemiol Research Articles The genetic basis of multiple phenotypes such as gene expression, metabolite levels, or imaging features is often investigated by testing a large collection of hypotheses, probing the existence of association between each of the traits and hundreds of thousands of genotyped variants. Appropriate multiplicity adjustment is crucial to guarantee replicability of findings, and the false discovery rate (FDR) is frequently adopted as a measure of global error. In the interest of interpretability, results are often summarized so that reporting focuses on variants discovered to be associated to some phenotypes. We show that applying FDR‐controlling procedures on the entire collection of hypotheses fails to control the rate of false discovery of associated variants as well as the expected value of the average proportion of false discovery of phenotypes influenced by such variants. We propose a simple hierarchical testing procedure that allows control of both these error rates and provides a more reliable basis for the identification of variants with functional effects. We demonstrate the utility of this approach through simulation studies comparing various error rates and measures of power for genetic association studies of multiple traits. Finally, we apply the proposed method to identify genetic variants that impact flowering phenotypes in Arabidopsis thaliana, expanding the set of discoveries. John Wiley and Sons Inc. 2015-12-02 2016-01 /pmc/articles/PMC4738479/ /pubmed/26626037 http://dx.doi.org/10.1002/gepi.21942 Text en © 2015 The Authors. *Genetic Epidemiologypublished by Wiley Periodicals, Inc. This is an open access article under the terms of the Creative Commons Attribution‐NonCommercial‐NoDerivs (http://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made. |
spellingShingle | Research Articles Peterson, Christine B. Bogomolov, Marina Benjamini, Yoav Sabatti, Chiara Many Phenotypes Without Many False Discoveries: Error Controlling Strategies for Multitrait Association Studies |
title | Many Phenotypes Without Many False Discoveries: Error Controlling Strategies for Multitrait Association Studies |
title_full | Many Phenotypes Without Many False Discoveries: Error Controlling Strategies for Multitrait Association Studies |
title_fullStr | Many Phenotypes Without Many False Discoveries: Error Controlling Strategies for Multitrait Association Studies |
title_full_unstemmed | Many Phenotypes Without Many False Discoveries: Error Controlling Strategies for Multitrait Association Studies |
title_short | Many Phenotypes Without Many False Discoveries: Error Controlling Strategies for Multitrait Association Studies |
title_sort | many phenotypes without many false discoveries: error controlling strategies for multitrait association studies |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4738479/ https://www.ncbi.nlm.nih.gov/pubmed/26626037 http://dx.doi.org/10.1002/gepi.21942 |
work_keys_str_mv | AT petersonchristineb manyphenotypeswithoutmanyfalsediscoverieserrorcontrollingstrategiesformultitraitassociationstudies AT bogomolovmarina manyphenotypeswithoutmanyfalsediscoverieserrorcontrollingstrategiesformultitraitassociationstudies AT benjaminiyoav manyphenotypeswithoutmanyfalsediscoverieserrorcontrollingstrategiesformultitraitassociationstudies AT sabattichiara manyphenotypeswithoutmanyfalsediscoverieserrorcontrollingstrategiesformultitraitassociationstudies |