Cargando…

Many Phenotypes Without Many False Discoveries: Error Controlling Strategies for Multitrait Association Studies

The genetic basis of multiple phenotypes such as gene expression, metabolite levels, or imaging features is often investigated by testing a large collection of hypotheses, probing the existence of association between each of the traits and hundreds of thousands of genotyped variants. Appropriate mul...

Descripción completa

Detalles Bibliográficos
Autores principales: Peterson, Christine B., Bogomolov, Marina, Benjamini, Yoav, Sabatti, Chiara
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4738479/
https://www.ncbi.nlm.nih.gov/pubmed/26626037
http://dx.doi.org/10.1002/gepi.21942
_version_ 1782413620081065984
author Peterson, Christine B.
Bogomolov, Marina
Benjamini, Yoav
Sabatti, Chiara
author_facet Peterson, Christine B.
Bogomolov, Marina
Benjamini, Yoav
Sabatti, Chiara
author_sort Peterson, Christine B.
collection PubMed
description The genetic basis of multiple phenotypes such as gene expression, metabolite levels, or imaging features is often investigated by testing a large collection of hypotheses, probing the existence of association between each of the traits and hundreds of thousands of genotyped variants. Appropriate multiplicity adjustment is crucial to guarantee replicability of findings, and the false discovery rate (FDR) is frequently adopted as a measure of global error. In the interest of interpretability, results are often summarized so that reporting focuses on variants discovered to be associated to some phenotypes. We show that applying FDR‐controlling procedures on the entire collection of hypotheses fails to control the rate of false discovery of associated variants as well as the expected value of the average proportion of false discovery of phenotypes influenced by such variants. We propose a simple hierarchical testing procedure that allows control of both these error rates and provides a more reliable basis for the identification of variants with functional effects. We demonstrate the utility of this approach through simulation studies comparing various error rates and measures of power for genetic association studies of multiple traits. Finally, we apply the proposed method to identify genetic variants that impact flowering phenotypes in Arabidopsis thaliana, expanding the set of discoveries.
format Online
Article
Text
id pubmed-4738479
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-47384792016-02-12 Many Phenotypes Without Many False Discoveries: Error Controlling Strategies for Multitrait Association Studies Peterson, Christine B. Bogomolov, Marina Benjamini, Yoav Sabatti, Chiara Genet Epidemiol Research Articles The genetic basis of multiple phenotypes such as gene expression, metabolite levels, or imaging features is often investigated by testing a large collection of hypotheses, probing the existence of association between each of the traits and hundreds of thousands of genotyped variants. Appropriate multiplicity adjustment is crucial to guarantee replicability of findings, and the false discovery rate (FDR) is frequently adopted as a measure of global error. In the interest of interpretability, results are often summarized so that reporting focuses on variants discovered to be associated to some phenotypes. We show that applying FDR‐controlling procedures on the entire collection of hypotheses fails to control the rate of false discovery of associated variants as well as the expected value of the average proportion of false discovery of phenotypes influenced by such variants. We propose a simple hierarchical testing procedure that allows control of both these error rates and provides a more reliable basis for the identification of variants with functional effects. We demonstrate the utility of this approach through simulation studies comparing various error rates and measures of power for genetic association studies of multiple traits. Finally, we apply the proposed method to identify genetic variants that impact flowering phenotypes in Arabidopsis thaliana, expanding the set of discoveries. John Wiley and Sons Inc. 2015-12-02 2016-01 /pmc/articles/PMC4738479/ /pubmed/26626037 http://dx.doi.org/10.1002/gepi.21942 Text en © 2015 The Authors. *Genetic Epidemiologypublished by Wiley Periodicals, Inc. This is an open access article under the terms of the Creative Commons Attribution‐NonCommercial‐NoDerivs (http://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made.
spellingShingle Research Articles
Peterson, Christine B.
Bogomolov, Marina
Benjamini, Yoav
Sabatti, Chiara
Many Phenotypes Without Many False Discoveries: Error Controlling Strategies for Multitrait Association Studies
title Many Phenotypes Without Many False Discoveries: Error Controlling Strategies for Multitrait Association Studies
title_full Many Phenotypes Without Many False Discoveries: Error Controlling Strategies for Multitrait Association Studies
title_fullStr Many Phenotypes Without Many False Discoveries: Error Controlling Strategies for Multitrait Association Studies
title_full_unstemmed Many Phenotypes Without Many False Discoveries: Error Controlling Strategies for Multitrait Association Studies
title_short Many Phenotypes Without Many False Discoveries: Error Controlling Strategies for Multitrait Association Studies
title_sort many phenotypes without many false discoveries: error controlling strategies for multitrait association studies
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4738479/
https://www.ncbi.nlm.nih.gov/pubmed/26626037
http://dx.doi.org/10.1002/gepi.21942
work_keys_str_mv AT petersonchristineb manyphenotypeswithoutmanyfalsediscoverieserrorcontrollingstrategiesformultitraitassociationstudies
AT bogomolovmarina manyphenotypeswithoutmanyfalsediscoverieserrorcontrollingstrategiesformultitraitassociationstudies
AT benjaminiyoav manyphenotypeswithoutmanyfalsediscoverieserrorcontrollingstrategiesformultitraitassociationstudies
AT sabattichiara manyphenotypeswithoutmanyfalsediscoverieserrorcontrollingstrategiesformultitraitassociationstudies