Cargando…

Estimating the number and size of the main effects in genome-wide case-control association studies

It has recently become possible to screen thousands of markers to detect genetic causes of common diseases. Along with this potential comes analytical challenges, and it is important to develop new statistical tools to identify markers with causal effects and accurately estimate their effect sizes....

Descripción completa

Detalles Bibliográficos
Autores principales: Kuo, Po-Hsiu, Bukszár, József, van den Oord, Edwin JCG
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367536/
https://www.ncbi.nlm.nih.gov/pubmed/18466487
_version_ 1782154315285135360
author Kuo, Po-Hsiu
Bukszár, József
van den Oord, Edwin JCG
author_facet Kuo, Po-Hsiu
Bukszár, József
van den Oord, Edwin JCG
author_sort Kuo, Po-Hsiu
collection PubMed
description It has recently become possible to screen thousands of markers to detect genetic causes of common diseases. Along with this potential comes analytical challenges, and it is important to develop new statistical tools to identify markers with causal effects and accurately estimate their effect sizes. Knowledge of the proportion of markers without true effects (p(0)) and the effect sizes of markers with effects provides information to control for false discoveries and to design follow-up studies. We apply newly developed methods to simulated Genetic Analysis Workshop 15 genome-wide case-control data sets, including a maximum likelihood (ML) and a quasi-ML (QML) approach that incorporate the test statistic distribution and estimates effect size simultaneously with p(0), and two conservative estimators of p(0 )that do not rely on the test statistic distribution under the alternative. Compared with four existing commonly used estimators for p(0), our results illustrated that all of our estimators have favorable properties in terms of the standard deviation with which p(0 )is estimated. On average, the ML method performed slightly better than the QML method; the conservative method performed well and was even slightly more precise than the ML estimators, and can be more robust in less optimal conditions (small sample sizes and small number of markers). Further improvements and extensions of the proposed methods are conceivable, such as estimating the distribution of effect sizes and taking population stratification into account when obtain estimates of p(0 )and effect size.
format Text
id pubmed-2367536
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-23675362008-05-06 Estimating the number and size of the main effects in genome-wide case-control association studies Kuo, Po-Hsiu Bukszár, József van den Oord, Edwin JCG BMC Proc Proceedings It has recently become possible to screen thousands of markers to detect genetic causes of common diseases. Along with this potential comes analytical challenges, and it is important to develop new statistical tools to identify markers with causal effects and accurately estimate their effect sizes. Knowledge of the proportion of markers without true effects (p(0)) and the effect sizes of markers with effects provides information to control for false discoveries and to design follow-up studies. We apply newly developed methods to simulated Genetic Analysis Workshop 15 genome-wide case-control data sets, including a maximum likelihood (ML) and a quasi-ML (QML) approach that incorporate the test statistic distribution and estimates effect size simultaneously with p(0), and two conservative estimators of p(0 )that do not rely on the test statistic distribution under the alternative. Compared with four existing commonly used estimators for p(0), our results illustrated that all of our estimators have favorable properties in terms of the standard deviation with which p(0 )is estimated. On average, the ML method performed slightly better than the QML method; the conservative method performed well and was even slightly more precise than the ML estimators, and can be more robust in less optimal conditions (small sample sizes and small number of markers). Further improvements and extensions of the proposed methods are conceivable, such as estimating the distribution of effect sizes and taking population stratification into account when obtain estimates of p(0 )and effect size. BioMed Central 2007-12-18 /pmc/articles/PMC2367536/ /pubmed/18466487 Text en Copyright © 2007 Kuo et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Kuo, Po-Hsiu
Bukszár, József
van den Oord, Edwin JCG
Estimating the number and size of the main effects in genome-wide case-control association studies
title Estimating the number and size of the main effects in genome-wide case-control association studies
title_full Estimating the number and size of the main effects in genome-wide case-control association studies
title_fullStr Estimating the number and size of the main effects in genome-wide case-control association studies
title_full_unstemmed Estimating the number and size of the main effects in genome-wide case-control association studies
title_short Estimating the number and size of the main effects in genome-wide case-control association studies
title_sort estimating the number and size of the main effects in genome-wide case-control association studies
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367536/
https://www.ncbi.nlm.nih.gov/pubmed/18466487
work_keys_str_mv AT kuopohsiu estimatingthenumberandsizeofthemaineffectsingenomewidecasecontrolassociationstudies
AT bukszarjozsef estimatingthenumberandsizeofthemaineffectsingenomewidecasecontrolassociationstudies
AT vandenoordedwinjcg estimatingthenumberandsizeofthemaineffectsingenomewidecasecontrolassociationstudies