Cargando…

A robust and efficient statistical method for genetic association studies using case and control samples from multiple cohorts

BACKGROUND: The theoretical basis of genome-wide association studies (GWAS) is statistical inference of linkage disequilibrium (LD) between any polymorphic marker and a putative disease locus. Most methods widely implemented for such analyses are vulnerable to several key demographic factors and del...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Minghui, Wang, Lin, Jiang, Ning, Jia, Tianye, Luo, Zewei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3626840/
https://www.ncbi.nlm.nih.gov/pubmed/23394771
http://dx.doi.org/10.1186/1471-2164-14-88
_version_ 1782266253510967296
author Wang, Minghui
Wang, Lin
Jiang, Ning
Jia, Tianye
Luo, Zewei
author_facet Wang, Minghui
Wang, Lin
Jiang, Ning
Jia, Tianye
Luo, Zewei
author_sort Wang, Minghui
collection PubMed
description BACKGROUND: The theoretical basis of genome-wide association studies (GWAS) is statistical inference of linkage disequilibrium (LD) between any polymorphic marker and a putative disease locus. Most methods widely implemented for such analyses are vulnerable to several key demographic factors and deliver a poor statistical power for detecting genuine associations and also a high false positive rate. Here, we present a likelihood-based statistical approach that accounts properly for non-random nature of case–control samples in regard of genotypic distribution at the loci in populations under study and confers flexibility to test for genetic association in presence of different confounding factors such as population structure, non-randomness of samples etc. RESULTS: We implemented this novel method together with several popular methods in the literature of GWAS, to re-analyze recently published Parkinson’s disease (PD) case–control samples. The real data analysis and computer simulation show that the new method confers not only significantly improved statistical power for detecting the associations but also robustness to the difficulties stemmed from non-randomly sampling and genetic structures when compared to its rivals. In particular, the new method detected 44 significant SNPs within 25 chromosomal regions of size < 1 Mb but only 6 SNPs in two of these regions were previously detected by the trend test based methods. It discovered two SNPs located 1.18 Mb and 0.18 Mb from the PD candidates, FGF20 and PARK8, without invoking false positive risk. CONCLUSIONS: We developed a novel likelihood-based method which provides adequate estimation of LD and other population model parameters by using case and control samples, the ease in integration of these samples from multiple genetically divergent populations and thus confers statistically robust and powerful analyses of GWAS. On basis of simulation studies and analysis of real datasets, we demonstrated significant improvement of the new method over the non-parametric trend test, which is the most popularly implemented in the literature of GWAS.
format Online
Article
Text
id pubmed-3626840
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36268402013-04-24 A robust and efficient statistical method for genetic association studies using case and control samples from multiple cohorts Wang, Minghui Wang, Lin Jiang, Ning Jia, Tianye Luo, Zewei BMC Genomics Methodology Article BACKGROUND: The theoretical basis of genome-wide association studies (GWAS) is statistical inference of linkage disequilibrium (LD) between any polymorphic marker and a putative disease locus. Most methods widely implemented for such analyses are vulnerable to several key demographic factors and deliver a poor statistical power for detecting genuine associations and also a high false positive rate. Here, we present a likelihood-based statistical approach that accounts properly for non-random nature of case–control samples in regard of genotypic distribution at the loci in populations under study and confers flexibility to test for genetic association in presence of different confounding factors such as population structure, non-randomness of samples etc. RESULTS: We implemented this novel method together with several popular methods in the literature of GWAS, to re-analyze recently published Parkinson’s disease (PD) case–control samples. The real data analysis and computer simulation show that the new method confers not only significantly improved statistical power for detecting the associations but also robustness to the difficulties stemmed from non-randomly sampling and genetic structures when compared to its rivals. In particular, the new method detected 44 significant SNPs within 25 chromosomal regions of size < 1 Mb but only 6 SNPs in two of these regions were previously detected by the trend test based methods. It discovered two SNPs located 1.18 Mb and 0.18 Mb from the PD candidates, FGF20 and PARK8, without invoking false positive risk. CONCLUSIONS: We developed a novel likelihood-based method which provides adequate estimation of LD and other population model parameters by using case and control samples, the ease in integration of these samples from multiple genetically divergent populations and thus confers statistically robust and powerful analyses of GWAS. On basis of simulation studies and analysis of real datasets, we demonstrated significant improvement of the new method over the non-parametric trend test, which is the most popularly implemented in the literature of GWAS. BioMed Central 2013-02-08 /pmc/articles/PMC3626840/ /pubmed/23394771 http://dx.doi.org/10.1186/1471-2164-14-88 Text en Copyright © 2013 Wang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Wang, Minghui
Wang, Lin
Jiang, Ning
Jia, Tianye
Luo, Zewei
A robust and efficient statistical method for genetic association studies using case and control samples from multiple cohorts
title A robust and efficient statistical method for genetic association studies using case and control samples from multiple cohorts
title_full A robust and efficient statistical method for genetic association studies using case and control samples from multiple cohorts
title_fullStr A robust and efficient statistical method for genetic association studies using case and control samples from multiple cohorts
title_full_unstemmed A robust and efficient statistical method for genetic association studies using case and control samples from multiple cohorts
title_short A robust and efficient statistical method for genetic association studies using case and control samples from multiple cohorts
title_sort robust and efficient statistical method for genetic association studies using case and control samples from multiple cohorts
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3626840/
https://www.ncbi.nlm.nih.gov/pubmed/23394771
http://dx.doi.org/10.1186/1471-2164-14-88
work_keys_str_mv AT wangminghui arobustandefficientstatisticalmethodforgeneticassociationstudiesusingcaseandcontrolsamplesfrommultiplecohorts
AT wanglin arobustandefficientstatisticalmethodforgeneticassociationstudiesusingcaseandcontrolsamplesfrommultiplecohorts
AT jiangning arobustandefficientstatisticalmethodforgeneticassociationstudiesusingcaseandcontrolsamplesfrommultiplecohorts
AT jiatianye arobustandefficientstatisticalmethodforgeneticassociationstudiesusingcaseandcontrolsamplesfrommultiplecohorts
AT luozewei arobustandefficientstatisticalmethodforgeneticassociationstudiesusingcaseandcontrolsamplesfrommultiplecohorts
AT wangminghui robustandefficientstatisticalmethodforgeneticassociationstudiesusingcaseandcontrolsamplesfrommultiplecohorts
AT wanglin robustandefficientstatisticalmethodforgeneticassociationstudiesusingcaseandcontrolsamplesfrommultiplecohorts
AT jiangning robustandefficientstatisticalmethodforgeneticassociationstudiesusingcaseandcontrolsamplesfrommultiplecohorts
AT jiatianye robustandefficientstatisticalmethodforgeneticassociationstudiesusingcaseandcontrolsamplesfrommultiplecohorts
AT luozewei robustandefficientstatisticalmethodforgeneticassociationstudiesusingcaseandcontrolsamplesfrommultiplecohorts