Cargando…

Effect of population stratification analysis on false-positive rates for common and rare variants

Principal components analysis (PCA) has been successfully used to correct for population stratification in genome-wide association studies of common variants. However, rare variants also have a role in common disease etiology. Whether PCA successfully controls population stratification for rare vari...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Hua, Zhang, Xue, Ding, Lili, Baye, Tesfaye M, Kurowski, Brad G, Martin, Lisa J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287840/
https://www.ncbi.nlm.nih.gov/pubmed/22373282
http://dx.doi.org/10.1186/1753-6561-5-S9-S116
_version_ 1782224755615596544
author He, Hua
Zhang, Xue
Ding, Lili
Baye, Tesfaye M
Kurowski, Brad G
Martin, Lisa J
author_facet He, Hua
Zhang, Xue
Ding, Lili
Baye, Tesfaye M
Kurowski, Brad G
Martin, Lisa J
author_sort He, Hua
collection PubMed
description Principal components analysis (PCA) has been successfully used to correct for population stratification in genome-wide association studies of common variants. However, rare variants also have a role in common disease etiology. Whether PCA successfully controls population stratification for rare variants has not been addressed. Thus we evaluate the effect of population stratification analysis on false-positive rates for common and rare variants at the single-nucleotide polymorphism (SNP) and gene level. We use the simulation data from Genetic Analysis Workshop 17 and compare false-positive rates with and without PCA at the SNP and gene level. We found that SNPs’ minor allele frequency (MAF) influenced the ability of PCA to effectively control false discovery. Specifically, PCA reduced false-positive rates more effectively in common SNPs (MAF > 0.05) than in rare SNPs (MAF < 0.01). Furthermore, at the gene level, although false-positive rates were reduced, power to detect true associations was also reduced using PCA. Taken together, these results suggest that sequence-level data should be interpreted with caution, because extremely rare SNPs may exhibit sporadic association that is not controlled using PCA.
format Online
Article
Text
id pubmed-3287840
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32878402012-02-28 Effect of population stratification analysis on false-positive rates for common and rare variants He, Hua Zhang, Xue Ding, Lili Baye, Tesfaye M Kurowski, Brad G Martin, Lisa J BMC Proc Proceedings Principal components analysis (PCA) has been successfully used to correct for population stratification in genome-wide association studies of common variants. However, rare variants also have a role in common disease etiology. Whether PCA successfully controls population stratification for rare variants has not been addressed. Thus we evaluate the effect of population stratification analysis on false-positive rates for common and rare variants at the single-nucleotide polymorphism (SNP) and gene level. We use the simulation data from Genetic Analysis Workshop 17 and compare false-positive rates with and without PCA at the SNP and gene level. We found that SNPs’ minor allele frequency (MAF) influenced the ability of PCA to effectively control false discovery. Specifically, PCA reduced false-positive rates more effectively in common SNPs (MAF > 0.05) than in rare SNPs (MAF < 0.01). Furthermore, at the gene level, although false-positive rates were reduced, power to detect true associations was also reduced using PCA. Taken together, these results suggest that sequence-level data should be interpreted with caution, because extremely rare SNPs may exhibit sporadic association that is not controlled using PCA. BioMed Central 2011-11-29 /pmc/articles/PMC3287840/ /pubmed/22373282 http://dx.doi.org/10.1186/1753-6561-5-S9-S116 Text en Copyright ©2011 He et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
He, Hua
Zhang, Xue
Ding, Lili
Baye, Tesfaye M
Kurowski, Brad G
Martin, Lisa J
Effect of population stratification analysis on false-positive rates for common and rare variants
title Effect of population stratification analysis on false-positive rates for common and rare variants
title_full Effect of population stratification analysis on false-positive rates for common and rare variants
title_fullStr Effect of population stratification analysis on false-positive rates for common and rare variants
title_full_unstemmed Effect of population stratification analysis on false-positive rates for common and rare variants
title_short Effect of population stratification analysis on false-positive rates for common and rare variants
title_sort effect of population stratification analysis on false-positive rates for common and rare variants
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287840/
https://www.ncbi.nlm.nih.gov/pubmed/22373282
http://dx.doi.org/10.1186/1753-6561-5-S9-S116
work_keys_str_mv AT hehua effectofpopulationstratificationanalysisonfalsepositiveratesforcommonandrarevariants
AT zhangxue effectofpopulationstratificationanalysisonfalsepositiveratesforcommonandrarevariants
AT dinglili effectofpopulationstratificationanalysisonfalsepositiveratesforcommonandrarevariants
AT bayetesfayem effectofpopulationstratificationanalysisonfalsepositiveratesforcommonandrarevariants
AT kurowskibradg effectofpopulationstratificationanalysisonfalsepositiveratesforcommonandrarevariants
AT martinlisaj effectofpopulationstratificationanalysisonfalsepositiveratesforcommonandrarevariants