Cargando…
Effect of population stratification analysis on false-positive rates for common and rare variants
Principal components analysis (PCA) has been successfully used to correct for population stratification in genome-wide association studies of common variants. However, rare variants also have a role in common disease etiology. Whether PCA successfully controls population stratification for rare vari...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287840/ https://www.ncbi.nlm.nih.gov/pubmed/22373282 http://dx.doi.org/10.1186/1753-6561-5-S9-S116 |
_version_ | 1782224755615596544 |
---|---|
author | He, Hua Zhang, Xue Ding, Lili Baye, Tesfaye M Kurowski, Brad G Martin, Lisa J |
author_facet | He, Hua Zhang, Xue Ding, Lili Baye, Tesfaye M Kurowski, Brad G Martin, Lisa J |
author_sort | He, Hua |
collection | PubMed |
description | Principal components analysis (PCA) has been successfully used to correct for population stratification in genome-wide association studies of common variants. However, rare variants also have a role in common disease etiology. Whether PCA successfully controls population stratification for rare variants has not been addressed. Thus we evaluate the effect of population stratification analysis on false-positive rates for common and rare variants at the single-nucleotide polymorphism (SNP) and gene level. We use the simulation data from Genetic Analysis Workshop 17 and compare false-positive rates with and without PCA at the SNP and gene level. We found that SNPs’ minor allele frequency (MAF) influenced the ability of PCA to effectively control false discovery. Specifically, PCA reduced false-positive rates more effectively in common SNPs (MAF > 0.05) than in rare SNPs (MAF < 0.01). Furthermore, at the gene level, although false-positive rates were reduced, power to detect true associations was also reduced using PCA. Taken together, these results suggest that sequence-level data should be interpreted with caution, because extremely rare SNPs may exhibit sporadic association that is not controlled using PCA. |
format | Online Article Text |
id | pubmed-3287840 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32878402012-02-28 Effect of population stratification analysis on false-positive rates for common and rare variants He, Hua Zhang, Xue Ding, Lili Baye, Tesfaye M Kurowski, Brad G Martin, Lisa J BMC Proc Proceedings Principal components analysis (PCA) has been successfully used to correct for population stratification in genome-wide association studies of common variants. However, rare variants also have a role in common disease etiology. Whether PCA successfully controls population stratification for rare variants has not been addressed. Thus we evaluate the effect of population stratification analysis on false-positive rates for common and rare variants at the single-nucleotide polymorphism (SNP) and gene level. We use the simulation data from Genetic Analysis Workshop 17 and compare false-positive rates with and without PCA at the SNP and gene level. We found that SNPs’ minor allele frequency (MAF) influenced the ability of PCA to effectively control false discovery. Specifically, PCA reduced false-positive rates more effectively in common SNPs (MAF > 0.05) than in rare SNPs (MAF < 0.01). Furthermore, at the gene level, although false-positive rates were reduced, power to detect true associations was also reduced using PCA. Taken together, these results suggest that sequence-level data should be interpreted with caution, because extremely rare SNPs may exhibit sporadic association that is not controlled using PCA. BioMed Central 2011-11-29 /pmc/articles/PMC3287840/ /pubmed/22373282 http://dx.doi.org/10.1186/1753-6561-5-S9-S116 Text en Copyright ©2011 He et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings He, Hua Zhang, Xue Ding, Lili Baye, Tesfaye M Kurowski, Brad G Martin, Lisa J Effect of population stratification analysis on false-positive rates for common and rare variants |
title | Effect of population stratification analysis on false-positive rates for common and rare variants |
title_full | Effect of population stratification analysis on false-positive rates for common and rare variants |
title_fullStr | Effect of population stratification analysis on false-positive rates for common and rare variants |
title_full_unstemmed | Effect of population stratification analysis on false-positive rates for common and rare variants |
title_short | Effect of population stratification analysis on false-positive rates for common and rare variants |
title_sort | effect of population stratification analysis on false-positive rates for common and rare variants |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287840/ https://www.ncbi.nlm.nih.gov/pubmed/22373282 http://dx.doi.org/10.1186/1753-6561-5-S9-S116 |
work_keys_str_mv | AT hehua effectofpopulationstratificationanalysisonfalsepositiveratesforcommonandrarevariants AT zhangxue effectofpopulationstratificationanalysisonfalsepositiveratesforcommonandrarevariants AT dinglili effectofpopulationstratificationanalysisonfalsepositiveratesforcommonandrarevariants AT bayetesfayem effectofpopulationstratificationanalysisonfalsepositiveratesforcommonandrarevariants AT kurowskibradg effectofpopulationstratificationanalysisonfalsepositiveratesforcommonandrarevariants AT martinlisaj effectofpopulationstratificationanalysisonfalsepositiveratesforcommonandrarevariants |