Cargando…

Magnitude of Stratification in Human Populations and Impacts on Genome Wide Association Studies

Genome-wide association studies (GWAS) may be biased by population stratification (PS). We conducted empirical quantification of the magnitude of PS among human populations and its impact on GWAS. Liver tissues were collected from 979, 59 and 49 Caucasian Americans (CA), African Americans (AA) and H...

Descripción completa

Detalles Bibliográficos
Autores principales: Hao, Ke, Chudin, Eugene, Greenawalt, Danielle, Schadt, Eric E.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2805717/
https://www.ncbi.nlm.nih.gov/pubmed/20084173
http://dx.doi.org/10.1371/journal.pone.0008695
_version_ 1782176225014317056
author Hao, Ke
Chudin, Eugene
Greenawalt, Danielle
Schadt, Eric E.
author_facet Hao, Ke
Chudin, Eugene
Greenawalt, Danielle
Schadt, Eric E.
author_sort Hao, Ke
collection PubMed
description Genome-wide association studies (GWAS) may be biased by population stratification (PS). We conducted empirical quantification of the magnitude of PS among human populations and its impact on GWAS. Liver tissues were collected from 979, 59 and 49 Caucasian Americans (CA), African Americans (AA) and Hispanic Americans (HA), respectively, and genotyped using Illumina650Y (Ilmn650Y) arrays. RNA was also isolated and hybridized to Agilent whole-genome gene expression arrays. We propose a new method (i.e., hgdp-eigen) for detecting PS by projecting genotype vectors for each sample to the eigenvector space defined by the Human Genetic Diversity Panel (HGDP). Further, we conducted GWAS to map expression quantitative trait loci (eQTL) for the ∼40,000 liver gene expression traits monitored by the Agilent arrays. HGDP-eigen performed similarly to the conventional self-eigen methods in capturing PS. However, leveraging the HGDP offered a significant advantage in revealing the origins, directions and magnitude of PS. Adjusting for eigenvectors had minor impacts on eQTL detection rates in CA. In contrast, for AA and HA, adjustment dramatically reduced association findings. At an FDR = 10%, we identified 65 eQTLs in AA with the unadjusted analysis, but only 18 eQTLs after the eigenvector adjustment. Strikingly, 55 out of the 65 unadjusted AA eQTLs were validated in CA, indicating that the adjustment procedure significantly reduced GWAS power. A number of the 55 AA eQTLs validated in CA overlapped with published disease associated SNPs. For example, rs646776 and rs10903129 have previously been associated with lipid levels and coronary heart disease risk, however, the rs10903129 eQTL was missed in the eigenvector adjusted analysis.
format Text
id pubmed-2805717
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-28057172010-01-16 Magnitude of Stratification in Human Populations and Impacts on Genome Wide Association Studies Hao, Ke Chudin, Eugene Greenawalt, Danielle Schadt, Eric E. PLoS One Research Article Genome-wide association studies (GWAS) may be biased by population stratification (PS). We conducted empirical quantification of the magnitude of PS among human populations and its impact on GWAS. Liver tissues were collected from 979, 59 and 49 Caucasian Americans (CA), African Americans (AA) and Hispanic Americans (HA), respectively, and genotyped using Illumina650Y (Ilmn650Y) arrays. RNA was also isolated and hybridized to Agilent whole-genome gene expression arrays. We propose a new method (i.e., hgdp-eigen) for detecting PS by projecting genotype vectors for each sample to the eigenvector space defined by the Human Genetic Diversity Panel (HGDP). Further, we conducted GWAS to map expression quantitative trait loci (eQTL) for the ∼40,000 liver gene expression traits monitored by the Agilent arrays. HGDP-eigen performed similarly to the conventional self-eigen methods in capturing PS. However, leveraging the HGDP offered a significant advantage in revealing the origins, directions and magnitude of PS. Adjusting for eigenvectors had minor impacts on eQTL detection rates in CA. In contrast, for AA and HA, adjustment dramatically reduced association findings. At an FDR = 10%, we identified 65 eQTLs in AA with the unadjusted analysis, but only 18 eQTLs after the eigenvector adjustment. Strikingly, 55 out of the 65 unadjusted AA eQTLs were validated in CA, indicating that the adjustment procedure significantly reduced GWAS power. A number of the 55 AA eQTLs validated in CA overlapped with published disease associated SNPs. For example, rs646776 and rs10903129 have previously been associated with lipid levels and coronary heart disease risk, however, the rs10903129 eQTL was missed in the eigenvector adjusted analysis. Public Library of Science 2010-01-13 /pmc/articles/PMC2805717/ /pubmed/20084173 http://dx.doi.org/10.1371/journal.pone.0008695 Text en Hao et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Hao, Ke
Chudin, Eugene
Greenawalt, Danielle
Schadt, Eric E.
Magnitude of Stratification in Human Populations and Impacts on Genome Wide Association Studies
title Magnitude of Stratification in Human Populations and Impacts on Genome Wide Association Studies
title_full Magnitude of Stratification in Human Populations and Impacts on Genome Wide Association Studies
title_fullStr Magnitude of Stratification in Human Populations and Impacts on Genome Wide Association Studies
title_full_unstemmed Magnitude of Stratification in Human Populations and Impacts on Genome Wide Association Studies
title_short Magnitude of Stratification in Human Populations and Impacts on Genome Wide Association Studies
title_sort magnitude of stratification in human populations and impacts on genome wide association studies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2805717/
https://www.ncbi.nlm.nih.gov/pubmed/20084173
http://dx.doi.org/10.1371/journal.pone.0008695
work_keys_str_mv AT haoke magnitudeofstratificationinhumanpopulationsandimpactsongenomewideassociationstudies
AT chudineugene magnitudeofstratificationinhumanpopulationsandimpactsongenomewideassociationstudies
AT greenawaltdanielle magnitudeofstratificationinhumanpopulationsandimpactsongenomewideassociationstudies
AT schadterice magnitudeofstratificationinhumanpopulationsandimpactsongenomewideassociationstudies