Cargando…

Genetic disease risks can be misestimated across global populations

BACKGROUND: Accurate assessment of health disparities requires unbiased knowledge of genetic risks in different populations. Unfortunately, most genome-wide association studies use genotyping arrays and European samples. Here, we integrate whole genome sequence data from global populations, results...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Michelle S., Patel, Kane P., Teng, Andrew K., Berens, Ali J., Lachance, Joseph
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6234640/
https://www.ncbi.nlm.nih.gov/pubmed/30424772
http://dx.doi.org/10.1186/s13059-018-1561-7
_version_ 1783370737344577536
author Kim, Michelle S.
Patel, Kane P.
Teng, Andrew K.
Berens, Ali J.
Lachance, Joseph
author_facet Kim, Michelle S.
Patel, Kane P.
Teng, Andrew K.
Berens, Ali J.
Lachance, Joseph
author_sort Kim, Michelle S.
collection PubMed
description BACKGROUND: Accurate assessment of health disparities requires unbiased knowledge of genetic risks in different populations. Unfortunately, most genome-wide association studies use genotyping arrays and European samples. Here, we integrate whole genome sequence data from global populations, results from thousands of genome-wide association studies (GWAS), and extensive computer simulations to identify how genetic disease risks can be misestimated. RESULTS: In contrast to null expectations, we find that risk allele frequencies at known disease loci are significantly different for African populations compared to other continents. Strikingly, ancestral risk alleles are found at 9.51% higher frequency in Africa, and derived risk alleles are found at 5.40% lower frequency in Africa. By simulating GWAS with different study populations, we find that non-African cohorts yield disease associations that have biased allele frequencies and that African cohorts yield disease associations that are relatively free of bias. We also find empirical evidence that genotyping arrays and SNP ascertainment bias contribute to continental differences in risk allele frequencies. Because of these causes, polygenic risk scores can be grossly misestimated for individuals of African descent. Importantly, continental differences in risk allele frequencies are only moderately reduced if GWAS use whole genome sequences and hundreds of thousands of cases and controls. Finally, comparisons between uncorrected and corrected genetic risk scores reveal the benefits of considering whether risk alleles are ancestral or derived. CONCLUSIONS: Our results imply that caution must be taken when extrapolating GWAS results from one population to predict disease risks in another population. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13059-018-1561-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6234640
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-62346402018-11-23 Genetic disease risks can be misestimated across global populations Kim, Michelle S. Patel, Kane P. Teng, Andrew K. Berens, Ali J. Lachance, Joseph Genome Biol Research BACKGROUND: Accurate assessment of health disparities requires unbiased knowledge of genetic risks in different populations. Unfortunately, most genome-wide association studies use genotyping arrays and European samples. Here, we integrate whole genome sequence data from global populations, results from thousands of genome-wide association studies (GWAS), and extensive computer simulations to identify how genetic disease risks can be misestimated. RESULTS: In contrast to null expectations, we find that risk allele frequencies at known disease loci are significantly different for African populations compared to other continents. Strikingly, ancestral risk alleles are found at 9.51% higher frequency in Africa, and derived risk alleles are found at 5.40% lower frequency in Africa. By simulating GWAS with different study populations, we find that non-African cohorts yield disease associations that have biased allele frequencies and that African cohorts yield disease associations that are relatively free of bias. We also find empirical evidence that genotyping arrays and SNP ascertainment bias contribute to continental differences in risk allele frequencies. Because of these causes, polygenic risk scores can be grossly misestimated for individuals of African descent. Importantly, continental differences in risk allele frequencies are only moderately reduced if GWAS use whole genome sequences and hundreds of thousands of cases and controls. Finally, comparisons between uncorrected and corrected genetic risk scores reveal the benefits of considering whether risk alleles are ancestral or derived. CONCLUSIONS: Our results imply that caution must be taken when extrapolating GWAS results from one population to predict disease risks in another population. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13059-018-1561-7) contains supplementary material, which is available to authorized users. BioMed Central 2018-11-14 /pmc/articles/PMC6234640/ /pubmed/30424772 http://dx.doi.org/10.1186/s13059-018-1561-7 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Kim, Michelle S.
Patel, Kane P.
Teng, Andrew K.
Berens, Ali J.
Lachance, Joseph
Genetic disease risks can be misestimated across global populations
title Genetic disease risks can be misestimated across global populations
title_full Genetic disease risks can be misestimated across global populations
title_fullStr Genetic disease risks can be misestimated across global populations
title_full_unstemmed Genetic disease risks can be misestimated across global populations
title_short Genetic disease risks can be misestimated across global populations
title_sort genetic disease risks can be misestimated across global populations
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6234640/
https://www.ncbi.nlm.nih.gov/pubmed/30424772
http://dx.doi.org/10.1186/s13059-018-1561-7
work_keys_str_mv AT kimmichelles geneticdiseaseriskscanbemisestimatedacrossglobalpopulations
AT patelkanep geneticdiseaseriskscanbemisestimatedacrossglobalpopulations
AT tengandrewk geneticdiseaseriskscanbemisestimatedacrossglobalpopulations
AT berensalij geneticdiseaseriskscanbemisestimatedacrossglobalpopulations
AT lachancejoseph geneticdiseaseriskscanbemisestimatedacrossglobalpopulations