Cargando…
Genetic disease risks can be misestimated across global populations
BACKGROUND: Accurate assessment of health disparities requires unbiased knowledge of genetic risks in different populations. Unfortunately, most genome-wide association studies use genotyping arrays and European samples. Here, we integrate whole genome sequence data from global populations, results...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6234640/ https://www.ncbi.nlm.nih.gov/pubmed/30424772 http://dx.doi.org/10.1186/s13059-018-1561-7 |
_version_ | 1783370737344577536 |
---|---|
author | Kim, Michelle S. Patel, Kane P. Teng, Andrew K. Berens, Ali J. Lachance, Joseph |
author_facet | Kim, Michelle S. Patel, Kane P. Teng, Andrew K. Berens, Ali J. Lachance, Joseph |
author_sort | Kim, Michelle S. |
collection | PubMed |
description | BACKGROUND: Accurate assessment of health disparities requires unbiased knowledge of genetic risks in different populations. Unfortunately, most genome-wide association studies use genotyping arrays and European samples. Here, we integrate whole genome sequence data from global populations, results from thousands of genome-wide association studies (GWAS), and extensive computer simulations to identify how genetic disease risks can be misestimated. RESULTS: In contrast to null expectations, we find that risk allele frequencies at known disease loci are significantly different for African populations compared to other continents. Strikingly, ancestral risk alleles are found at 9.51% higher frequency in Africa, and derived risk alleles are found at 5.40% lower frequency in Africa. By simulating GWAS with different study populations, we find that non-African cohorts yield disease associations that have biased allele frequencies and that African cohorts yield disease associations that are relatively free of bias. We also find empirical evidence that genotyping arrays and SNP ascertainment bias contribute to continental differences in risk allele frequencies. Because of these causes, polygenic risk scores can be grossly misestimated for individuals of African descent. Importantly, continental differences in risk allele frequencies are only moderately reduced if GWAS use whole genome sequences and hundreds of thousands of cases and controls. Finally, comparisons between uncorrected and corrected genetic risk scores reveal the benefits of considering whether risk alleles are ancestral or derived. CONCLUSIONS: Our results imply that caution must be taken when extrapolating GWAS results from one population to predict disease risks in another population. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13059-018-1561-7) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6234640 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-62346402018-11-23 Genetic disease risks can be misestimated across global populations Kim, Michelle S. Patel, Kane P. Teng, Andrew K. Berens, Ali J. Lachance, Joseph Genome Biol Research BACKGROUND: Accurate assessment of health disparities requires unbiased knowledge of genetic risks in different populations. Unfortunately, most genome-wide association studies use genotyping arrays and European samples. Here, we integrate whole genome sequence data from global populations, results from thousands of genome-wide association studies (GWAS), and extensive computer simulations to identify how genetic disease risks can be misestimated. RESULTS: In contrast to null expectations, we find that risk allele frequencies at known disease loci are significantly different for African populations compared to other continents. Strikingly, ancestral risk alleles are found at 9.51% higher frequency in Africa, and derived risk alleles are found at 5.40% lower frequency in Africa. By simulating GWAS with different study populations, we find that non-African cohorts yield disease associations that have biased allele frequencies and that African cohorts yield disease associations that are relatively free of bias. We also find empirical evidence that genotyping arrays and SNP ascertainment bias contribute to continental differences in risk allele frequencies. Because of these causes, polygenic risk scores can be grossly misestimated for individuals of African descent. Importantly, continental differences in risk allele frequencies are only moderately reduced if GWAS use whole genome sequences and hundreds of thousands of cases and controls. Finally, comparisons between uncorrected and corrected genetic risk scores reveal the benefits of considering whether risk alleles are ancestral or derived. CONCLUSIONS: Our results imply that caution must be taken when extrapolating GWAS results from one population to predict disease risks in another population. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13059-018-1561-7) contains supplementary material, which is available to authorized users. BioMed Central 2018-11-14 /pmc/articles/PMC6234640/ /pubmed/30424772 http://dx.doi.org/10.1186/s13059-018-1561-7 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Kim, Michelle S. Patel, Kane P. Teng, Andrew K. Berens, Ali J. Lachance, Joseph Genetic disease risks can be misestimated across global populations |
title | Genetic disease risks can be misestimated across global populations |
title_full | Genetic disease risks can be misestimated across global populations |
title_fullStr | Genetic disease risks can be misestimated across global populations |
title_full_unstemmed | Genetic disease risks can be misestimated across global populations |
title_short | Genetic disease risks can be misestimated across global populations |
title_sort | genetic disease risks can be misestimated across global populations |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6234640/ https://www.ncbi.nlm.nih.gov/pubmed/30424772 http://dx.doi.org/10.1186/s13059-018-1561-7 |
work_keys_str_mv | AT kimmichelles geneticdiseaseriskscanbemisestimatedacrossglobalpopulations AT patelkanep geneticdiseaseriskscanbemisestimatedacrossglobalpopulations AT tengandrewk geneticdiseaseriskscanbemisestimatedacrossglobalpopulations AT berensalij geneticdiseaseriskscanbemisestimatedacrossglobalpopulations AT lachancejoseph geneticdiseaseriskscanbemisestimatedacrossglobalpopulations |