Cargando…

Genetic architecture of gene expression traits across diverse populations

For many complex traits, gene regulation is likely to play a crucial mechanistic role. How the genetic architectures of complex traits vary between populations and subsequent effects on genetic prediction are not well understood, in part due to the historical paucity of GWAS in populations of non-Eu...

Descripción completa

Detalles Bibliográficos
Autores principales: Mogil, Lauren S., Andaleon, Angela, Badalamenti, Alexa, Dickinson, Scott P., Guo, Xiuqing, Rotter, Jerome I., Johnson, W. Craig, Im, Hae Kyung, Liu, Yongmei, Wheeler, Heather E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6105030/
https://www.ncbi.nlm.nih.gov/pubmed/30096133
http://dx.doi.org/10.1371/journal.pgen.1007586
_version_ 1783349593990234112
author Mogil, Lauren S.
Andaleon, Angela
Badalamenti, Alexa
Dickinson, Scott P.
Guo, Xiuqing
Rotter, Jerome I.
Johnson, W. Craig
Im, Hae Kyung
Liu, Yongmei
Wheeler, Heather E.
author_facet Mogil, Lauren S.
Andaleon, Angela
Badalamenti, Alexa
Dickinson, Scott P.
Guo, Xiuqing
Rotter, Jerome I.
Johnson, W. Craig
Im, Hae Kyung
Liu, Yongmei
Wheeler, Heather E.
author_sort Mogil, Lauren S.
collection PubMed
description For many complex traits, gene regulation is likely to play a crucial mechanistic role. How the genetic architectures of complex traits vary between populations and subsequent effects on genetic prediction are not well understood, in part due to the historical paucity of GWAS in populations of non-European ancestry. We used data from the MESA (Multi-Ethnic Study of Atherosclerosis) cohort to characterize the genetic architecture of gene expression within and between diverse populations. Genotype and monocyte gene expression were available in individuals with African American (AFA, n = 233), Hispanic (HIS, n = 352), and European (CAU, n = 578) ancestry. We performed expression quantitative trait loci (eQTL) mapping in each population and show genetic correlation of gene expression depends on shared ancestry proportions. Using elastic net modeling with cross validation to optimize genotypic predictors of gene expression in each population, we show the genetic architecture of gene expression for most predictable genes is sparse. We found the best predicted gene in each population, TACSTD2 in AFA and CHURC1 in CAU and HIS, had similar prediction performance across populations with R(2) > 0.8 in each population. However, we identified a subset of genes that are well-predicted in one population, but poorly predicted in another. We show these differences in predictive performance are due to allele frequency differences between populations. Using genotype weights trained in MESA to predict gene expression in independent populations showed that a training set with ancestry similar to the test set is better at predicting gene expression in test populations, demonstrating an urgent need for diverse population sampling in genomics. Our predictive models and performance statistics in diverse cohorts are made publicly available for use in transcriptome mapping methods at https://github.com/WheelerLab/DivPop.
format Online
Article
Text
id pubmed-6105030
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-61050302018-08-30 Genetic architecture of gene expression traits across diverse populations Mogil, Lauren S. Andaleon, Angela Badalamenti, Alexa Dickinson, Scott P. Guo, Xiuqing Rotter, Jerome I. Johnson, W. Craig Im, Hae Kyung Liu, Yongmei Wheeler, Heather E. PLoS Genet Research Article For many complex traits, gene regulation is likely to play a crucial mechanistic role. How the genetic architectures of complex traits vary between populations and subsequent effects on genetic prediction are not well understood, in part due to the historical paucity of GWAS in populations of non-European ancestry. We used data from the MESA (Multi-Ethnic Study of Atherosclerosis) cohort to characterize the genetic architecture of gene expression within and between diverse populations. Genotype and monocyte gene expression were available in individuals with African American (AFA, n = 233), Hispanic (HIS, n = 352), and European (CAU, n = 578) ancestry. We performed expression quantitative trait loci (eQTL) mapping in each population and show genetic correlation of gene expression depends on shared ancestry proportions. Using elastic net modeling with cross validation to optimize genotypic predictors of gene expression in each population, we show the genetic architecture of gene expression for most predictable genes is sparse. We found the best predicted gene in each population, TACSTD2 in AFA and CHURC1 in CAU and HIS, had similar prediction performance across populations with R(2) > 0.8 in each population. However, we identified a subset of genes that are well-predicted in one population, but poorly predicted in another. We show these differences in predictive performance are due to allele frequency differences between populations. Using genotype weights trained in MESA to predict gene expression in independent populations showed that a training set with ancestry similar to the test set is better at predicting gene expression in test populations, demonstrating an urgent need for diverse population sampling in genomics. Our predictive models and performance statistics in diverse cohorts are made publicly available for use in transcriptome mapping methods at https://github.com/WheelerLab/DivPop. Public Library of Science 2018-08-10 /pmc/articles/PMC6105030/ /pubmed/30096133 http://dx.doi.org/10.1371/journal.pgen.1007586 Text en © 2018 Mogil et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Mogil, Lauren S.
Andaleon, Angela
Badalamenti, Alexa
Dickinson, Scott P.
Guo, Xiuqing
Rotter, Jerome I.
Johnson, W. Craig
Im, Hae Kyung
Liu, Yongmei
Wheeler, Heather E.
Genetic architecture of gene expression traits across diverse populations
title Genetic architecture of gene expression traits across diverse populations
title_full Genetic architecture of gene expression traits across diverse populations
title_fullStr Genetic architecture of gene expression traits across diverse populations
title_full_unstemmed Genetic architecture of gene expression traits across diverse populations
title_short Genetic architecture of gene expression traits across diverse populations
title_sort genetic architecture of gene expression traits across diverse populations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6105030/
https://www.ncbi.nlm.nih.gov/pubmed/30096133
http://dx.doi.org/10.1371/journal.pgen.1007586
work_keys_str_mv AT mogillaurens geneticarchitectureofgeneexpressiontraitsacrossdiversepopulations
AT andaleonangela geneticarchitectureofgeneexpressiontraitsacrossdiversepopulations
AT badalamentialexa geneticarchitectureofgeneexpressiontraitsacrossdiversepopulations
AT dickinsonscottp geneticarchitectureofgeneexpressiontraitsacrossdiversepopulations
AT guoxiuqing geneticarchitectureofgeneexpressiontraitsacrossdiversepopulations
AT rotterjeromei geneticarchitectureofgeneexpressiontraitsacrossdiversepopulations
AT johnsonwcraig geneticarchitectureofgeneexpressiontraitsacrossdiversepopulations
AT imhaekyung geneticarchitectureofgeneexpressiontraitsacrossdiversepopulations
AT liuyongmei geneticarchitectureofgeneexpressiontraitsacrossdiversepopulations
AT wheelerheathere geneticarchitectureofgeneexpressiontraitsacrossdiversepopulations