Cargando…

Is population structure in the genetic biobank era irrelevant, a challenge, or an opportunity?

Replicable genetic association signals have consistently been found through genome-wide association studies in recent years. The recent dramatic expansion of study sizes improves power of estimation of effect sizes, genomic prediction, causal inference, and polygenic selection, but it simultaneously...

Descripción completa

Detalles Bibliográficos
Autores principales: Lawson, Daniel John, Davies, Neil Martin, Haworth, Simon, Ashraf, Bilal, Howe, Laurence, Crawford, Andrew, Hemani, Gibran, Davey Smith, George, Timpson, Nicholas John
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6942007/
https://www.ncbi.nlm.nih.gov/pubmed/31030318
http://dx.doi.org/10.1007/s00439-019-02014-8
_version_ 1783484635475345408
author Lawson, Daniel John
Davies, Neil Martin
Haworth, Simon
Ashraf, Bilal
Howe, Laurence
Crawford, Andrew
Hemani, Gibran
Davey Smith, George
Timpson, Nicholas John
author_facet Lawson, Daniel John
Davies, Neil Martin
Haworth, Simon
Ashraf, Bilal
Howe, Laurence
Crawford, Andrew
Hemani, Gibran
Davey Smith, George
Timpson, Nicholas John
author_sort Lawson, Daniel John
collection PubMed
description Replicable genetic association signals have consistently been found through genome-wide association studies in recent years. The recent dramatic expansion of study sizes improves power of estimation of effect sizes, genomic prediction, causal inference, and polygenic selection, but it simultaneously increases susceptibility of these methods to bias due to subtle population structure. Standard methods using genetic principal components to correct for structure might not always be appropriate and we use a simulation study to illustrate when correction might be ineffective for avoiding biases. New methods such as trans-ethnic modeling and chromosome painting allow for a richer understanding of the relationship between traits and population structure. We illustrate the arguments using real examples (stroke and educational attainment) and provide a more nuanced understanding of population structure, which is set to be revisited as a critical aspect of future analyses in genetic epidemiology. We also make simple recommendations for how problems can be avoided in the future. Our results have particular importance for the implementation of GWAS meta-analysis, for prediction of traits, and for causal inference.
format Online
Article
Text
id pubmed-6942007
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-69420072020-01-16 Is population structure in the genetic biobank era irrelevant, a challenge, or an opportunity? Lawson, Daniel John Davies, Neil Martin Haworth, Simon Ashraf, Bilal Howe, Laurence Crawford, Andrew Hemani, Gibran Davey Smith, George Timpson, Nicholas John Hum Genet Review Replicable genetic association signals have consistently been found through genome-wide association studies in recent years. The recent dramatic expansion of study sizes improves power of estimation of effect sizes, genomic prediction, causal inference, and polygenic selection, but it simultaneously increases susceptibility of these methods to bias due to subtle population structure. Standard methods using genetic principal components to correct for structure might not always be appropriate and we use a simulation study to illustrate when correction might be ineffective for avoiding biases. New methods such as trans-ethnic modeling and chromosome painting allow for a richer understanding of the relationship between traits and population structure. We illustrate the arguments using real examples (stroke and educational attainment) and provide a more nuanced understanding of population structure, which is set to be revisited as a critical aspect of future analyses in genetic epidemiology. We also make simple recommendations for how problems can be avoided in the future. Our results have particular importance for the implementation of GWAS meta-analysis, for prediction of traits, and for causal inference. Springer Berlin Heidelberg 2019-04-27 2020 /pmc/articles/PMC6942007/ /pubmed/31030318 http://dx.doi.org/10.1007/s00439-019-02014-8 Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Review
Lawson, Daniel John
Davies, Neil Martin
Haworth, Simon
Ashraf, Bilal
Howe, Laurence
Crawford, Andrew
Hemani, Gibran
Davey Smith, George
Timpson, Nicholas John
Is population structure in the genetic biobank era irrelevant, a challenge, or an opportunity?
title Is population structure in the genetic biobank era irrelevant, a challenge, or an opportunity?
title_full Is population structure in the genetic biobank era irrelevant, a challenge, or an opportunity?
title_fullStr Is population structure in the genetic biobank era irrelevant, a challenge, or an opportunity?
title_full_unstemmed Is population structure in the genetic biobank era irrelevant, a challenge, or an opportunity?
title_short Is population structure in the genetic biobank era irrelevant, a challenge, or an opportunity?
title_sort is population structure in the genetic biobank era irrelevant, a challenge, or an opportunity?
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6942007/
https://www.ncbi.nlm.nih.gov/pubmed/31030318
http://dx.doi.org/10.1007/s00439-019-02014-8
work_keys_str_mv AT lawsondanieljohn ispopulationstructureinthegeneticbiobankerairrelevantachallengeoranopportunity
AT daviesneilmartin ispopulationstructureinthegeneticbiobankerairrelevantachallengeoranopportunity
AT haworthsimon ispopulationstructureinthegeneticbiobankerairrelevantachallengeoranopportunity
AT ashrafbilal ispopulationstructureinthegeneticbiobankerairrelevantachallengeoranopportunity
AT howelaurence ispopulationstructureinthegeneticbiobankerairrelevantachallengeoranopportunity
AT crawfordandrew ispopulationstructureinthegeneticbiobankerairrelevantachallengeoranopportunity
AT hemanigibran ispopulationstructureinthegeneticbiobankerairrelevantachallengeoranopportunity
AT daveysmithgeorge ispopulationstructureinthegeneticbiobankerairrelevantachallengeoranopportunity
AT timpsonnicholasjohn ispopulationstructureinthegeneticbiobankerairrelevantachallengeoranopportunity