Cargando…

A framework for research into continental ancestry groups of the UK Biobank

BACKGROUND: The UK Biobank is a large prospective cohort, based in the UK, that has deep phenotypic and genomic data on roughly a half a million individuals. Included in this resource are data on approximately 78,000 individuals with “non-white British ancestry.” While most epidemiology studies have...

Descripción completa

Detalles Bibliográficos
Autores principales: Constantinescu, Andrei-Emil, Mitchell, Ruth E., Zheng, Jie, Bull, Caroline J., Timpson, Nicholas J., Amulic, Borko, Vincent, Emma E., Hughes, David A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8800339/
https://www.ncbi.nlm.nih.gov/pubmed/35093177
http://dx.doi.org/10.1186/s40246-022-00380-5
_version_ 1784642239791628288
author Constantinescu, Andrei-Emil
Mitchell, Ruth E.
Zheng, Jie
Bull, Caroline J.
Timpson, Nicholas J.
Amulic, Borko
Vincent, Emma E.
Hughes, David A.
author_facet Constantinescu, Andrei-Emil
Mitchell, Ruth E.
Zheng, Jie
Bull, Caroline J.
Timpson, Nicholas J.
Amulic, Borko
Vincent, Emma E.
Hughes, David A.
author_sort Constantinescu, Andrei-Emil
collection PubMed
description BACKGROUND: The UK Biobank is a large prospective cohort, based in the UK, that has deep phenotypic and genomic data on roughly a half a million individuals. Included in this resource are data on approximately 78,000 individuals with “non-white British ancestry.” While most epidemiology studies have focused predominantly on populations of European ancestry, there is an opportunity to contribute to the study of health and disease for a broader segment of the population by making use of the UK Biobank’s “non-white British ancestry” samples. Here, we present an empirical description of the continental ancestry and population structure among the individuals in this UK Biobank subset. RESULTS: Reference populations from the 1000 Genomes Project for Africa, Europe, East Asia, and South Asia were used to estimate ancestry for each individual. Those with at least 80% ancestry in one of these four continental ancestry groups were taken forward (N = 62,484). Principal component and K-means clustering analyses were used to identify and characterize population structure within each ancestry group. Of the approximately 78,000 individuals in the UK Biobank that are of “non-white British” ancestry, 50,685, 6653, 2782, and 2364 individuals were associated to the European, African, South Asian, and East Asian continental ancestry groups, respectively. Each continental ancestry group exhibits prominent population structure that is consistent with self-reported country of birth data and geography. CONCLUSIONS: Methods outlined here provide an avenue to leverage UK Biobank’s deeply phenotyped data allowing researchers to maximize its potential in the study of health and disease in individuals of non-white British ancestry. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40246-022-00380-5.
format Online
Article
Text
id pubmed-8800339
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-88003392022-02-02 A framework for research into continental ancestry groups of the UK Biobank Constantinescu, Andrei-Emil Mitchell, Ruth E. Zheng, Jie Bull, Caroline J. Timpson, Nicholas J. Amulic, Borko Vincent, Emma E. Hughes, David A. Hum Genomics Primary Research BACKGROUND: The UK Biobank is a large prospective cohort, based in the UK, that has deep phenotypic and genomic data on roughly a half a million individuals. Included in this resource are data on approximately 78,000 individuals with “non-white British ancestry.” While most epidemiology studies have focused predominantly on populations of European ancestry, there is an opportunity to contribute to the study of health and disease for a broader segment of the population by making use of the UK Biobank’s “non-white British ancestry” samples. Here, we present an empirical description of the continental ancestry and population structure among the individuals in this UK Biobank subset. RESULTS: Reference populations from the 1000 Genomes Project for Africa, Europe, East Asia, and South Asia were used to estimate ancestry for each individual. Those with at least 80% ancestry in one of these four continental ancestry groups were taken forward (N = 62,484). Principal component and K-means clustering analyses were used to identify and characterize population structure within each ancestry group. Of the approximately 78,000 individuals in the UK Biobank that are of “non-white British” ancestry, 50,685, 6653, 2782, and 2364 individuals were associated to the European, African, South Asian, and East Asian continental ancestry groups, respectively. Each continental ancestry group exhibits prominent population structure that is consistent with self-reported country of birth data and geography. CONCLUSIONS: Methods outlined here provide an avenue to leverage UK Biobank’s deeply phenotyped data allowing researchers to maximize its potential in the study of health and disease in individuals of non-white British ancestry. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40246-022-00380-5. BioMed Central 2022-01-29 /pmc/articles/PMC8800339/ /pubmed/35093177 http://dx.doi.org/10.1186/s40246-022-00380-5 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Primary Research
Constantinescu, Andrei-Emil
Mitchell, Ruth E.
Zheng, Jie
Bull, Caroline J.
Timpson, Nicholas J.
Amulic, Borko
Vincent, Emma E.
Hughes, David A.
A framework for research into continental ancestry groups of the UK Biobank
title A framework for research into continental ancestry groups of the UK Biobank
title_full A framework for research into continental ancestry groups of the UK Biobank
title_fullStr A framework for research into continental ancestry groups of the UK Biobank
title_full_unstemmed A framework for research into continental ancestry groups of the UK Biobank
title_short A framework for research into continental ancestry groups of the UK Biobank
title_sort framework for research into continental ancestry groups of the uk biobank
topic Primary Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8800339/
https://www.ncbi.nlm.nih.gov/pubmed/35093177
http://dx.doi.org/10.1186/s40246-022-00380-5
work_keys_str_mv AT constantinescuandreiemil aframeworkforresearchintocontinentalancestrygroupsoftheukbiobank
AT mitchellruthe aframeworkforresearchintocontinentalancestrygroupsoftheukbiobank
AT zhengjie aframeworkforresearchintocontinentalancestrygroupsoftheukbiobank
AT bullcarolinej aframeworkforresearchintocontinentalancestrygroupsoftheukbiobank
AT timpsonnicholasj aframeworkforresearchintocontinentalancestrygroupsoftheukbiobank
AT amulicborko aframeworkforresearchintocontinentalancestrygroupsoftheukbiobank
AT vincentemmae aframeworkforresearchintocontinentalancestrygroupsoftheukbiobank
AT hughesdavida aframeworkforresearchintocontinentalancestrygroupsoftheukbiobank
AT constantinescuandreiemil frameworkforresearchintocontinentalancestrygroupsoftheukbiobank
AT mitchellruthe frameworkforresearchintocontinentalancestrygroupsoftheukbiobank
AT zhengjie frameworkforresearchintocontinentalancestrygroupsoftheukbiobank
AT bullcarolinej frameworkforresearchintocontinentalancestrygroupsoftheukbiobank
AT timpsonnicholasj frameworkforresearchintocontinentalancestrygroupsoftheukbiobank
AT amulicborko frameworkforresearchintocontinentalancestrygroupsoftheukbiobank
AT vincentemmae frameworkforresearchintocontinentalancestrygroupsoftheukbiobank
AT hughesdavida frameworkforresearchintocontinentalancestrygroupsoftheukbiobank