Cargando…
A framework for research into continental ancestry groups of the UK Biobank
BACKGROUND: The UK Biobank is a large prospective cohort, based in the UK, that has deep phenotypic and genomic data on roughly a half a million individuals. Included in this resource are data on approximately 78,000 individuals with “non-white British ancestry.” While most epidemiology studies have...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8800339/ https://www.ncbi.nlm.nih.gov/pubmed/35093177 http://dx.doi.org/10.1186/s40246-022-00380-5 |
_version_ | 1784642239791628288 |
---|---|
author | Constantinescu, Andrei-Emil Mitchell, Ruth E. Zheng, Jie Bull, Caroline J. Timpson, Nicholas J. Amulic, Borko Vincent, Emma E. Hughes, David A. |
author_facet | Constantinescu, Andrei-Emil Mitchell, Ruth E. Zheng, Jie Bull, Caroline J. Timpson, Nicholas J. Amulic, Borko Vincent, Emma E. Hughes, David A. |
author_sort | Constantinescu, Andrei-Emil |
collection | PubMed |
description | BACKGROUND: The UK Biobank is a large prospective cohort, based in the UK, that has deep phenotypic and genomic data on roughly a half a million individuals. Included in this resource are data on approximately 78,000 individuals with “non-white British ancestry.” While most epidemiology studies have focused predominantly on populations of European ancestry, there is an opportunity to contribute to the study of health and disease for a broader segment of the population by making use of the UK Biobank’s “non-white British ancestry” samples. Here, we present an empirical description of the continental ancestry and population structure among the individuals in this UK Biobank subset. RESULTS: Reference populations from the 1000 Genomes Project for Africa, Europe, East Asia, and South Asia were used to estimate ancestry for each individual. Those with at least 80% ancestry in one of these four continental ancestry groups were taken forward (N = 62,484). Principal component and K-means clustering analyses were used to identify and characterize population structure within each ancestry group. Of the approximately 78,000 individuals in the UK Biobank that are of “non-white British” ancestry, 50,685, 6653, 2782, and 2364 individuals were associated to the European, African, South Asian, and East Asian continental ancestry groups, respectively. Each continental ancestry group exhibits prominent population structure that is consistent with self-reported country of birth data and geography. CONCLUSIONS: Methods outlined here provide an avenue to leverage UK Biobank’s deeply phenotyped data allowing researchers to maximize its potential in the study of health and disease in individuals of non-white British ancestry. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40246-022-00380-5. |
format | Online Article Text |
id | pubmed-8800339 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-88003392022-02-02 A framework for research into continental ancestry groups of the UK Biobank Constantinescu, Andrei-Emil Mitchell, Ruth E. Zheng, Jie Bull, Caroline J. Timpson, Nicholas J. Amulic, Borko Vincent, Emma E. Hughes, David A. Hum Genomics Primary Research BACKGROUND: The UK Biobank is a large prospective cohort, based in the UK, that has deep phenotypic and genomic data on roughly a half a million individuals. Included in this resource are data on approximately 78,000 individuals with “non-white British ancestry.” While most epidemiology studies have focused predominantly on populations of European ancestry, there is an opportunity to contribute to the study of health and disease for a broader segment of the population by making use of the UK Biobank’s “non-white British ancestry” samples. Here, we present an empirical description of the continental ancestry and population structure among the individuals in this UK Biobank subset. RESULTS: Reference populations from the 1000 Genomes Project for Africa, Europe, East Asia, and South Asia were used to estimate ancestry for each individual. Those with at least 80% ancestry in one of these four continental ancestry groups were taken forward (N = 62,484). Principal component and K-means clustering analyses were used to identify and characterize population structure within each ancestry group. Of the approximately 78,000 individuals in the UK Biobank that are of “non-white British” ancestry, 50,685, 6653, 2782, and 2364 individuals were associated to the European, African, South Asian, and East Asian continental ancestry groups, respectively. Each continental ancestry group exhibits prominent population structure that is consistent with self-reported country of birth data and geography. CONCLUSIONS: Methods outlined here provide an avenue to leverage UK Biobank’s deeply phenotyped data allowing researchers to maximize its potential in the study of health and disease in individuals of non-white British ancestry. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40246-022-00380-5. BioMed Central 2022-01-29 /pmc/articles/PMC8800339/ /pubmed/35093177 http://dx.doi.org/10.1186/s40246-022-00380-5 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Primary Research Constantinescu, Andrei-Emil Mitchell, Ruth E. Zheng, Jie Bull, Caroline J. Timpson, Nicholas J. Amulic, Borko Vincent, Emma E. Hughes, David A. A framework for research into continental ancestry groups of the UK Biobank |
title | A framework for research into continental ancestry groups of the UK Biobank |
title_full | A framework for research into continental ancestry groups of the UK Biobank |
title_fullStr | A framework for research into continental ancestry groups of the UK Biobank |
title_full_unstemmed | A framework for research into continental ancestry groups of the UK Biobank |
title_short | A framework for research into continental ancestry groups of the UK Biobank |
title_sort | framework for research into continental ancestry groups of the uk biobank |
topic | Primary Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8800339/ https://www.ncbi.nlm.nih.gov/pubmed/35093177 http://dx.doi.org/10.1186/s40246-022-00380-5 |
work_keys_str_mv | AT constantinescuandreiemil aframeworkforresearchintocontinentalancestrygroupsoftheukbiobank AT mitchellruthe aframeworkforresearchintocontinentalancestrygroupsoftheukbiobank AT zhengjie aframeworkforresearchintocontinentalancestrygroupsoftheukbiobank AT bullcarolinej aframeworkforresearchintocontinentalancestrygroupsoftheukbiobank AT timpsonnicholasj aframeworkforresearchintocontinentalancestrygroupsoftheukbiobank AT amulicborko aframeworkforresearchintocontinentalancestrygroupsoftheukbiobank AT vincentemmae aframeworkforresearchintocontinentalancestrygroupsoftheukbiobank AT hughesdavida aframeworkforresearchintocontinentalancestrygroupsoftheukbiobank AT constantinescuandreiemil frameworkforresearchintocontinentalancestrygroupsoftheukbiobank AT mitchellruthe frameworkforresearchintocontinentalancestrygroupsoftheukbiobank AT zhengjie frameworkforresearchintocontinentalancestrygroupsoftheukbiobank AT bullcarolinej frameworkforresearchintocontinentalancestrygroupsoftheukbiobank AT timpsonnicholasj frameworkforresearchintocontinentalancestrygroupsoftheukbiobank AT amulicborko frameworkforresearchintocontinentalancestrygroupsoftheukbiobank AT vincentemmae frameworkforresearchintocontinentalancestrygroupsoftheukbiobank AT hughesdavida frameworkforresearchintocontinentalancestrygroupsoftheukbiobank |