Cargando…
The sequences of 150,119 genomes in the UK Biobank
Detailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequenci...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9329122/ https://www.ncbi.nlm.nih.gov/pubmed/35859178 http://dx.doi.org/10.1038/s41586-022-04965-x |
_version_ | 1784757870693187584 |
---|---|
author | Halldorsson, Bjarni V. Eggertsson, Hannes P. Moore, Kristjan H. S. Hauswedell, Hannes Eiriksson, Ogmundur Ulfarsson, Magnus O. Palsson, Gunnar Hardarson, Marteinn T. Oddsson, Asmundur Jensson, Brynjar O. Kristmundsdottir, Snaedis Sigurpalsdottir, Brynja D. Stefansson, Olafur A. Beyter, Doruk Holley, Guillaume Tragante, Vinicius Gylfason, Arnaldur Olason, Pall I. Zink, Florian Asgeirsdottir, Margret Sverrisson, Sverrir T. Sigurdsson, Brynjar Gudjonsson, Sigurjon A. Sigurdsson, Gunnar T. Halldorsson, Gisli H. Sveinbjornsson, Gardar Norland, Kristjan Styrkarsdottir, Unnur Magnusdottir, Droplaug N. Snorradottir, Steinunn Kristinsson, Kari Sobech, Emilia Jonsson, Helgi Geirsson, Arni J. Olafsson, Isleifur Jonsson, Palmi Pedersen, Ole Birger Erikstrup, Christian Brunak, Søren Ostrowski, Sisse Rye Thorleifsson, Gudmar Jonsson, Frosti Melsted, Pall Jonsdottir, Ingileif Rafnar, Thorunn Holm, Hilma Stefansson, Hreinn Saemundsdottir, Jona Gudbjartsson, Daniel F. Magnusson, Olafur T. Masson, Gisli Thorsteinsdottir, Unnur Helgason, Agnar Jonsson, Hakon Sulem, Patrick Stefansson, Kari |
author_facet | Halldorsson, Bjarni V. Eggertsson, Hannes P. Moore, Kristjan H. S. Hauswedell, Hannes Eiriksson, Ogmundur Ulfarsson, Magnus O. Palsson, Gunnar Hardarson, Marteinn T. Oddsson, Asmundur Jensson, Brynjar O. Kristmundsdottir, Snaedis Sigurpalsdottir, Brynja D. Stefansson, Olafur A. Beyter, Doruk Holley, Guillaume Tragante, Vinicius Gylfason, Arnaldur Olason, Pall I. Zink, Florian Asgeirsdottir, Margret Sverrisson, Sverrir T. Sigurdsson, Brynjar Gudjonsson, Sigurjon A. Sigurdsson, Gunnar T. Halldorsson, Gisli H. Sveinbjornsson, Gardar Norland, Kristjan Styrkarsdottir, Unnur Magnusdottir, Droplaug N. Snorradottir, Steinunn Kristinsson, Kari Sobech, Emilia Jonsson, Helgi Geirsson, Arni J. Olafsson, Isleifur Jonsson, Palmi Pedersen, Ole Birger Erikstrup, Christian Brunak, Søren Ostrowski, Sisse Rye Thorleifsson, Gudmar Jonsson, Frosti Melsted, Pall Jonsdottir, Ingileif Rafnar, Thorunn Holm, Hilma Stefansson, Hreinn Saemundsdottir, Jona Gudbjartsson, Daniel F. Magnusson, Olafur T. Masson, Gisli Thorsteinsdottir, Unnur Helgason, Agnar Jonsson, Hakon Sulem, Patrick Stefansson, Kari |
author_sort | Halldorsson, Bjarni V. |
collection | PubMed |
description | Detailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequencing or whole-genome sequencing of large cohorts with rich phenotypic data(1,2). Here we describe the analysis of whole-genome sequencing of 150,119 individuals from the UK Biobank(3). This constitutes a set of high-quality variants, including 585,040,410 single-nucleotide polymorphisms, representing 7.0% of all possible human single-nucleotide polymorphisms, and 58,707,036 indels. This large set of variants allows us to characterize selection based on sequence variation within a population through a depletion rank score of windows along the genome. Depletion rank analysis shows that coding exons represent a small fraction of regions in the genome subject to strong sequence conservation. We define three cohorts within the UK Biobank: a large British Irish cohort, a smaller African cohort and a South Asian cohort. A haplotype reference panel is provided that allows reliable imputation of most variants carried by three or more sequenced individuals. We identified 895,055 structural variants and 2,536,688 microsatellites, groups of variants typically excluded from large-scale whole-genome sequencing studies. Using this formidable new resource, we provide several examples of trait associations for rare variants with large effects not found previously through studies based on whole-exome sequencing and/or imputation. |
format | Online Article Text |
id | pubmed-9329122 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-93291222022-07-29 The sequences of 150,119 genomes in the UK Biobank Halldorsson, Bjarni V. Eggertsson, Hannes P. Moore, Kristjan H. S. Hauswedell, Hannes Eiriksson, Ogmundur Ulfarsson, Magnus O. Palsson, Gunnar Hardarson, Marteinn T. Oddsson, Asmundur Jensson, Brynjar O. Kristmundsdottir, Snaedis Sigurpalsdottir, Brynja D. Stefansson, Olafur A. Beyter, Doruk Holley, Guillaume Tragante, Vinicius Gylfason, Arnaldur Olason, Pall I. Zink, Florian Asgeirsdottir, Margret Sverrisson, Sverrir T. Sigurdsson, Brynjar Gudjonsson, Sigurjon A. Sigurdsson, Gunnar T. Halldorsson, Gisli H. Sveinbjornsson, Gardar Norland, Kristjan Styrkarsdottir, Unnur Magnusdottir, Droplaug N. Snorradottir, Steinunn Kristinsson, Kari Sobech, Emilia Jonsson, Helgi Geirsson, Arni J. Olafsson, Isleifur Jonsson, Palmi Pedersen, Ole Birger Erikstrup, Christian Brunak, Søren Ostrowski, Sisse Rye Thorleifsson, Gudmar Jonsson, Frosti Melsted, Pall Jonsdottir, Ingileif Rafnar, Thorunn Holm, Hilma Stefansson, Hreinn Saemundsdottir, Jona Gudbjartsson, Daniel F. Magnusson, Olafur T. Masson, Gisli Thorsteinsdottir, Unnur Helgason, Agnar Jonsson, Hakon Sulem, Patrick Stefansson, Kari Nature Article Detailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequencing or whole-genome sequencing of large cohorts with rich phenotypic data(1,2). Here we describe the analysis of whole-genome sequencing of 150,119 individuals from the UK Biobank(3). This constitutes a set of high-quality variants, including 585,040,410 single-nucleotide polymorphisms, representing 7.0% of all possible human single-nucleotide polymorphisms, and 58,707,036 indels. This large set of variants allows us to characterize selection based on sequence variation within a population through a depletion rank score of windows along the genome. Depletion rank analysis shows that coding exons represent a small fraction of regions in the genome subject to strong sequence conservation. We define three cohorts within the UK Biobank: a large British Irish cohort, a smaller African cohort and a South Asian cohort. A haplotype reference panel is provided that allows reliable imputation of most variants carried by three or more sequenced individuals. We identified 895,055 structural variants and 2,536,688 microsatellites, groups of variants typically excluded from large-scale whole-genome sequencing studies. Using this formidable new resource, we provide several examples of trait associations for rare variants with large effects not found previously through studies based on whole-exome sequencing and/or imputation. Nature Publishing Group UK 2022-07-20 2022 /pmc/articles/PMC9329122/ /pubmed/35859178 http://dx.doi.org/10.1038/s41586-022-04965-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Halldorsson, Bjarni V. Eggertsson, Hannes P. Moore, Kristjan H. S. Hauswedell, Hannes Eiriksson, Ogmundur Ulfarsson, Magnus O. Palsson, Gunnar Hardarson, Marteinn T. Oddsson, Asmundur Jensson, Brynjar O. Kristmundsdottir, Snaedis Sigurpalsdottir, Brynja D. Stefansson, Olafur A. Beyter, Doruk Holley, Guillaume Tragante, Vinicius Gylfason, Arnaldur Olason, Pall I. Zink, Florian Asgeirsdottir, Margret Sverrisson, Sverrir T. Sigurdsson, Brynjar Gudjonsson, Sigurjon A. Sigurdsson, Gunnar T. Halldorsson, Gisli H. Sveinbjornsson, Gardar Norland, Kristjan Styrkarsdottir, Unnur Magnusdottir, Droplaug N. Snorradottir, Steinunn Kristinsson, Kari Sobech, Emilia Jonsson, Helgi Geirsson, Arni J. Olafsson, Isleifur Jonsson, Palmi Pedersen, Ole Birger Erikstrup, Christian Brunak, Søren Ostrowski, Sisse Rye Thorleifsson, Gudmar Jonsson, Frosti Melsted, Pall Jonsdottir, Ingileif Rafnar, Thorunn Holm, Hilma Stefansson, Hreinn Saemundsdottir, Jona Gudbjartsson, Daniel F. Magnusson, Olafur T. Masson, Gisli Thorsteinsdottir, Unnur Helgason, Agnar Jonsson, Hakon Sulem, Patrick Stefansson, Kari The sequences of 150,119 genomes in the UK Biobank |
title | The sequences of 150,119 genomes in the UK Biobank |
title_full | The sequences of 150,119 genomes in the UK Biobank |
title_fullStr | The sequences of 150,119 genomes in the UK Biobank |
title_full_unstemmed | The sequences of 150,119 genomes in the UK Biobank |
title_short | The sequences of 150,119 genomes in the UK Biobank |
title_sort | sequences of 150,119 genomes in the uk biobank |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9329122/ https://www.ncbi.nlm.nih.gov/pubmed/35859178 http://dx.doi.org/10.1038/s41586-022-04965-x |
work_keys_str_mv | AT halldorssonbjarniv thesequencesof150119genomesintheukbiobank AT eggertssonhannesp thesequencesof150119genomesintheukbiobank AT moorekristjanhs thesequencesof150119genomesintheukbiobank AT hauswedellhannes thesequencesof150119genomesintheukbiobank AT eirikssonogmundur thesequencesof150119genomesintheukbiobank AT ulfarssonmagnuso thesequencesof150119genomesintheukbiobank AT palssongunnar thesequencesof150119genomesintheukbiobank AT hardarsonmarteinnt thesequencesof150119genomesintheukbiobank AT oddssonasmundur thesequencesof150119genomesintheukbiobank AT jenssonbrynjaro thesequencesof150119genomesintheukbiobank AT kristmundsdottirsnaedis thesequencesof150119genomesintheukbiobank AT sigurpalsdottirbrynjad thesequencesof150119genomesintheukbiobank AT stefanssonolafura thesequencesof150119genomesintheukbiobank AT beyterdoruk thesequencesof150119genomesintheukbiobank AT holleyguillaume thesequencesof150119genomesintheukbiobank AT tragantevinicius thesequencesof150119genomesintheukbiobank AT gylfasonarnaldur thesequencesof150119genomesintheukbiobank AT olasonpalli thesequencesof150119genomesintheukbiobank AT zinkflorian thesequencesof150119genomesintheukbiobank AT asgeirsdottirmargret thesequencesof150119genomesintheukbiobank AT sverrissonsverrirt thesequencesof150119genomesintheukbiobank AT sigurdssonbrynjar thesequencesof150119genomesintheukbiobank AT gudjonssonsigurjona thesequencesof150119genomesintheukbiobank AT sigurdssongunnart thesequencesof150119genomesintheukbiobank AT halldorssongislih thesequencesof150119genomesintheukbiobank AT sveinbjornssongardar thesequencesof150119genomesintheukbiobank AT norlandkristjan thesequencesof150119genomesintheukbiobank AT styrkarsdottirunnur thesequencesof150119genomesintheukbiobank AT magnusdottirdroplaugn thesequencesof150119genomesintheukbiobank AT snorradottirsteinunn thesequencesof150119genomesintheukbiobank AT kristinssonkari thesequencesof150119genomesintheukbiobank AT sobechemilia thesequencesof150119genomesintheukbiobank AT jonssonhelgi thesequencesof150119genomesintheukbiobank AT geirssonarnij thesequencesof150119genomesintheukbiobank AT olafssonisleifur thesequencesof150119genomesintheukbiobank AT jonssonpalmi thesequencesof150119genomesintheukbiobank AT pedersenolebirger thesequencesof150119genomesintheukbiobank AT erikstrupchristian thesequencesof150119genomesintheukbiobank AT brunaksøren thesequencesof150119genomesintheukbiobank AT ostrowskisisserye thesequencesof150119genomesintheukbiobank AT thesequencesof150119genomesintheukbiobank AT thorleifssongudmar thesequencesof150119genomesintheukbiobank AT jonssonfrosti thesequencesof150119genomesintheukbiobank AT melstedpall thesequencesof150119genomesintheukbiobank AT jonsdottiringileif thesequencesof150119genomesintheukbiobank AT rafnarthorunn thesequencesof150119genomesintheukbiobank AT holmhilma thesequencesof150119genomesintheukbiobank AT stefanssonhreinn thesequencesof150119genomesintheukbiobank AT saemundsdottirjona thesequencesof150119genomesintheukbiobank AT gudbjartssondanielf thesequencesof150119genomesintheukbiobank AT magnussonolafurt thesequencesof150119genomesintheukbiobank AT massongisli thesequencesof150119genomesintheukbiobank AT thorsteinsdottirunnur thesequencesof150119genomesintheukbiobank AT helgasonagnar thesequencesof150119genomesintheukbiobank AT jonssonhakon thesequencesof150119genomesintheukbiobank AT sulempatrick thesequencesof150119genomesintheukbiobank AT stefanssonkari thesequencesof150119genomesintheukbiobank AT halldorssonbjarniv sequencesof150119genomesintheukbiobank AT eggertssonhannesp sequencesof150119genomesintheukbiobank AT moorekristjanhs sequencesof150119genomesintheukbiobank AT hauswedellhannes sequencesof150119genomesintheukbiobank AT eirikssonogmundur sequencesof150119genomesintheukbiobank AT ulfarssonmagnuso sequencesof150119genomesintheukbiobank AT palssongunnar sequencesof150119genomesintheukbiobank AT hardarsonmarteinnt sequencesof150119genomesintheukbiobank AT oddssonasmundur sequencesof150119genomesintheukbiobank AT jenssonbrynjaro sequencesof150119genomesintheukbiobank AT kristmundsdottirsnaedis sequencesof150119genomesintheukbiobank AT sigurpalsdottirbrynjad sequencesof150119genomesintheukbiobank AT stefanssonolafura sequencesof150119genomesintheukbiobank AT beyterdoruk sequencesof150119genomesintheukbiobank AT holleyguillaume sequencesof150119genomesintheukbiobank AT tragantevinicius sequencesof150119genomesintheukbiobank AT gylfasonarnaldur sequencesof150119genomesintheukbiobank AT olasonpalli sequencesof150119genomesintheukbiobank AT zinkflorian sequencesof150119genomesintheukbiobank AT asgeirsdottirmargret sequencesof150119genomesintheukbiobank AT sverrissonsverrirt sequencesof150119genomesintheukbiobank AT sigurdssonbrynjar sequencesof150119genomesintheukbiobank AT gudjonssonsigurjona sequencesof150119genomesintheukbiobank AT sigurdssongunnart sequencesof150119genomesintheukbiobank AT halldorssongislih sequencesof150119genomesintheukbiobank AT sveinbjornssongardar sequencesof150119genomesintheukbiobank AT norlandkristjan sequencesof150119genomesintheukbiobank AT styrkarsdottirunnur sequencesof150119genomesintheukbiobank AT magnusdottirdroplaugn sequencesof150119genomesintheukbiobank AT snorradottirsteinunn sequencesof150119genomesintheukbiobank AT kristinssonkari sequencesof150119genomesintheukbiobank AT sobechemilia sequencesof150119genomesintheukbiobank AT jonssonhelgi sequencesof150119genomesintheukbiobank AT geirssonarnij sequencesof150119genomesintheukbiobank AT olafssonisleifur sequencesof150119genomesintheukbiobank AT jonssonpalmi sequencesof150119genomesintheukbiobank AT pedersenolebirger sequencesof150119genomesintheukbiobank AT erikstrupchristian sequencesof150119genomesintheukbiobank AT brunaksøren sequencesof150119genomesintheukbiobank AT ostrowskisisserye sequencesof150119genomesintheukbiobank AT sequencesof150119genomesintheukbiobank AT thorleifssongudmar sequencesof150119genomesintheukbiobank AT jonssonfrosti sequencesof150119genomesintheukbiobank AT melstedpall sequencesof150119genomesintheukbiobank AT jonsdottiringileif sequencesof150119genomesintheukbiobank AT rafnarthorunn sequencesof150119genomesintheukbiobank AT holmhilma sequencesof150119genomesintheukbiobank AT stefanssonhreinn sequencesof150119genomesintheukbiobank AT saemundsdottirjona sequencesof150119genomesintheukbiobank AT gudbjartssondanielf sequencesof150119genomesintheukbiobank AT magnussonolafurt sequencesof150119genomesintheukbiobank AT massongisli sequencesof150119genomesintheukbiobank AT thorsteinsdottirunnur sequencesof150119genomesintheukbiobank AT helgasonagnar sequencesof150119genomesintheukbiobank AT jonssonhakon sequencesof150119genomesintheukbiobank AT sulempatrick sequencesof150119genomesintheukbiobank AT stefanssonkari sequencesof150119genomesintheukbiobank |