Cargando…

The sequences of 150,119 genomes in the UK Biobank

Detailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequenci...

Descripción completa

Detalles Bibliográficos
Autores principales: Halldorsson, Bjarni V., Eggertsson, Hannes P., Moore, Kristjan H. S., Hauswedell, Hannes, Eiriksson, Ogmundur, Ulfarsson, Magnus O., Palsson, Gunnar, Hardarson, Marteinn T., Oddsson, Asmundur, Jensson, Brynjar O., Kristmundsdottir, Snaedis, Sigurpalsdottir, Brynja D., Stefansson, Olafur A., Beyter, Doruk, Holley, Guillaume, Tragante, Vinicius, Gylfason, Arnaldur, Olason, Pall I., Zink, Florian, Asgeirsdottir, Margret, Sverrisson, Sverrir T., Sigurdsson, Brynjar, Gudjonsson, Sigurjon A., Sigurdsson, Gunnar T., Halldorsson, Gisli H., Sveinbjornsson, Gardar, Norland, Kristjan, Styrkarsdottir, Unnur, Magnusdottir, Droplaug N., Snorradottir, Steinunn, Kristinsson, Kari, Sobech, Emilia, Jonsson, Helgi, Geirsson, Arni J., Olafsson, Isleifur, Jonsson, Palmi, Pedersen, Ole Birger, Erikstrup, Christian, Brunak, Søren, Ostrowski, Sisse Rye, Thorleifsson, Gudmar, Jonsson, Frosti, Melsted, Pall, Jonsdottir, Ingileif, Rafnar, Thorunn, Holm, Hilma, Stefansson, Hreinn, Saemundsdottir, Jona, Gudbjartsson, Daniel F., Magnusson, Olafur T., Masson, Gisli, Thorsteinsdottir, Unnur, Helgason, Agnar, Jonsson, Hakon, Sulem, Patrick, Stefansson, Kari
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9329122/
https://www.ncbi.nlm.nih.gov/pubmed/35859178
http://dx.doi.org/10.1038/s41586-022-04965-x
_version_ 1784757870693187584
author Halldorsson, Bjarni V.
Eggertsson, Hannes P.
Moore, Kristjan H. S.
Hauswedell, Hannes
Eiriksson, Ogmundur
Ulfarsson, Magnus O.
Palsson, Gunnar
Hardarson, Marteinn T.
Oddsson, Asmundur
Jensson, Brynjar O.
Kristmundsdottir, Snaedis
Sigurpalsdottir, Brynja D.
Stefansson, Olafur A.
Beyter, Doruk
Holley, Guillaume
Tragante, Vinicius
Gylfason, Arnaldur
Olason, Pall I.
Zink, Florian
Asgeirsdottir, Margret
Sverrisson, Sverrir T.
Sigurdsson, Brynjar
Gudjonsson, Sigurjon A.
Sigurdsson, Gunnar T.
Halldorsson, Gisli H.
Sveinbjornsson, Gardar
Norland, Kristjan
Styrkarsdottir, Unnur
Magnusdottir, Droplaug N.
Snorradottir, Steinunn
Kristinsson, Kari
Sobech, Emilia
Jonsson, Helgi
Geirsson, Arni J.
Olafsson, Isleifur
Jonsson, Palmi
Pedersen, Ole Birger
Erikstrup, Christian
Brunak, Søren
Ostrowski, Sisse Rye
Thorleifsson, Gudmar
Jonsson, Frosti
Melsted, Pall
Jonsdottir, Ingileif
Rafnar, Thorunn
Holm, Hilma
Stefansson, Hreinn
Saemundsdottir, Jona
Gudbjartsson, Daniel F.
Magnusson, Olafur T.
Masson, Gisli
Thorsteinsdottir, Unnur
Helgason, Agnar
Jonsson, Hakon
Sulem, Patrick
Stefansson, Kari
author_facet Halldorsson, Bjarni V.
Eggertsson, Hannes P.
Moore, Kristjan H. S.
Hauswedell, Hannes
Eiriksson, Ogmundur
Ulfarsson, Magnus O.
Palsson, Gunnar
Hardarson, Marteinn T.
Oddsson, Asmundur
Jensson, Brynjar O.
Kristmundsdottir, Snaedis
Sigurpalsdottir, Brynja D.
Stefansson, Olafur A.
Beyter, Doruk
Holley, Guillaume
Tragante, Vinicius
Gylfason, Arnaldur
Olason, Pall I.
Zink, Florian
Asgeirsdottir, Margret
Sverrisson, Sverrir T.
Sigurdsson, Brynjar
Gudjonsson, Sigurjon A.
Sigurdsson, Gunnar T.
Halldorsson, Gisli H.
Sveinbjornsson, Gardar
Norland, Kristjan
Styrkarsdottir, Unnur
Magnusdottir, Droplaug N.
Snorradottir, Steinunn
Kristinsson, Kari
Sobech, Emilia
Jonsson, Helgi
Geirsson, Arni J.
Olafsson, Isleifur
Jonsson, Palmi
Pedersen, Ole Birger
Erikstrup, Christian
Brunak, Søren
Ostrowski, Sisse Rye
Thorleifsson, Gudmar
Jonsson, Frosti
Melsted, Pall
Jonsdottir, Ingileif
Rafnar, Thorunn
Holm, Hilma
Stefansson, Hreinn
Saemundsdottir, Jona
Gudbjartsson, Daniel F.
Magnusson, Olafur T.
Masson, Gisli
Thorsteinsdottir, Unnur
Helgason, Agnar
Jonsson, Hakon
Sulem, Patrick
Stefansson, Kari
author_sort Halldorsson, Bjarni V.
collection PubMed
description Detailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequencing or whole-genome sequencing of large cohorts with rich phenotypic data(1,2). Here we describe the analysis of whole-genome sequencing of 150,119 individuals from the UK Biobank(3). This constitutes a set of high-quality variants, including 585,040,410 single-nucleotide polymorphisms, representing 7.0% of all possible human single-nucleotide polymorphisms, and 58,707,036 indels. This large set of variants allows us to characterize selection based on sequence variation within a population through a depletion rank score of windows along the genome. Depletion rank analysis shows that coding exons represent a small fraction of regions in the genome subject to strong sequence conservation. We define three cohorts within the UK Biobank: a large British Irish cohort, a smaller African cohort and a South Asian cohort. A haplotype reference panel is provided that allows reliable imputation of most variants carried by three or more sequenced individuals. We identified 895,055 structural variants and 2,536,688 microsatellites, groups of variants typically excluded from large-scale whole-genome sequencing studies. Using this formidable new resource, we provide several examples of trait associations for rare variants with large effects not found previously through studies based on whole-exome sequencing and/or imputation.
format Online
Article
Text
id pubmed-9329122
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-93291222022-07-29 The sequences of 150,119 genomes in the UK Biobank Halldorsson, Bjarni V. Eggertsson, Hannes P. Moore, Kristjan H. S. Hauswedell, Hannes Eiriksson, Ogmundur Ulfarsson, Magnus O. Palsson, Gunnar Hardarson, Marteinn T. Oddsson, Asmundur Jensson, Brynjar O. Kristmundsdottir, Snaedis Sigurpalsdottir, Brynja D. Stefansson, Olafur A. Beyter, Doruk Holley, Guillaume Tragante, Vinicius Gylfason, Arnaldur Olason, Pall I. Zink, Florian Asgeirsdottir, Margret Sverrisson, Sverrir T. Sigurdsson, Brynjar Gudjonsson, Sigurjon A. Sigurdsson, Gunnar T. Halldorsson, Gisli H. Sveinbjornsson, Gardar Norland, Kristjan Styrkarsdottir, Unnur Magnusdottir, Droplaug N. Snorradottir, Steinunn Kristinsson, Kari Sobech, Emilia Jonsson, Helgi Geirsson, Arni J. Olafsson, Isleifur Jonsson, Palmi Pedersen, Ole Birger Erikstrup, Christian Brunak, Søren Ostrowski, Sisse Rye Thorleifsson, Gudmar Jonsson, Frosti Melsted, Pall Jonsdottir, Ingileif Rafnar, Thorunn Holm, Hilma Stefansson, Hreinn Saemundsdottir, Jona Gudbjartsson, Daniel F. Magnusson, Olafur T. Masson, Gisli Thorsteinsdottir, Unnur Helgason, Agnar Jonsson, Hakon Sulem, Patrick Stefansson, Kari Nature Article Detailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequencing or whole-genome sequencing of large cohorts with rich phenotypic data(1,2). Here we describe the analysis of whole-genome sequencing of 150,119 individuals from the UK Biobank(3). This constitutes a set of high-quality variants, including 585,040,410 single-nucleotide polymorphisms, representing 7.0% of all possible human single-nucleotide polymorphisms, and 58,707,036 indels. This large set of variants allows us to characterize selection based on sequence variation within a population through a depletion rank score of windows along the genome. Depletion rank analysis shows that coding exons represent a small fraction of regions in the genome subject to strong sequence conservation. We define three cohorts within the UK Biobank: a large British Irish cohort, a smaller African cohort and a South Asian cohort. A haplotype reference panel is provided that allows reliable imputation of most variants carried by three or more sequenced individuals. We identified 895,055 structural variants and 2,536,688 microsatellites, groups of variants typically excluded from large-scale whole-genome sequencing studies. Using this formidable new resource, we provide several examples of trait associations for rare variants with large effects not found previously through studies based on whole-exome sequencing and/or imputation. Nature Publishing Group UK 2022-07-20 2022 /pmc/articles/PMC9329122/ /pubmed/35859178 http://dx.doi.org/10.1038/s41586-022-04965-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Halldorsson, Bjarni V.
Eggertsson, Hannes P.
Moore, Kristjan H. S.
Hauswedell, Hannes
Eiriksson, Ogmundur
Ulfarsson, Magnus O.
Palsson, Gunnar
Hardarson, Marteinn T.
Oddsson, Asmundur
Jensson, Brynjar O.
Kristmundsdottir, Snaedis
Sigurpalsdottir, Brynja D.
Stefansson, Olafur A.
Beyter, Doruk
Holley, Guillaume
Tragante, Vinicius
Gylfason, Arnaldur
Olason, Pall I.
Zink, Florian
Asgeirsdottir, Margret
Sverrisson, Sverrir T.
Sigurdsson, Brynjar
Gudjonsson, Sigurjon A.
Sigurdsson, Gunnar T.
Halldorsson, Gisli H.
Sveinbjornsson, Gardar
Norland, Kristjan
Styrkarsdottir, Unnur
Magnusdottir, Droplaug N.
Snorradottir, Steinunn
Kristinsson, Kari
Sobech, Emilia
Jonsson, Helgi
Geirsson, Arni J.
Olafsson, Isleifur
Jonsson, Palmi
Pedersen, Ole Birger
Erikstrup, Christian
Brunak, Søren
Ostrowski, Sisse Rye
Thorleifsson, Gudmar
Jonsson, Frosti
Melsted, Pall
Jonsdottir, Ingileif
Rafnar, Thorunn
Holm, Hilma
Stefansson, Hreinn
Saemundsdottir, Jona
Gudbjartsson, Daniel F.
Magnusson, Olafur T.
Masson, Gisli
Thorsteinsdottir, Unnur
Helgason, Agnar
Jonsson, Hakon
Sulem, Patrick
Stefansson, Kari
The sequences of 150,119 genomes in the UK Biobank
title The sequences of 150,119 genomes in the UK Biobank
title_full The sequences of 150,119 genomes in the UK Biobank
title_fullStr The sequences of 150,119 genomes in the UK Biobank
title_full_unstemmed The sequences of 150,119 genomes in the UK Biobank
title_short The sequences of 150,119 genomes in the UK Biobank
title_sort sequences of 150,119 genomes in the uk biobank
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9329122/
https://www.ncbi.nlm.nih.gov/pubmed/35859178
http://dx.doi.org/10.1038/s41586-022-04965-x
work_keys_str_mv AT halldorssonbjarniv thesequencesof150119genomesintheukbiobank
AT eggertssonhannesp thesequencesof150119genomesintheukbiobank
AT moorekristjanhs thesequencesof150119genomesintheukbiobank
AT hauswedellhannes thesequencesof150119genomesintheukbiobank
AT eirikssonogmundur thesequencesof150119genomesintheukbiobank
AT ulfarssonmagnuso thesequencesof150119genomesintheukbiobank
AT palssongunnar thesequencesof150119genomesintheukbiobank
AT hardarsonmarteinnt thesequencesof150119genomesintheukbiobank
AT oddssonasmundur thesequencesof150119genomesintheukbiobank
AT jenssonbrynjaro thesequencesof150119genomesintheukbiobank
AT kristmundsdottirsnaedis thesequencesof150119genomesintheukbiobank
AT sigurpalsdottirbrynjad thesequencesof150119genomesintheukbiobank
AT stefanssonolafura thesequencesof150119genomesintheukbiobank
AT beyterdoruk thesequencesof150119genomesintheukbiobank
AT holleyguillaume thesequencesof150119genomesintheukbiobank
AT tragantevinicius thesequencesof150119genomesintheukbiobank
AT gylfasonarnaldur thesequencesof150119genomesintheukbiobank
AT olasonpalli thesequencesof150119genomesintheukbiobank
AT zinkflorian thesequencesof150119genomesintheukbiobank
AT asgeirsdottirmargret thesequencesof150119genomesintheukbiobank
AT sverrissonsverrirt thesequencesof150119genomesintheukbiobank
AT sigurdssonbrynjar thesequencesof150119genomesintheukbiobank
AT gudjonssonsigurjona thesequencesof150119genomesintheukbiobank
AT sigurdssongunnart thesequencesof150119genomesintheukbiobank
AT halldorssongislih thesequencesof150119genomesintheukbiobank
AT sveinbjornssongardar thesequencesof150119genomesintheukbiobank
AT norlandkristjan thesequencesof150119genomesintheukbiobank
AT styrkarsdottirunnur thesequencesof150119genomesintheukbiobank
AT magnusdottirdroplaugn thesequencesof150119genomesintheukbiobank
AT snorradottirsteinunn thesequencesof150119genomesintheukbiobank
AT kristinssonkari thesequencesof150119genomesintheukbiobank
AT sobechemilia thesequencesof150119genomesintheukbiobank
AT jonssonhelgi thesequencesof150119genomesintheukbiobank
AT geirssonarnij thesequencesof150119genomesintheukbiobank
AT olafssonisleifur thesequencesof150119genomesintheukbiobank
AT jonssonpalmi thesequencesof150119genomesintheukbiobank
AT pedersenolebirger thesequencesof150119genomesintheukbiobank
AT erikstrupchristian thesequencesof150119genomesintheukbiobank
AT brunaksøren thesequencesof150119genomesintheukbiobank
AT ostrowskisisserye thesequencesof150119genomesintheukbiobank
AT thesequencesof150119genomesintheukbiobank
AT thorleifssongudmar thesequencesof150119genomesintheukbiobank
AT jonssonfrosti thesequencesof150119genomesintheukbiobank
AT melstedpall thesequencesof150119genomesintheukbiobank
AT jonsdottiringileif thesequencesof150119genomesintheukbiobank
AT rafnarthorunn thesequencesof150119genomesintheukbiobank
AT holmhilma thesequencesof150119genomesintheukbiobank
AT stefanssonhreinn thesequencesof150119genomesintheukbiobank
AT saemundsdottirjona thesequencesof150119genomesintheukbiobank
AT gudbjartssondanielf thesequencesof150119genomesintheukbiobank
AT magnussonolafurt thesequencesof150119genomesintheukbiobank
AT massongisli thesequencesof150119genomesintheukbiobank
AT thorsteinsdottirunnur thesequencesof150119genomesintheukbiobank
AT helgasonagnar thesequencesof150119genomesintheukbiobank
AT jonssonhakon thesequencesof150119genomesintheukbiobank
AT sulempatrick thesequencesof150119genomesintheukbiobank
AT stefanssonkari thesequencesof150119genomesintheukbiobank
AT halldorssonbjarniv sequencesof150119genomesintheukbiobank
AT eggertssonhannesp sequencesof150119genomesintheukbiobank
AT moorekristjanhs sequencesof150119genomesintheukbiobank
AT hauswedellhannes sequencesof150119genomesintheukbiobank
AT eirikssonogmundur sequencesof150119genomesintheukbiobank
AT ulfarssonmagnuso sequencesof150119genomesintheukbiobank
AT palssongunnar sequencesof150119genomesintheukbiobank
AT hardarsonmarteinnt sequencesof150119genomesintheukbiobank
AT oddssonasmundur sequencesof150119genomesintheukbiobank
AT jenssonbrynjaro sequencesof150119genomesintheukbiobank
AT kristmundsdottirsnaedis sequencesof150119genomesintheukbiobank
AT sigurpalsdottirbrynjad sequencesof150119genomesintheukbiobank
AT stefanssonolafura sequencesof150119genomesintheukbiobank
AT beyterdoruk sequencesof150119genomesintheukbiobank
AT holleyguillaume sequencesof150119genomesintheukbiobank
AT tragantevinicius sequencesof150119genomesintheukbiobank
AT gylfasonarnaldur sequencesof150119genomesintheukbiobank
AT olasonpalli sequencesof150119genomesintheukbiobank
AT zinkflorian sequencesof150119genomesintheukbiobank
AT asgeirsdottirmargret sequencesof150119genomesintheukbiobank
AT sverrissonsverrirt sequencesof150119genomesintheukbiobank
AT sigurdssonbrynjar sequencesof150119genomesintheukbiobank
AT gudjonssonsigurjona sequencesof150119genomesintheukbiobank
AT sigurdssongunnart sequencesof150119genomesintheukbiobank
AT halldorssongislih sequencesof150119genomesintheukbiobank
AT sveinbjornssongardar sequencesof150119genomesintheukbiobank
AT norlandkristjan sequencesof150119genomesintheukbiobank
AT styrkarsdottirunnur sequencesof150119genomesintheukbiobank
AT magnusdottirdroplaugn sequencesof150119genomesintheukbiobank
AT snorradottirsteinunn sequencesof150119genomesintheukbiobank
AT kristinssonkari sequencesof150119genomesintheukbiobank
AT sobechemilia sequencesof150119genomesintheukbiobank
AT jonssonhelgi sequencesof150119genomesintheukbiobank
AT geirssonarnij sequencesof150119genomesintheukbiobank
AT olafssonisleifur sequencesof150119genomesintheukbiobank
AT jonssonpalmi sequencesof150119genomesintheukbiobank
AT pedersenolebirger sequencesof150119genomesintheukbiobank
AT erikstrupchristian sequencesof150119genomesintheukbiobank
AT brunaksøren sequencesof150119genomesintheukbiobank
AT ostrowskisisserye sequencesof150119genomesintheukbiobank
AT sequencesof150119genomesintheukbiobank
AT thorleifssongudmar sequencesof150119genomesintheukbiobank
AT jonssonfrosti sequencesof150119genomesintheukbiobank
AT melstedpall sequencesof150119genomesintheukbiobank
AT jonsdottiringileif sequencesof150119genomesintheukbiobank
AT rafnarthorunn sequencesof150119genomesintheukbiobank
AT holmhilma sequencesof150119genomesintheukbiobank
AT stefanssonhreinn sequencesof150119genomesintheukbiobank
AT saemundsdottirjona sequencesof150119genomesintheukbiobank
AT gudbjartssondanielf sequencesof150119genomesintheukbiobank
AT magnussonolafurt sequencesof150119genomesintheukbiobank
AT massongisli sequencesof150119genomesintheukbiobank
AT thorsteinsdottirunnur sequencesof150119genomesintheukbiobank
AT helgasonagnar sequencesof150119genomesintheukbiobank
AT jonssonhakon sequencesof150119genomesintheukbiobank
AT sulempatrick sequencesof150119genomesintheukbiobank
AT stefanssonkari sequencesof150119genomesintheukbiobank