Cargando…

Leveraging genomic diversity for discovery in an electronic health record linked biobank: the UCLA ATLAS Community Health Initiative

BACKGROUND: Large medical centers in urban areas, like Los Angeles, care for a diverse patient population and offer the potential to study the interplay between genetic ancestry and social determinants of health. Here, we explore the implications of genetic ancestry within the University of Californ...

Descripción completa

Detalles Bibliográficos
Autores principales: Johnson, Ruth, Ding, Yi, Venkateswaran, Vidhya, Bhattacharya, Arjun, Boulier, Kristin, Chiu, Alec, Knyazev, Sergey, Schwarz, Tommer, Freund, Malika, Zhan, Lingyu, Burch, Kathryn S., Caggiano, Christa, Hill, Brian, Rakocz, Nadav, Balliu, Brunilda, Denny, Christopher T., Sul, Jae Hoon, Zaitlen, Noah, Arboleda, Valerie A., Halperin, Eran, Sankararaman, Sriram, Butte, Manish J., Lajonchere, Clara, Geschwind, Daniel H., Pasaniuc, Bogdan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9461263/
https://www.ncbi.nlm.nih.gov/pubmed/36085083
http://dx.doi.org/10.1186/s13073-022-01106-x
_version_ 1784786938482393088
author Johnson, Ruth
Ding, Yi
Venkateswaran, Vidhya
Bhattacharya, Arjun
Boulier, Kristin
Chiu, Alec
Knyazev, Sergey
Schwarz, Tommer
Freund, Malika
Zhan, Lingyu
Burch, Kathryn S.
Caggiano, Christa
Hill, Brian
Rakocz, Nadav
Balliu, Brunilda
Denny, Christopher T.
Sul, Jae Hoon
Zaitlen, Noah
Arboleda, Valerie A.
Halperin, Eran
Sankararaman, Sriram
Butte, Manish J.
Lajonchere, Clara
Geschwind, Daniel H.
Pasaniuc, Bogdan
author_facet Johnson, Ruth
Ding, Yi
Venkateswaran, Vidhya
Bhattacharya, Arjun
Boulier, Kristin
Chiu, Alec
Knyazev, Sergey
Schwarz, Tommer
Freund, Malika
Zhan, Lingyu
Burch, Kathryn S.
Caggiano, Christa
Hill, Brian
Rakocz, Nadav
Balliu, Brunilda
Denny, Christopher T.
Sul, Jae Hoon
Zaitlen, Noah
Arboleda, Valerie A.
Halperin, Eran
Sankararaman, Sriram
Butte, Manish J.
Lajonchere, Clara
Geschwind, Daniel H.
Pasaniuc, Bogdan
author_sort Johnson, Ruth
collection PubMed
description BACKGROUND: Large medical centers in urban areas, like Los Angeles, care for a diverse patient population and offer the potential to study the interplay between genetic ancestry and social determinants of health. Here, we explore the implications of genetic ancestry within the University of California, Los Angeles (UCLA) ATLAS Community Health Initiative—an ancestrally diverse biobank of genomic data linked with de-identified electronic health records (EHRs) of UCLA Health patients (N=36,736). METHODS: We quantify the extensive continental and subcontinental genetic diversity within the ATLAS data through principal component analysis, identity-by-descent, and genetic admixture. We assess the relationship between genetically inferred ancestry (GIA) and >1500 EHR-derived phenotypes (phecodes). Finally, we demonstrate the utility of genetic data linked with EHR to perform ancestry-specific and multi-ancestry genome and phenome-wide scans across a broad set of disease phenotypes. RESULTS: We identify 5 continental-scale GIA clusters including European American (EA), African American (AA), Hispanic Latino American (HL), South Asian American (SAA) and East Asian American (EAA) individuals and 7 subcontinental GIA clusters within the EAA GIA corresponding to Chinese American, Vietnamese American, and Japanese American individuals. Although we broadly find that self-identified race/ethnicity (SIRE) is highly correlated with GIA, we still observe marked differences between the two, emphasizing that the populations defined by these two criteria are not analogous. We find a total of 259 significant associations between continental GIA and phecodes even after accounting for individuals’ SIRE, demonstrating that for some phenotypes, GIA provides information not already captured by SIRE. GWAS identifies significant associations for liver disease in the 22q13.31 locus across the HL and EAA GIA groups (HL p-value=2.32×10(−16), EAA p-value=6.73×10(−11)). A subsequent PheWAS at the top SNP reveals significant associations with neurologic and neoplastic phenotypes specifically within the HL GIA group. CONCLUSIONS: Overall, our results explore the interplay between SIRE and GIA within a disease context and underscore the utility of studying the genomes of diverse individuals through biobank-scale genotyping linked with EHR-based phenotyping. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13073-022-01106-x.
format Online
Article
Text
id pubmed-9461263
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-94612632022-09-10 Leveraging genomic diversity for discovery in an electronic health record linked biobank: the UCLA ATLAS Community Health Initiative Johnson, Ruth Ding, Yi Venkateswaran, Vidhya Bhattacharya, Arjun Boulier, Kristin Chiu, Alec Knyazev, Sergey Schwarz, Tommer Freund, Malika Zhan, Lingyu Burch, Kathryn S. Caggiano, Christa Hill, Brian Rakocz, Nadav Balliu, Brunilda Denny, Christopher T. Sul, Jae Hoon Zaitlen, Noah Arboleda, Valerie A. Halperin, Eran Sankararaman, Sriram Butte, Manish J. Lajonchere, Clara Geschwind, Daniel H. Pasaniuc, Bogdan Genome Med Research BACKGROUND: Large medical centers in urban areas, like Los Angeles, care for a diverse patient population and offer the potential to study the interplay between genetic ancestry and social determinants of health. Here, we explore the implications of genetic ancestry within the University of California, Los Angeles (UCLA) ATLAS Community Health Initiative—an ancestrally diverse biobank of genomic data linked with de-identified electronic health records (EHRs) of UCLA Health patients (N=36,736). METHODS: We quantify the extensive continental and subcontinental genetic diversity within the ATLAS data through principal component analysis, identity-by-descent, and genetic admixture. We assess the relationship between genetically inferred ancestry (GIA) and >1500 EHR-derived phenotypes (phecodes). Finally, we demonstrate the utility of genetic data linked with EHR to perform ancestry-specific and multi-ancestry genome and phenome-wide scans across a broad set of disease phenotypes. RESULTS: We identify 5 continental-scale GIA clusters including European American (EA), African American (AA), Hispanic Latino American (HL), South Asian American (SAA) and East Asian American (EAA) individuals and 7 subcontinental GIA clusters within the EAA GIA corresponding to Chinese American, Vietnamese American, and Japanese American individuals. Although we broadly find that self-identified race/ethnicity (SIRE) is highly correlated with GIA, we still observe marked differences between the two, emphasizing that the populations defined by these two criteria are not analogous. We find a total of 259 significant associations between continental GIA and phecodes even after accounting for individuals’ SIRE, demonstrating that for some phenotypes, GIA provides information not already captured by SIRE. GWAS identifies significant associations for liver disease in the 22q13.31 locus across the HL and EAA GIA groups (HL p-value=2.32×10(−16), EAA p-value=6.73×10(−11)). A subsequent PheWAS at the top SNP reveals significant associations with neurologic and neoplastic phenotypes specifically within the HL GIA group. CONCLUSIONS: Overall, our results explore the interplay between SIRE and GIA within a disease context and underscore the utility of studying the genomes of diverse individuals through biobank-scale genotyping linked with EHR-based phenotyping. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13073-022-01106-x. BioMed Central 2022-09-09 /pmc/articles/PMC9461263/ /pubmed/36085083 http://dx.doi.org/10.1186/s13073-022-01106-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Johnson, Ruth
Ding, Yi
Venkateswaran, Vidhya
Bhattacharya, Arjun
Boulier, Kristin
Chiu, Alec
Knyazev, Sergey
Schwarz, Tommer
Freund, Malika
Zhan, Lingyu
Burch, Kathryn S.
Caggiano, Christa
Hill, Brian
Rakocz, Nadav
Balliu, Brunilda
Denny, Christopher T.
Sul, Jae Hoon
Zaitlen, Noah
Arboleda, Valerie A.
Halperin, Eran
Sankararaman, Sriram
Butte, Manish J.
Lajonchere, Clara
Geschwind, Daniel H.
Pasaniuc, Bogdan
Leveraging genomic diversity for discovery in an electronic health record linked biobank: the UCLA ATLAS Community Health Initiative
title Leveraging genomic diversity for discovery in an electronic health record linked biobank: the UCLA ATLAS Community Health Initiative
title_full Leveraging genomic diversity for discovery in an electronic health record linked biobank: the UCLA ATLAS Community Health Initiative
title_fullStr Leveraging genomic diversity for discovery in an electronic health record linked biobank: the UCLA ATLAS Community Health Initiative
title_full_unstemmed Leveraging genomic diversity for discovery in an electronic health record linked biobank: the UCLA ATLAS Community Health Initiative
title_short Leveraging genomic diversity for discovery in an electronic health record linked biobank: the UCLA ATLAS Community Health Initiative
title_sort leveraging genomic diversity for discovery in an electronic health record linked biobank: the ucla atlas community health initiative
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9461263/
https://www.ncbi.nlm.nih.gov/pubmed/36085083
http://dx.doi.org/10.1186/s13073-022-01106-x
work_keys_str_mv AT johnsonruth leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT dingyi leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT venkateswaranvidhya leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT bhattacharyaarjun leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT boulierkristin leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT chiualec leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT knyazevsergey leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT schwarztommer leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT freundmalika leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT zhanlingyu leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT burchkathryns leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT caggianochrista leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT hillbrian leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT rakocznadav leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT balliubrunilda leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT dennychristophert leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT suljaehoon leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT zaitlennoah leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT arboledavaleriea leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT halperineran leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT sankararamansriram leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT buttemanishj leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT lajonchereclara leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT geschwinddanielh leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative
AT pasaniucbogdan leveraginggenomicdiversityfordiscoveryinanelectronichealthrecordlinkedbiobanktheuclaatlascommunityhealthinitiative