Cargando…

scGWAS: landscape of trait-cell type associations by integrating single-cell transcriptomics-wide and genome-wide association studies

BACKGROUND: The rapid accumulation of single-cell RNA sequencing (scRNA-seq) data presents unique opportunities to decode the genetically mediated cell-type specificity in complex diseases. Here, we develop a new method, scGWAS, which effectively leverages scRNA-seq data to achieve two goals: (1) to...

Descripción completa

Detalles Bibliográficos
Autores principales: Jia, Peilin, Hu, Ruifeng, Yan, Fangfang, Dai, Yulin, Zhao, Zhongming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9575201/
https://www.ncbi.nlm.nih.gov/pubmed/36253801
http://dx.doi.org/10.1186/s13059-022-02785-w
_version_ 1784811266791964672
author Jia, Peilin
Hu, Ruifeng
Yan, Fangfang
Dai, Yulin
Zhao, Zhongming
author_facet Jia, Peilin
Hu, Ruifeng
Yan, Fangfang
Dai, Yulin
Zhao, Zhongming
author_sort Jia, Peilin
collection PubMed
description BACKGROUND: The rapid accumulation of single-cell RNA sequencing (scRNA-seq) data presents unique opportunities to decode the genetically mediated cell-type specificity in complex diseases. Here, we develop a new method, scGWAS, which effectively leverages scRNA-seq data to achieve two goals: (1) to infer the cell types in which the disease-associated genes manifest and (2) to construct cellular modules which imply disease-specific activation of different processes. RESULTS: scGWAS only utilizes the average gene expression for each cell type followed by virtual search processes to construct the null distributions of module scores, making it scalable to large scRNA-seq datasets. We demonstrated scGWAS in 40 genome-wide association studies (GWAS) datasets (average sample size N ≈ 154,000) using 18 scRNA-seq datasets from nine major human/mouse tissues (totaling 1.08 million cells) and identified 2533 trait and cell-type associations, each with significant modules for further investigation. The module genes were validated using disease or clinically annotated references from ClinVar, OMIM, and pLI variants. CONCLUSIONS: We showed that the trait-cell type associations identified by scGWAS, while generally constrained to trait-tissue associations, could recapitulate many well-studied relationships and also reveal novel relationships, providing insights into the unsolved trait-tissue associations. Moreover, in each specific cell type, the associations with different traits were often mediated by different sets of risk genes, implying disease-specific activation of driving processes. In summary, scGWAS is a powerful tool for exploring the genetic basis of complex diseases at the cell type level using single-cell expression data. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-022-02785-w.
format Online
Article
Text
id pubmed-9575201
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-95752012022-10-18 scGWAS: landscape of trait-cell type associations by integrating single-cell transcriptomics-wide and genome-wide association studies Jia, Peilin Hu, Ruifeng Yan, Fangfang Dai, Yulin Zhao, Zhongming Genome Biol Research BACKGROUND: The rapid accumulation of single-cell RNA sequencing (scRNA-seq) data presents unique opportunities to decode the genetically mediated cell-type specificity in complex diseases. Here, we develop a new method, scGWAS, which effectively leverages scRNA-seq data to achieve two goals: (1) to infer the cell types in which the disease-associated genes manifest and (2) to construct cellular modules which imply disease-specific activation of different processes. RESULTS: scGWAS only utilizes the average gene expression for each cell type followed by virtual search processes to construct the null distributions of module scores, making it scalable to large scRNA-seq datasets. We demonstrated scGWAS in 40 genome-wide association studies (GWAS) datasets (average sample size N ≈ 154,000) using 18 scRNA-seq datasets from nine major human/mouse tissues (totaling 1.08 million cells) and identified 2533 trait and cell-type associations, each with significant modules for further investigation. The module genes were validated using disease or clinically annotated references from ClinVar, OMIM, and pLI variants. CONCLUSIONS: We showed that the trait-cell type associations identified by scGWAS, while generally constrained to trait-tissue associations, could recapitulate many well-studied relationships and also reveal novel relationships, providing insights into the unsolved trait-tissue associations. Moreover, in each specific cell type, the associations with different traits were often mediated by different sets of risk genes, implying disease-specific activation of driving processes. In summary, scGWAS is a powerful tool for exploring the genetic basis of complex diseases at the cell type level using single-cell expression data. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-022-02785-w. BioMed Central 2022-10-17 /pmc/articles/PMC9575201/ /pubmed/36253801 http://dx.doi.org/10.1186/s13059-022-02785-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Jia, Peilin
Hu, Ruifeng
Yan, Fangfang
Dai, Yulin
Zhao, Zhongming
scGWAS: landscape of trait-cell type associations by integrating single-cell transcriptomics-wide and genome-wide association studies
title scGWAS: landscape of trait-cell type associations by integrating single-cell transcriptomics-wide and genome-wide association studies
title_full scGWAS: landscape of trait-cell type associations by integrating single-cell transcriptomics-wide and genome-wide association studies
title_fullStr scGWAS: landscape of trait-cell type associations by integrating single-cell transcriptomics-wide and genome-wide association studies
title_full_unstemmed scGWAS: landscape of trait-cell type associations by integrating single-cell transcriptomics-wide and genome-wide association studies
title_short scGWAS: landscape of trait-cell type associations by integrating single-cell transcriptomics-wide and genome-wide association studies
title_sort scgwas: landscape of trait-cell type associations by integrating single-cell transcriptomics-wide and genome-wide association studies
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9575201/
https://www.ncbi.nlm.nih.gov/pubmed/36253801
http://dx.doi.org/10.1186/s13059-022-02785-w
work_keys_str_mv AT jiapeilin scgwaslandscapeoftraitcelltypeassociationsbyintegratingsinglecelltranscriptomicswideandgenomewideassociationstudies
AT huruifeng scgwaslandscapeoftraitcelltypeassociationsbyintegratingsinglecelltranscriptomicswideandgenomewideassociationstudies
AT yanfangfang scgwaslandscapeoftraitcelltypeassociationsbyintegratingsinglecelltranscriptomicswideandgenomewideassociationstudies
AT daiyulin scgwaslandscapeoftraitcelltypeassociationsbyintegratingsinglecelltranscriptomicswideandgenomewideassociationstudies
AT zhaozhongming scgwaslandscapeoftraitcelltypeassociationsbyintegratingsinglecelltranscriptomicswideandgenomewideassociationstudies