Cargando…
scGWAS: landscape of trait-cell type associations by integrating single-cell transcriptomics-wide and genome-wide association studies
BACKGROUND: The rapid accumulation of single-cell RNA sequencing (scRNA-seq) data presents unique opportunities to decode the genetically mediated cell-type specificity in complex diseases. Here, we develop a new method, scGWAS, which effectively leverages scRNA-seq data to achieve two goals: (1) to...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9575201/ https://www.ncbi.nlm.nih.gov/pubmed/36253801 http://dx.doi.org/10.1186/s13059-022-02785-w |
_version_ | 1784811266791964672 |
---|---|
author | Jia, Peilin Hu, Ruifeng Yan, Fangfang Dai, Yulin Zhao, Zhongming |
author_facet | Jia, Peilin Hu, Ruifeng Yan, Fangfang Dai, Yulin Zhao, Zhongming |
author_sort | Jia, Peilin |
collection | PubMed |
description | BACKGROUND: The rapid accumulation of single-cell RNA sequencing (scRNA-seq) data presents unique opportunities to decode the genetically mediated cell-type specificity in complex diseases. Here, we develop a new method, scGWAS, which effectively leverages scRNA-seq data to achieve two goals: (1) to infer the cell types in which the disease-associated genes manifest and (2) to construct cellular modules which imply disease-specific activation of different processes. RESULTS: scGWAS only utilizes the average gene expression for each cell type followed by virtual search processes to construct the null distributions of module scores, making it scalable to large scRNA-seq datasets. We demonstrated scGWAS in 40 genome-wide association studies (GWAS) datasets (average sample size N ≈ 154,000) using 18 scRNA-seq datasets from nine major human/mouse tissues (totaling 1.08 million cells) and identified 2533 trait and cell-type associations, each with significant modules for further investigation. The module genes were validated using disease or clinically annotated references from ClinVar, OMIM, and pLI variants. CONCLUSIONS: We showed that the trait-cell type associations identified by scGWAS, while generally constrained to trait-tissue associations, could recapitulate many well-studied relationships and also reveal novel relationships, providing insights into the unsolved trait-tissue associations. Moreover, in each specific cell type, the associations with different traits were often mediated by different sets of risk genes, implying disease-specific activation of driving processes. In summary, scGWAS is a powerful tool for exploring the genetic basis of complex diseases at the cell type level using single-cell expression data. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-022-02785-w. |
format | Online Article Text |
id | pubmed-9575201 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-95752012022-10-18 scGWAS: landscape of trait-cell type associations by integrating single-cell transcriptomics-wide and genome-wide association studies Jia, Peilin Hu, Ruifeng Yan, Fangfang Dai, Yulin Zhao, Zhongming Genome Biol Research BACKGROUND: The rapid accumulation of single-cell RNA sequencing (scRNA-seq) data presents unique opportunities to decode the genetically mediated cell-type specificity in complex diseases. Here, we develop a new method, scGWAS, which effectively leverages scRNA-seq data to achieve two goals: (1) to infer the cell types in which the disease-associated genes manifest and (2) to construct cellular modules which imply disease-specific activation of different processes. RESULTS: scGWAS only utilizes the average gene expression for each cell type followed by virtual search processes to construct the null distributions of module scores, making it scalable to large scRNA-seq datasets. We demonstrated scGWAS in 40 genome-wide association studies (GWAS) datasets (average sample size N ≈ 154,000) using 18 scRNA-seq datasets from nine major human/mouse tissues (totaling 1.08 million cells) and identified 2533 trait and cell-type associations, each with significant modules for further investigation. The module genes were validated using disease or clinically annotated references from ClinVar, OMIM, and pLI variants. CONCLUSIONS: We showed that the trait-cell type associations identified by scGWAS, while generally constrained to trait-tissue associations, could recapitulate many well-studied relationships and also reveal novel relationships, providing insights into the unsolved trait-tissue associations. Moreover, in each specific cell type, the associations with different traits were often mediated by different sets of risk genes, implying disease-specific activation of driving processes. In summary, scGWAS is a powerful tool for exploring the genetic basis of complex diseases at the cell type level using single-cell expression data. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-022-02785-w. BioMed Central 2022-10-17 /pmc/articles/PMC9575201/ /pubmed/36253801 http://dx.doi.org/10.1186/s13059-022-02785-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Jia, Peilin Hu, Ruifeng Yan, Fangfang Dai, Yulin Zhao, Zhongming scGWAS: landscape of trait-cell type associations by integrating single-cell transcriptomics-wide and genome-wide association studies |
title | scGWAS: landscape of trait-cell type associations by integrating single-cell transcriptomics-wide and genome-wide association studies |
title_full | scGWAS: landscape of trait-cell type associations by integrating single-cell transcriptomics-wide and genome-wide association studies |
title_fullStr | scGWAS: landscape of trait-cell type associations by integrating single-cell transcriptomics-wide and genome-wide association studies |
title_full_unstemmed | scGWAS: landscape of trait-cell type associations by integrating single-cell transcriptomics-wide and genome-wide association studies |
title_short | scGWAS: landscape of trait-cell type associations by integrating single-cell transcriptomics-wide and genome-wide association studies |
title_sort | scgwas: landscape of trait-cell type associations by integrating single-cell transcriptomics-wide and genome-wide association studies |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9575201/ https://www.ncbi.nlm.nih.gov/pubmed/36253801 http://dx.doi.org/10.1186/s13059-022-02785-w |
work_keys_str_mv | AT jiapeilin scgwaslandscapeoftraitcelltypeassociationsbyintegratingsinglecelltranscriptomicswideandgenomewideassociationstudies AT huruifeng scgwaslandscapeoftraitcelltypeassociationsbyintegratingsinglecelltranscriptomicswideandgenomewideassociationstudies AT yanfangfang scgwaslandscapeoftraitcelltypeassociationsbyintegratingsinglecelltranscriptomicswideandgenomewideassociationstudies AT daiyulin scgwaslandscapeoftraitcelltypeassociationsbyintegratingsinglecelltranscriptomicswideandgenomewideassociationstudies AT zhaozhongming scgwaslandscapeoftraitcelltypeassociationsbyintegratingsinglecelltranscriptomicswideandgenomewideassociationstudies |