Cargando…
VBASS enables integration of single cell gene expression data in Bayesian association analysis of rare variants
Rare or de novo variants have substantial contribution to human diseases, but the statistical power to identify risk genes by rare variants is generally low due to rarity of genotype data. Previous studies have shown that risk genes usually have high expression in relevant cell types, although for m...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10368729/ https://www.ncbi.nlm.nih.gov/pubmed/37491581 http://dx.doi.org/10.1038/s42003-023-05155-9 |
_version_ | 1785077567735201792 |
---|---|
author | Zhong, Guojie Choi, Yoolim A. Shen, Yufeng |
author_facet | Zhong, Guojie Choi, Yoolim A. Shen, Yufeng |
author_sort | Zhong, Guojie |
collection | PubMed |
description | Rare or de novo variants have substantial contribution to human diseases, but the statistical power to identify risk genes by rare variants is generally low due to rarity of genotype data. Previous studies have shown that risk genes usually have high expression in relevant cell types, although for many conditions the identity of these cell types are largely unknown. Recent efforts in single cell atlas in human and model organisms produced large amount of gene expression data. Here we present VBASS, a Bayesian method that integrates single-cell expression and de novo variant (DNV) data to improve power of disease risk gene discovery. VBASS models disease risk prior as a function of expression profiles, approximated by deep neural networks. It learns the weights of neural networks and parameters of Gamma-Poisson likelihood models of DNV counts jointly from expression and genetics data. On simulated data, VBASS shows proper error rate control and better power than state-of-the-art methods. We applied VBASS to published datasets and identified more candidate risk genes with supports from literature or data from independent cohorts. VBASS can be generalized to integrate other types of functional genomics data in statistical genetics analysis. |
format | Online Article Text |
id | pubmed-10368729 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-103687292023-07-27 VBASS enables integration of single cell gene expression data in Bayesian association analysis of rare variants Zhong, Guojie Choi, Yoolim A. Shen, Yufeng Commun Biol Article Rare or de novo variants have substantial contribution to human diseases, but the statistical power to identify risk genes by rare variants is generally low due to rarity of genotype data. Previous studies have shown that risk genes usually have high expression in relevant cell types, although for many conditions the identity of these cell types are largely unknown. Recent efforts in single cell atlas in human and model organisms produced large amount of gene expression data. Here we present VBASS, a Bayesian method that integrates single-cell expression and de novo variant (DNV) data to improve power of disease risk gene discovery. VBASS models disease risk prior as a function of expression profiles, approximated by deep neural networks. It learns the weights of neural networks and parameters of Gamma-Poisson likelihood models of DNV counts jointly from expression and genetics data. On simulated data, VBASS shows proper error rate control and better power than state-of-the-art methods. We applied VBASS to published datasets and identified more candidate risk genes with supports from literature or data from independent cohorts. VBASS can be generalized to integrate other types of functional genomics data in statistical genetics analysis. Nature Publishing Group UK 2023-07-25 /pmc/articles/PMC10368729/ /pubmed/37491581 http://dx.doi.org/10.1038/s42003-023-05155-9 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Zhong, Guojie Choi, Yoolim A. Shen, Yufeng VBASS enables integration of single cell gene expression data in Bayesian association analysis of rare variants |
title | VBASS enables integration of single cell gene expression data in Bayesian association analysis of rare variants |
title_full | VBASS enables integration of single cell gene expression data in Bayesian association analysis of rare variants |
title_fullStr | VBASS enables integration of single cell gene expression data in Bayesian association analysis of rare variants |
title_full_unstemmed | VBASS enables integration of single cell gene expression data in Bayesian association analysis of rare variants |
title_short | VBASS enables integration of single cell gene expression data in Bayesian association analysis of rare variants |
title_sort | vbass enables integration of single cell gene expression data in bayesian association analysis of rare variants |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10368729/ https://www.ncbi.nlm.nih.gov/pubmed/37491581 http://dx.doi.org/10.1038/s42003-023-05155-9 |
work_keys_str_mv | AT zhongguojie vbassenablesintegrationofsinglecellgeneexpressiondatainbayesianassociationanalysisofrarevariants AT choiyoolima vbassenablesintegrationofsinglecellgeneexpressiondatainbayesianassociationanalysisofrarevariants AT shenyufeng vbassenablesintegrationofsinglecellgeneexpressiondatainbayesianassociationanalysisofrarevariants |