Cargando…

VBASS enables integration of single cell gene expression data in Bayesian association analysis of rare variants

Rare or de novo variants have substantial contribution to human diseases, but the statistical power to identify risk genes by rare variants is generally low due to rarity of genotype data. Previous studies have shown that risk genes usually have high expression in relevant cell types, although for m...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhong, Guojie, Choi, Yoolim A., Shen, Yufeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10368729/
https://www.ncbi.nlm.nih.gov/pubmed/37491581
http://dx.doi.org/10.1038/s42003-023-05155-9
_version_ 1785077567735201792
author Zhong, Guojie
Choi, Yoolim A.
Shen, Yufeng
author_facet Zhong, Guojie
Choi, Yoolim A.
Shen, Yufeng
author_sort Zhong, Guojie
collection PubMed
description Rare or de novo variants have substantial contribution to human diseases, but the statistical power to identify risk genes by rare variants is generally low due to rarity of genotype data. Previous studies have shown that risk genes usually have high expression in relevant cell types, although for many conditions the identity of these cell types are largely unknown. Recent efforts in single cell atlas in human and model organisms produced large amount of gene expression data. Here we present VBASS, a Bayesian method that integrates single-cell expression and de novo variant (DNV) data to improve power of disease risk gene discovery. VBASS models disease risk prior as a function of expression profiles, approximated by deep neural networks. It learns the weights of neural networks and parameters of Gamma-Poisson likelihood models of DNV counts jointly from expression and genetics data. On simulated data, VBASS shows proper error rate control and better power than state-of-the-art methods. We applied VBASS to published datasets and identified more candidate risk genes with supports from literature or data from independent cohorts. VBASS can be generalized to integrate other types of functional genomics data in statistical genetics analysis.
format Online
Article
Text
id pubmed-10368729
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-103687292023-07-27 VBASS enables integration of single cell gene expression data in Bayesian association analysis of rare variants Zhong, Guojie Choi, Yoolim A. Shen, Yufeng Commun Biol Article Rare or de novo variants have substantial contribution to human diseases, but the statistical power to identify risk genes by rare variants is generally low due to rarity of genotype data. Previous studies have shown that risk genes usually have high expression in relevant cell types, although for many conditions the identity of these cell types are largely unknown. Recent efforts in single cell atlas in human and model organisms produced large amount of gene expression data. Here we present VBASS, a Bayesian method that integrates single-cell expression and de novo variant (DNV) data to improve power of disease risk gene discovery. VBASS models disease risk prior as a function of expression profiles, approximated by deep neural networks. It learns the weights of neural networks and parameters of Gamma-Poisson likelihood models of DNV counts jointly from expression and genetics data. On simulated data, VBASS shows proper error rate control and better power than state-of-the-art methods. We applied VBASS to published datasets and identified more candidate risk genes with supports from literature or data from independent cohorts. VBASS can be generalized to integrate other types of functional genomics data in statistical genetics analysis. Nature Publishing Group UK 2023-07-25 /pmc/articles/PMC10368729/ /pubmed/37491581 http://dx.doi.org/10.1038/s42003-023-05155-9 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Zhong, Guojie
Choi, Yoolim A.
Shen, Yufeng
VBASS enables integration of single cell gene expression data in Bayesian association analysis of rare variants
title VBASS enables integration of single cell gene expression data in Bayesian association analysis of rare variants
title_full VBASS enables integration of single cell gene expression data in Bayesian association analysis of rare variants
title_fullStr VBASS enables integration of single cell gene expression data in Bayesian association analysis of rare variants
title_full_unstemmed VBASS enables integration of single cell gene expression data in Bayesian association analysis of rare variants
title_short VBASS enables integration of single cell gene expression data in Bayesian association analysis of rare variants
title_sort vbass enables integration of single cell gene expression data in bayesian association analysis of rare variants
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10368729/
https://www.ncbi.nlm.nih.gov/pubmed/37491581
http://dx.doi.org/10.1038/s42003-023-05155-9
work_keys_str_mv AT zhongguojie vbassenablesintegrationofsinglecellgeneexpressiondatainbayesianassociationanalysisofrarevariants
AT choiyoolima vbassenablesintegrationofsinglecellgeneexpressiondatainbayesianassociationanalysisofrarevariants
AT shenyufeng vbassenablesintegrationofsinglecellgeneexpressiondatainbayesianassociationanalysisofrarevariants