Cargando…

Airpart: interpretable statistical models for analyzing allelic imbalance in single-cell datasets

MOTIVATION: Allelic expression analysis aids in detection of cis-regulatory mechanisms of genetic variation, which produce allelic imbalance (AI) in heterozygotes. Measuring AI in bulk data lacking time or spatial resolution has the limitation that cell-type-specific (CTS), spatial- or time-dependen...

Descripción completa

Detalles Bibliográficos
Autores principales: Mu, Wancen, Sarkar, Hirak, Srivastava, Avi, Choi, Kwangbom, Patro, Rob, Love, Michael I
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9113279/
https://www.ncbi.nlm.nih.gov/pubmed/35561168
http://dx.doi.org/10.1093/bioinformatics/btac212
_version_ 1784709556420476928
author Mu, Wancen
Sarkar, Hirak
Srivastava, Avi
Choi, Kwangbom
Patro, Rob
Love, Michael I
author_facet Mu, Wancen
Sarkar, Hirak
Srivastava, Avi
Choi, Kwangbom
Patro, Rob
Love, Michael I
author_sort Mu, Wancen
collection PubMed
description MOTIVATION: Allelic expression analysis aids in detection of cis-regulatory mechanisms of genetic variation, which produce allelic imbalance (AI) in heterozygotes. Measuring AI in bulk data lacking time or spatial resolution has the limitation that cell-type-specific (CTS), spatial- or time-dependent AI signals may be dampened or not detected. RESULTS: We introduce a statistical method airpart for identifying differential CTS AI from single-cell RNA-sequencing data, or dynamics AI from other spatially or time-resolved datasets. airpart outputs discrete partitions of data, pointing to groups of genes and cells under common mechanisms of cis-genetic regulation. In order to account for low counts in single-cell data, our method uses a Generalized Fused Lasso with Binomial likelihood for partitioning groups of cells by AI signal, and a hierarchical Bayesian model for AI statistical inference. In simulation, airpart accurately detected partitions of cell types by their AI and had lower Root Mean Square Error (RMSE) of allelic ratio estimates than existing methods. In real data, airpart identified differential allelic imbalance patterns across cell states and could be used to define trends of AI signal over spatial or time axes. AVAILABILITY AND IMPLEMENTATION: The airpart package is available as an R/Bioconductor package at https://bioconductor.org/packages/airpart. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9113279
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-91132792022-05-18 Airpart: interpretable statistical models for analyzing allelic imbalance in single-cell datasets Mu, Wancen Sarkar, Hirak Srivastava, Avi Choi, Kwangbom Patro, Rob Love, Michael I Bioinformatics Original Papers MOTIVATION: Allelic expression analysis aids in detection of cis-regulatory mechanisms of genetic variation, which produce allelic imbalance (AI) in heterozygotes. Measuring AI in bulk data lacking time or spatial resolution has the limitation that cell-type-specific (CTS), spatial- or time-dependent AI signals may be dampened or not detected. RESULTS: We introduce a statistical method airpart for identifying differential CTS AI from single-cell RNA-sequencing data, or dynamics AI from other spatially or time-resolved datasets. airpart outputs discrete partitions of data, pointing to groups of genes and cells under common mechanisms of cis-genetic regulation. In order to account for low counts in single-cell data, our method uses a Generalized Fused Lasso with Binomial likelihood for partitioning groups of cells by AI signal, and a hierarchical Bayesian model for AI statistical inference. In simulation, airpart accurately detected partitions of cell types by their AI and had lower Root Mean Square Error (RMSE) of allelic ratio estimates than existing methods. In real data, airpart identified differential allelic imbalance patterns across cell states and could be used to define trends of AI signal over spatial or time axes. AVAILABILITY AND IMPLEMENTATION: The airpart package is available as an R/Bioconductor package at https://bioconductor.org/packages/airpart. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-04-06 /pmc/articles/PMC9113279/ /pubmed/35561168 http://dx.doi.org/10.1093/bioinformatics/btac212 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Mu, Wancen
Sarkar, Hirak
Srivastava, Avi
Choi, Kwangbom
Patro, Rob
Love, Michael I
Airpart: interpretable statistical models for analyzing allelic imbalance in single-cell datasets
title Airpart: interpretable statistical models for analyzing allelic imbalance in single-cell datasets
title_full Airpart: interpretable statistical models for analyzing allelic imbalance in single-cell datasets
title_fullStr Airpart: interpretable statistical models for analyzing allelic imbalance in single-cell datasets
title_full_unstemmed Airpart: interpretable statistical models for analyzing allelic imbalance in single-cell datasets
title_short Airpart: interpretable statistical models for analyzing allelic imbalance in single-cell datasets
title_sort airpart: interpretable statistical models for analyzing allelic imbalance in single-cell datasets
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9113279/
https://www.ncbi.nlm.nih.gov/pubmed/35561168
http://dx.doi.org/10.1093/bioinformatics/btac212
work_keys_str_mv AT muwancen airpartinterpretablestatisticalmodelsforanalyzingallelicimbalanceinsinglecelldatasets
AT sarkarhirak airpartinterpretablestatisticalmodelsforanalyzingallelicimbalanceinsinglecelldatasets
AT srivastavaavi airpartinterpretablestatisticalmodelsforanalyzingallelicimbalanceinsinglecelldatasets
AT choikwangbom airpartinterpretablestatisticalmodelsforanalyzingallelicimbalanceinsinglecelldatasets
AT patrorob airpartinterpretablestatisticalmodelsforanalyzingallelicimbalanceinsinglecelldatasets
AT lovemichaeli airpartinterpretablestatisticalmodelsforanalyzingallelicimbalanceinsinglecelldatasets