Cargando…

Modelling capture efficiency of single-cell RNA-sequencing data improves inference of transcriptome-wide burst kinetics

MOTIVATION: Gene expression is characterized by stochastic bursts of transcription that occur at brief and random periods of promoter activity. The kinetics of gene expression burstiness differs across the genome and is dependent on the promoter sequence, among other factors. Single-cell RNA sequenc...

Descripción completa

Detalles Bibliográficos
Autores principales: Tang, Wenhao, Jørgensen, Andreas Christ Sølvsten, Marguerat, Samuel, Thomas, Philipp, Shahrezaei, Vahid
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10318389/
https://www.ncbi.nlm.nih.gov/pubmed/37354494
http://dx.doi.org/10.1093/bioinformatics/btad395
_version_ 1785068026682408960
author Tang, Wenhao
Jørgensen, Andreas Christ Sølvsten
Marguerat, Samuel
Thomas, Philipp
Shahrezaei, Vahid
author_facet Tang, Wenhao
Jørgensen, Andreas Christ Sølvsten
Marguerat, Samuel
Thomas, Philipp
Shahrezaei, Vahid
author_sort Tang, Wenhao
collection PubMed
description MOTIVATION: Gene expression is characterized by stochastic bursts of transcription that occur at brief and random periods of promoter activity. The kinetics of gene expression burstiness differs across the genome and is dependent on the promoter sequence, among other factors. Single-cell RNA sequencing (scRNA-seq) has made it possible to quantify the cell-to-cell variability in transcription at a global genome-wide level. However, scRNA-seq data are prone to technical variability, including low and variable capture efficiency of transcripts from individual cells. RESULTS: Here, we propose a novel mathematical theory for the observed variability in scRNA-seq data. Our method captures burst kinetics and variability in both the cell size and capture efficiency, which allows us to propose several likelihood-based and simulation-based methods for the inference of burst kinetics from scRNA-seq data. Using both synthetic and real data, we show that the simulation-based methods provide an accurate, robust and flexible tool for inferring burst kinetics from scRNA-seq data. In particular, in a supervised manner, a simulation-based inference method based on neural networks proves to be accurate and useful when applied to both allele and nonallele-specific scRNA-seq data. AVAILABILITY AND IMPLEMENTATION: The code for Neural Network and Approximate Bayesian Computation inference is available at https://github.com/WT215/nnRNA and https://github.com/WT215/Julia_ABC, respectively.
format Online
Article
Text
id pubmed-10318389
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-103183892023-07-05 Modelling capture efficiency of single-cell RNA-sequencing data improves inference of transcriptome-wide burst kinetics Tang, Wenhao Jørgensen, Andreas Christ Sølvsten Marguerat, Samuel Thomas, Philipp Shahrezaei, Vahid Bioinformatics Original Paper MOTIVATION: Gene expression is characterized by stochastic bursts of transcription that occur at brief and random periods of promoter activity. The kinetics of gene expression burstiness differs across the genome and is dependent on the promoter sequence, among other factors. Single-cell RNA sequencing (scRNA-seq) has made it possible to quantify the cell-to-cell variability in transcription at a global genome-wide level. However, scRNA-seq data are prone to technical variability, including low and variable capture efficiency of transcripts from individual cells. RESULTS: Here, we propose a novel mathematical theory for the observed variability in scRNA-seq data. Our method captures burst kinetics and variability in both the cell size and capture efficiency, which allows us to propose several likelihood-based and simulation-based methods for the inference of burst kinetics from scRNA-seq data. Using both synthetic and real data, we show that the simulation-based methods provide an accurate, robust and flexible tool for inferring burst kinetics from scRNA-seq data. In particular, in a supervised manner, a simulation-based inference method based on neural networks proves to be accurate and useful when applied to both allele and nonallele-specific scRNA-seq data. AVAILABILITY AND IMPLEMENTATION: The code for Neural Network and Approximate Bayesian Computation inference is available at https://github.com/WT215/nnRNA and https://github.com/WT215/Julia_ABC, respectively. Oxford University Press 2023-06-24 /pmc/articles/PMC10318389/ /pubmed/37354494 http://dx.doi.org/10.1093/bioinformatics/btad395 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Tang, Wenhao
Jørgensen, Andreas Christ Sølvsten
Marguerat, Samuel
Thomas, Philipp
Shahrezaei, Vahid
Modelling capture efficiency of single-cell RNA-sequencing data improves inference of transcriptome-wide burst kinetics
title Modelling capture efficiency of single-cell RNA-sequencing data improves inference of transcriptome-wide burst kinetics
title_full Modelling capture efficiency of single-cell RNA-sequencing data improves inference of transcriptome-wide burst kinetics
title_fullStr Modelling capture efficiency of single-cell RNA-sequencing data improves inference of transcriptome-wide burst kinetics
title_full_unstemmed Modelling capture efficiency of single-cell RNA-sequencing data improves inference of transcriptome-wide burst kinetics
title_short Modelling capture efficiency of single-cell RNA-sequencing data improves inference of transcriptome-wide burst kinetics
title_sort modelling capture efficiency of single-cell rna-sequencing data improves inference of transcriptome-wide burst kinetics
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10318389/
https://www.ncbi.nlm.nih.gov/pubmed/37354494
http://dx.doi.org/10.1093/bioinformatics/btad395
work_keys_str_mv AT tangwenhao modellingcaptureefficiencyofsinglecellrnasequencingdataimprovesinferenceoftranscriptomewideburstkinetics
AT jørgensenandreaschristsølvsten modellingcaptureefficiencyofsinglecellrnasequencingdataimprovesinferenceoftranscriptomewideburstkinetics
AT margueratsamuel modellingcaptureefficiencyofsinglecellrnasequencingdataimprovesinferenceoftranscriptomewideburstkinetics
AT thomasphilipp modellingcaptureefficiencyofsinglecellrnasequencingdataimprovesinferenceoftranscriptomewideburstkinetics
AT shahrezaeivahid modellingcaptureefficiencyofsinglecellrnasequencingdataimprovesinferenceoftranscriptomewideburstkinetics