Cargando…

Estimating the Allele-Specific Expression of SNVs From 10× Genomics Single-Cell RNA-Sequencing Data

With the recent advances in single-cell RNA-sequencing (scRNA-seq) technologies, the estimation of allele expression from single cells is becoming increasingly reliable. Allele expression is both quantitative and dynamic and is an essential component of the genomic interactome. Here, we systematical...

Descripción completa

Detalles Bibliográficos
Autores principales: N. M., Prashant, Liu, Hongyu, Bousounis, Pavlos, Spurr, Liam, Alomran, Nawaf, Ibeawuchi, Helen, Sein, Justin, Reece-Stremtan, Dacian, Horvath, Anelia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7140866/
https://www.ncbi.nlm.nih.gov/pubmed/32106453
http://dx.doi.org/10.3390/genes11030240
_version_ 1783519087119302656
author N. M., Prashant
Liu, Hongyu
Bousounis, Pavlos
Spurr, Liam
Alomran, Nawaf
Ibeawuchi, Helen
Sein, Justin
Reece-Stremtan, Dacian
Horvath, Anelia
author_facet N. M., Prashant
Liu, Hongyu
Bousounis, Pavlos
Spurr, Liam
Alomran, Nawaf
Ibeawuchi, Helen
Sein, Justin
Reece-Stremtan, Dacian
Horvath, Anelia
author_sort N. M., Prashant
collection PubMed
description With the recent advances in single-cell RNA-sequencing (scRNA-seq) technologies, the estimation of allele expression from single cells is becoming increasingly reliable. Allele expression is both quantitative and dynamic and is an essential component of the genomic interactome. Here, we systematically estimate the allele expression from heterozygous single nucleotide variant (SNV) loci using scRNA-seq data generated on the 10×Genomics Chromium platform. We analyzed 26,640 human adipose-derived mesenchymal stem cells (from three healthy donors), sequenced to an average of 150K sequencing reads per cell (more than 4 billion scRNA-seq reads in total). High-quality SNV calls assessed in our study contained approximately 15% exonic and >50% intronic loci. To analyze the allele expression, we estimated the expressed variant allele fraction (VAF(RNA)) from SNV-aware alignments and analyzed its variance and distribution (mono- and bi-allelic) at different minimum sequencing read thresholds. Our analysis shows that when assessing positions covered by a minimum of three unique sequencing reads, over 50% of the heterozygous SNVs show bi-allelic expression, while at a threshold of 10 reads, nearly 90% of the SNVs are bi-allelic. In addition, our analysis demonstrates the feasibility of scVAF(RNA) estimation from current scRNA-seq datasets and shows that the 3′-based library generation protocol of 10×Genomics scRNA-seq data can be informative in SNV-based studies, including analyses of transcriptional kinetics.
format Online
Article
Text
id pubmed-7140866
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-71408662020-04-10 Estimating the Allele-Specific Expression of SNVs From 10× Genomics Single-Cell RNA-Sequencing Data N. M., Prashant Liu, Hongyu Bousounis, Pavlos Spurr, Liam Alomran, Nawaf Ibeawuchi, Helen Sein, Justin Reece-Stremtan, Dacian Horvath, Anelia Genes (Basel) Article With the recent advances in single-cell RNA-sequencing (scRNA-seq) technologies, the estimation of allele expression from single cells is becoming increasingly reliable. Allele expression is both quantitative and dynamic and is an essential component of the genomic interactome. Here, we systematically estimate the allele expression from heterozygous single nucleotide variant (SNV) loci using scRNA-seq data generated on the 10×Genomics Chromium platform. We analyzed 26,640 human adipose-derived mesenchymal stem cells (from three healthy donors), sequenced to an average of 150K sequencing reads per cell (more than 4 billion scRNA-seq reads in total). High-quality SNV calls assessed in our study contained approximately 15% exonic and >50% intronic loci. To analyze the allele expression, we estimated the expressed variant allele fraction (VAF(RNA)) from SNV-aware alignments and analyzed its variance and distribution (mono- and bi-allelic) at different minimum sequencing read thresholds. Our analysis shows that when assessing positions covered by a minimum of three unique sequencing reads, over 50% of the heterozygous SNVs show bi-allelic expression, while at a threshold of 10 reads, nearly 90% of the SNVs are bi-allelic. In addition, our analysis demonstrates the feasibility of scVAF(RNA) estimation from current scRNA-seq datasets and shows that the 3′-based library generation protocol of 10×Genomics scRNA-seq data can be informative in SNV-based studies, including analyses of transcriptional kinetics. MDPI 2020-02-25 /pmc/articles/PMC7140866/ /pubmed/32106453 http://dx.doi.org/10.3390/genes11030240 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
N. M., Prashant
Liu, Hongyu
Bousounis, Pavlos
Spurr, Liam
Alomran, Nawaf
Ibeawuchi, Helen
Sein, Justin
Reece-Stremtan, Dacian
Horvath, Anelia
Estimating the Allele-Specific Expression of SNVs From 10× Genomics Single-Cell RNA-Sequencing Data
title Estimating the Allele-Specific Expression of SNVs From 10× Genomics Single-Cell RNA-Sequencing Data
title_full Estimating the Allele-Specific Expression of SNVs From 10× Genomics Single-Cell RNA-Sequencing Data
title_fullStr Estimating the Allele-Specific Expression of SNVs From 10× Genomics Single-Cell RNA-Sequencing Data
title_full_unstemmed Estimating the Allele-Specific Expression of SNVs From 10× Genomics Single-Cell RNA-Sequencing Data
title_short Estimating the Allele-Specific Expression of SNVs From 10× Genomics Single-Cell RNA-Sequencing Data
title_sort estimating the allele-specific expression of snvs from 10× genomics single-cell rna-sequencing data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7140866/
https://www.ncbi.nlm.nih.gov/pubmed/32106453
http://dx.doi.org/10.3390/genes11030240
work_keys_str_mv AT nmprashant estimatingtheallelespecificexpressionofsnvsfrom10genomicssinglecellrnasequencingdata
AT liuhongyu estimatingtheallelespecificexpressionofsnvsfrom10genomicssinglecellrnasequencingdata
AT bousounispavlos estimatingtheallelespecificexpressionofsnvsfrom10genomicssinglecellrnasequencingdata
AT spurrliam estimatingtheallelespecificexpressionofsnvsfrom10genomicssinglecellrnasequencingdata
AT alomrannawaf estimatingtheallelespecificexpressionofsnvsfrom10genomicssinglecellrnasequencingdata
AT ibeawuchihelen estimatingtheallelespecificexpressionofsnvsfrom10genomicssinglecellrnasequencingdata
AT seinjustin estimatingtheallelespecificexpressionofsnvsfrom10genomicssinglecellrnasequencingdata
AT reecestremtandacian estimatingtheallelespecificexpressionofsnvsfrom10genomicssinglecellrnasequencingdata
AT horvathanelia estimatingtheallelespecificexpressionofsnvsfrom10genomicssinglecellrnasequencingdata