Cargando…

Isoform-level gene expression patterns in single-cell RNA-sequencing data

MOTIVATION: RNA sequencing of single cells enables characterization of transcriptional heterogeneity in seemingly homogeneous cell populations. Single-cell sequencing has been applied in a wide range of researches fields. However, few studies have focus on characterization of isoform-level expressio...

Descripción completa

Detalles Bibliográficos
Autores principales: Vu, Trung Nghia, Wills, Quin F, Kalari, Krishna R, Niu, Nifang, Wang, Liewei, Pawitan, Yudi, Rantalainen, Mattias
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6041805/
https://www.ncbi.nlm.nih.gov/pubmed/29490015
http://dx.doi.org/10.1093/bioinformatics/bty100
_version_ 1783339050768269312
author Vu, Trung Nghia
Wills, Quin F
Kalari, Krishna R
Niu, Nifang
Wang, Liewei
Pawitan, Yudi
Rantalainen, Mattias
author_facet Vu, Trung Nghia
Wills, Quin F
Kalari, Krishna R
Niu, Nifang
Wang, Liewei
Pawitan, Yudi
Rantalainen, Mattias
author_sort Vu, Trung Nghia
collection PubMed
description MOTIVATION: RNA sequencing of single cells enables characterization of transcriptional heterogeneity in seemingly homogeneous cell populations. Single-cell sequencing has been applied in a wide range of researches fields. However, few studies have focus on characterization of isoform-level expression patterns at the single-cell level. In this study, we propose and apply a novel method, ISOform-Patterns (ISOP), based on mixture modeling, to characterize the expression patterns of isoform pairs from the same gene in single-cell isoform-level expression data. RESULTS: We define six principal patterns of isoform expression relationships and describe a method for differential-pattern analysis. We demonstrate ISOP through analysis of single-cell RNA-sequencing data from a breast cancer cell line, with replication in three independent datasets. We assigned the pattern types to each of 16 562 isoform-pairs from 4929 genes. Among those, 26% of the discovered patterns were significant (P<0.05), while remaining patterns are possibly effects of transcriptional bursting, drop-out and stochastic biological heterogeneity. Furthermore, 32% of genes discovered through differential-pattern analysis were not detected by differential-expression analysis. Finally, the effects of drop-out events and expression levels of isoforms on ISOP's performances were investigated through simulated datasets. To conclude, ISOP provides a novel approach for characterization of isoform-level preference, commitment and heterogeneity in single-cell RNA-sequencing data. AVAILABILITY AND IMPLEMENTATION: The ISOP method has been implemented as a R package and is available at https://github.com/nghiavtr/ISOP under a GPL-3 license. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6041805
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-60418052018-07-17 Isoform-level gene expression patterns in single-cell RNA-sequencing data Vu, Trung Nghia Wills, Quin F Kalari, Krishna R Niu, Nifang Wang, Liewei Pawitan, Yudi Rantalainen, Mattias Bioinformatics Original Papers MOTIVATION: RNA sequencing of single cells enables characterization of transcriptional heterogeneity in seemingly homogeneous cell populations. Single-cell sequencing has been applied in a wide range of researches fields. However, few studies have focus on characterization of isoform-level expression patterns at the single-cell level. In this study, we propose and apply a novel method, ISOform-Patterns (ISOP), based on mixture modeling, to characterize the expression patterns of isoform pairs from the same gene in single-cell isoform-level expression data. RESULTS: We define six principal patterns of isoform expression relationships and describe a method for differential-pattern analysis. We demonstrate ISOP through analysis of single-cell RNA-sequencing data from a breast cancer cell line, with replication in three independent datasets. We assigned the pattern types to each of 16 562 isoform-pairs from 4929 genes. Among those, 26% of the discovered patterns were significant (P<0.05), while remaining patterns are possibly effects of transcriptional bursting, drop-out and stochastic biological heterogeneity. Furthermore, 32% of genes discovered through differential-pattern analysis were not detected by differential-expression analysis. Finally, the effects of drop-out events and expression levels of isoforms on ISOP's performances were investigated through simulated datasets. To conclude, ISOP provides a novel approach for characterization of isoform-level preference, commitment and heterogeneity in single-cell RNA-sequencing data. AVAILABILITY AND IMPLEMENTATION: The ISOP method has been implemented as a R package and is available at https://github.com/nghiavtr/ISOP under a GPL-3 license. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-07-15 2018-02-27 /pmc/articles/PMC6041805/ /pubmed/29490015 http://dx.doi.org/10.1093/bioinformatics/bty100 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Vu, Trung Nghia
Wills, Quin F
Kalari, Krishna R
Niu, Nifang
Wang, Liewei
Pawitan, Yudi
Rantalainen, Mattias
Isoform-level gene expression patterns in single-cell RNA-sequencing data
title Isoform-level gene expression patterns in single-cell RNA-sequencing data
title_full Isoform-level gene expression patterns in single-cell RNA-sequencing data
title_fullStr Isoform-level gene expression patterns in single-cell RNA-sequencing data
title_full_unstemmed Isoform-level gene expression patterns in single-cell RNA-sequencing data
title_short Isoform-level gene expression patterns in single-cell RNA-sequencing data
title_sort isoform-level gene expression patterns in single-cell rna-sequencing data
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6041805/
https://www.ncbi.nlm.nih.gov/pubmed/29490015
http://dx.doi.org/10.1093/bioinformatics/bty100
work_keys_str_mv AT vutrungnghia isoformlevelgeneexpressionpatternsinsinglecellrnasequencingdata
AT willsquinf isoformlevelgeneexpressionpatternsinsinglecellrnasequencingdata
AT kalarikrishnar isoformlevelgeneexpressionpatternsinsinglecellrnasequencingdata
AT niunifang isoformlevelgeneexpressionpatternsinsinglecellrnasequencingdata
AT wangliewei isoformlevelgeneexpressionpatternsinsinglecellrnasequencingdata
AT pawitanyudi isoformlevelgeneexpressionpatternsinsinglecellrnasequencingdata
AT rantalainenmattias isoformlevelgeneexpressionpatternsinsinglecellrnasequencingdata