Cargando…

IUTA: a tool for effectively detecting differential isoform usage from RNA-Seq data

BACKGROUND: Most genes in mammals generate several transcript isoforms that differ in stability and translational efficiency through alternative splicing. Such alternative splicing can be tissue- and developmental stage-specific, and such specificity is sometimes associated with disease. Thus, detec...

Descripción completa

Detalles Bibliográficos
Autores principales: Niu, Liang, Huang, Weichun, Umbach, David M, Li, Leping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4195885/
https://www.ncbi.nlm.nih.gov/pubmed/25283306
http://dx.doi.org/10.1186/1471-2164-15-862
_version_ 1782339376297017344
author Niu, Liang
Huang, Weichun
Umbach, David M
Li, Leping
author_facet Niu, Liang
Huang, Weichun
Umbach, David M
Li, Leping
author_sort Niu, Liang
collection PubMed
description BACKGROUND: Most genes in mammals generate several transcript isoforms that differ in stability and translational efficiency through alternative splicing. Such alternative splicing can be tissue- and developmental stage-specific, and such specificity is sometimes associated with disease. Thus, detecting differential isoform usage for a gene between tissues or cell lines/types (differences in the fraction of total expression of a gene represented by the expression of each of its isoforms) is potentially important for cell and developmental biology. RESULTS: We present a new method IUTA that is designed to test each gene in the genome for differential isoform usage between two groups of samples. IUTA also estimates isoform usage for each gene in each sample as well as averaged across samples within each group. IUTA is the first method to formulate the testing problem as testing for equal means of two probability distributions under the Aitchison geometry, which is widely recognized as the most appropriate geometry for compositional data (vectors that contain the relative amount of each component comprising the whole). Evaluation using simulated data showed that IUTA was able to provide test results for many more genes than was Cuffdiff2 (version 2.2.0, released in Mar. 2014), and IUTA performed better than Cuffdiff2 for the limited number of genes that Cuffdiff2 did analyze. When applied to actual mouse RNA-Seq datasets from six tissues, IUTA identified 2,073 significant genes with clear patterns of differential isoform usage between a pair of tissues. IUTA is implemented as an R package and is available at http://www.niehs.nih.gov/research/resources/software/biostatistics/iuta/index.cfm. CONCLUSIONS: Both simulation and real-data results suggest that IUTA accurately detects differential isoform usage. We believe that our analysis of RNA-seq data from six mouse tissues represents the first comprehensive characterization of isoform usage in these tissues. IUTA will be a valuable resource for those who study the roles of alternative transcripts in cell development and disease. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-862) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4195885
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41958852014-10-15 IUTA: a tool for effectively detecting differential isoform usage from RNA-Seq data Niu, Liang Huang, Weichun Umbach, David M Li, Leping BMC Genomics Methodology Article BACKGROUND: Most genes in mammals generate several transcript isoforms that differ in stability and translational efficiency through alternative splicing. Such alternative splicing can be tissue- and developmental stage-specific, and such specificity is sometimes associated with disease. Thus, detecting differential isoform usage for a gene between tissues or cell lines/types (differences in the fraction of total expression of a gene represented by the expression of each of its isoforms) is potentially important for cell and developmental biology. RESULTS: We present a new method IUTA that is designed to test each gene in the genome for differential isoform usage between two groups of samples. IUTA also estimates isoform usage for each gene in each sample as well as averaged across samples within each group. IUTA is the first method to formulate the testing problem as testing for equal means of two probability distributions under the Aitchison geometry, which is widely recognized as the most appropriate geometry for compositional data (vectors that contain the relative amount of each component comprising the whole). Evaluation using simulated data showed that IUTA was able to provide test results for many more genes than was Cuffdiff2 (version 2.2.0, released in Mar. 2014), and IUTA performed better than Cuffdiff2 for the limited number of genes that Cuffdiff2 did analyze. When applied to actual mouse RNA-Seq datasets from six tissues, IUTA identified 2,073 significant genes with clear patterns of differential isoform usage between a pair of tissues. IUTA is implemented as an R package and is available at http://www.niehs.nih.gov/research/resources/software/biostatistics/iuta/index.cfm. CONCLUSIONS: Both simulation and real-data results suggest that IUTA accurately detects differential isoform usage. We believe that our analysis of RNA-seq data from six mouse tissues represents the first comprehensive characterization of isoform usage in these tissues. IUTA will be a valuable resource for those who study the roles of alternative transcripts in cell development and disease. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-862) contains supplementary material, which is available to authorized users. BioMed Central 2014-10-06 /pmc/articles/PMC4195885/ /pubmed/25283306 http://dx.doi.org/10.1186/1471-2164-15-862 Text en © Niu et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Niu, Liang
Huang, Weichun
Umbach, David M
Li, Leping
IUTA: a tool for effectively detecting differential isoform usage from RNA-Seq data
title IUTA: a tool for effectively detecting differential isoform usage from RNA-Seq data
title_full IUTA: a tool for effectively detecting differential isoform usage from RNA-Seq data
title_fullStr IUTA: a tool for effectively detecting differential isoform usage from RNA-Seq data
title_full_unstemmed IUTA: a tool for effectively detecting differential isoform usage from RNA-Seq data
title_short IUTA: a tool for effectively detecting differential isoform usage from RNA-Seq data
title_sort iuta: a tool for effectively detecting differential isoform usage from rna-seq data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4195885/
https://www.ncbi.nlm.nih.gov/pubmed/25283306
http://dx.doi.org/10.1186/1471-2164-15-862
work_keys_str_mv AT niuliang iutaatoolforeffectivelydetectingdifferentialisoformusagefromrnaseqdata
AT huangweichun iutaatoolforeffectivelydetectingdifferentialisoformusagefromrnaseqdata
AT umbachdavidm iutaatoolforeffectivelydetectingdifferentialisoformusagefromrnaseqdata
AT lileping iutaatoolforeffectivelydetectingdifferentialisoformusagefromrnaseqdata