Cargando…
IUTA: a tool for effectively detecting differential isoform usage from RNA-Seq data
BACKGROUND: Most genes in mammals generate several transcript isoforms that differ in stability and translational efficiency through alternative splicing. Such alternative splicing can be tissue- and developmental stage-specific, and such specificity is sometimes associated with disease. Thus, detec...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4195885/ https://www.ncbi.nlm.nih.gov/pubmed/25283306 http://dx.doi.org/10.1186/1471-2164-15-862 |
_version_ | 1782339376297017344 |
---|---|
author | Niu, Liang Huang, Weichun Umbach, David M Li, Leping |
author_facet | Niu, Liang Huang, Weichun Umbach, David M Li, Leping |
author_sort | Niu, Liang |
collection | PubMed |
description | BACKGROUND: Most genes in mammals generate several transcript isoforms that differ in stability and translational efficiency through alternative splicing. Such alternative splicing can be tissue- and developmental stage-specific, and such specificity is sometimes associated with disease. Thus, detecting differential isoform usage for a gene between tissues or cell lines/types (differences in the fraction of total expression of a gene represented by the expression of each of its isoforms) is potentially important for cell and developmental biology. RESULTS: We present a new method IUTA that is designed to test each gene in the genome for differential isoform usage between two groups of samples. IUTA also estimates isoform usage for each gene in each sample as well as averaged across samples within each group. IUTA is the first method to formulate the testing problem as testing for equal means of two probability distributions under the Aitchison geometry, which is widely recognized as the most appropriate geometry for compositional data (vectors that contain the relative amount of each component comprising the whole). Evaluation using simulated data showed that IUTA was able to provide test results for many more genes than was Cuffdiff2 (version 2.2.0, released in Mar. 2014), and IUTA performed better than Cuffdiff2 for the limited number of genes that Cuffdiff2 did analyze. When applied to actual mouse RNA-Seq datasets from six tissues, IUTA identified 2,073 significant genes with clear patterns of differential isoform usage between a pair of tissues. IUTA is implemented as an R package and is available at http://www.niehs.nih.gov/research/resources/software/biostatistics/iuta/index.cfm. CONCLUSIONS: Both simulation and real-data results suggest that IUTA accurately detects differential isoform usage. We believe that our analysis of RNA-seq data from six mouse tissues represents the first comprehensive characterization of isoform usage in these tissues. IUTA will be a valuable resource for those who study the roles of alternative transcripts in cell development and disease. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-862) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4195885 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-41958852014-10-15 IUTA: a tool for effectively detecting differential isoform usage from RNA-Seq data Niu, Liang Huang, Weichun Umbach, David M Li, Leping BMC Genomics Methodology Article BACKGROUND: Most genes in mammals generate several transcript isoforms that differ in stability and translational efficiency through alternative splicing. Such alternative splicing can be tissue- and developmental stage-specific, and such specificity is sometimes associated with disease. Thus, detecting differential isoform usage for a gene between tissues or cell lines/types (differences in the fraction of total expression of a gene represented by the expression of each of its isoforms) is potentially important for cell and developmental biology. RESULTS: We present a new method IUTA that is designed to test each gene in the genome for differential isoform usage between two groups of samples. IUTA also estimates isoform usage for each gene in each sample as well as averaged across samples within each group. IUTA is the first method to formulate the testing problem as testing for equal means of two probability distributions under the Aitchison geometry, which is widely recognized as the most appropriate geometry for compositional data (vectors that contain the relative amount of each component comprising the whole). Evaluation using simulated data showed that IUTA was able to provide test results for many more genes than was Cuffdiff2 (version 2.2.0, released in Mar. 2014), and IUTA performed better than Cuffdiff2 for the limited number of genes that Cuffdiff2 did analyze. When applied to actual mouse RNA-Seq datasets from six tissues, IUTA identified 2,073 significant genes with clear patterns of differential isoform usage between a pair of tissues. IUTA is implemented as an R package and is available at http://www.niehs.nih.gov/research/resources/software/biostatistics/iuta/index.cfm. CONCLUSIONS: Both simulation and real-data results suggest that IUTA accurately detects differential isoform usage. We believe that our analysis of RNA-seq data from six mouse tissues represents the first comprehensive characterization of isoform usage in these tissues. IUTA will be a valuable resource for those who study the roles of alternative transcripts in cell development and disease. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-862) contains supplementary material, which is available to authorized users. BioMed Central 2014-10-06 /pmc/articles/PMC4195885/ /pubmed/25283306 http://dx.doi.org/10.1186/1471-2164-15-862 Text en © Niu et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Niu, Liang Huang, Weichun Umbach, David M Li, Leping IUTA: a tool for effectively detecting differential isoform usage from RNA-Seq data |
title | IUTA: a tool for effectively detecting differential isoform usage from RNA-Seq data |
title_full | IUTA: a tool for effectively detecting differential isoform usage from RNA-Seq data |
title_fullStr | IUTA: a tool for effectively detecting differential isoform usage from RNA-Seq data |
title_full_unstemmed | IUTA: a tool for effectively detecting differential isoform usage from RNA-Seq data |
title_short | IUTA: a tool for effectively detecting differential isoform usage from RNA-Seq data |
title_sort | iuta: a tool for effectively detecting differential isoform usage from rna-seq data |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4195885/ https://www.ncbi.nlm.nih.gov/pubmed/25283306 http://dx.doi.org/10.1186/1471-2164-15-862 |
work_keys_str_mv | AT niuliang iutaatoolforeffectivelydetectingdifferentialisoformusagefromrnaseqdata AT huangweichun iutaatoolforeffectivelydetectingdifferentialisoformusagefromrnaseqdata AT umbachdavidm iutaatoolforeffectivelydetectingdifferentialisoformusagefromrnaseqdata AT lileping iutaatoolforeffectivelydetectingdifferentialisoformusagefromrnaseqdata |