Cargando…

IQSeq: Integrated Isoform Quantification Analysis Based on Next-Generation Sequencing

With the recent advances in high-throughput RNA sequencing (RNA-Seq), biologists are able to measure transcription with unprecedented precision. One problem that can now be tackled is that of isoform quantification: here one tries to reconstruct the abundances of isoforms of a gene. We have develope...

Descripción completa

Detalles Bibliográficos
Autores principales: Du, Jiang, Leng, Jing, Habegger, Lukas, Sboner, Andrea, McDermott, Drew, Gerstein, Mark
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3253133/
https://www.ncbi.nlm.nih.gov/pubmed/22238592
http://dx.doi.org/10.1371/journal.pone.0029175
_version_ 1782220712405106688
author Du, Jiang
Leng, Jing
Habegger, Lukas
Sboner, Andrea
McDermott, Drew
Gerstein, Mark
author_facet Du, Jiang
Leng, Jing
Habegger, Lukas
Sboner, Andrea
McDermott, Drew
Gerstein, Mark
author_sort Du, Jiang
collection PubMed
description With the recent advances in high-throughput RNA sequencing (RNA-Seq), biologists are able to measure transcription with unprecedented precision. One problem that can now be tackled is that of isoform quantification: here one tries to reconstruct the abundances of isoforms of a gene. We have developed a statistical solution for this problem, based on analyzing a set of RNA-Seq reads, and a practical implementation, available from archive.gersteinlab.org/proj/rnaseq/IQSeq, in a tool we call IQSeq (Isoform Quantification in next-generation Sequencing). Here, we present theoretical results which IQSeq is based on, and then use both simulated and real datasets to illustrate various applications of the tool. In order to measure the accuracy of an isoform-quantification result, one would try to estimate the average variance of the estimated isoform abundances for each gene (based on resampling the RNA-seq reads), and IQSeq has a particularly fast algorithm (based on the Fisher Information Matrix) for calculating this, achieving a speedup of [Image: see text] times compared to brute-force resampling. IQSeq also calculates an information theoretic measure of overall transcriptome complexity to describe isoform abundance for a whole experiment. IQSeq has many features that are particularly useful in RNA-Seq experimental design, allowing one to optimally model the integration of different sequencing technologies in a cost-effective way. In particular, the IQSeq formalism integrates the analysis of different sample (i.e. read) sets generated from different technologies within the same statistical framework. It also supports a generalized statistical partial-sample-generation function to model the sequencing process. This allows one to have a modular, “plugin-able” read-generation function to support the particularities of the many evolving sequencing technologies.
format Online
Article
Text
id pubmed-3253133
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-32531332012-01-11 IQSeq: Integrated Isoform Quantification Analysis Based on Next-Generation Sequencing Du, Jiang Leng, Jing Habegger, Lukas Sboner, Andrea McDermott, Drew Gerstein, Mark PLoS One Research Article With the recent advances in high-throughput RNA sequencing (RNA-Seq), biologists are able to measure transcription with unprecedented precision. One problem that can now be tackled is that of isoform quantification: here one tries to reconstruct the abundances of isoforms of a gene. We have developed a statistical solution for this problem, based on analyzing a set of RNA-Seq reads, and a practical implementation, available from archive.gersteinlab.org/proj/rnaseq/IQSeq, in a tool we call IQSeq (Isoform Quantification in next-generation Sequencing). Here, we present theoretical results which IQSeq is based on, and then use both simulated and real datasets to illustrate various applications of the tool. In order to measure the accuracy of an isoform-quantification result, one would try to estimate the average variance of the estimated isoform abundances for each gene (based on resampling the RNA-seq reads), and IQSeq has a particularly fast algorithm (based on the Fisher Information Matrix) for calculating this, achieving a speedup of [Image: see text] times compared to brute-force resampling. IQSeq also calculates an information theoretic measure of overall transcriptome complexity to describe isoform abundance for a whole experiment. IQSeq has many features that are particularly useful in RNA-Seq experimental design, allowing one to optimally model the integration of different sequencing technologies in a cost-effective way. In particular, the IQSeq formalism integrates the analysis of different sample (i.e. read) sets generated from different technologies within the same statistical framework. It also supports a generalized statistical partial-sample-generation function to model the sequencing process. This allows one to have a modular, “plugin-able” read-generation function to support the particularities of the many evolving sequencing technologies. Public Library of Science 2012-01-06 /pmc/articles/PMC3253133/ /pubmed/22238592 http://dx.doi.org/10.1371/journal.pone.0029175 Text en Du et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Du, Jiang
Leng, Jing
Habegger, Lukas
Sboner, Andrea
McDermott, Drew
Gerstein, Mark
IQSeq: Integrated Isoform Quantification Analysis Based on Next-Generation Sequencing
title IQSeq: Integrated Isoform Quantification Analysis Based on Next-Generation Sequencing
title_full IQSeq: Integrated Isoform Quantification Analysis Based on Next-Generation Sequencing
title_fullStr IQSeq: Integrated Isoform Quantification Analysis Based on Next-Generation Sequencing
title_full_unstemmed IQSeq: Integrated Isoform Quantification Analysis Based on Next-Generation Sequencing
title_short IQSeq: Integrated Isoform Quantification Analysis Based on Next-Generation Sequencing
title_sort iqseq: integrated isoform quantification analysis based on next-generation sequencing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3253133/
https://www.ncbi.nlm.nih.gov/pubmed/22238592
http://dx.doi.org/10.1371/journal.pone.0029175
work_keys_str_mv AT dujiang iqseqintegratedisoformquantificationanalysisbasedonnextgenerationsequencing
AT lengjing iqseqintegratedisoformquantificationanalysisbasedonnextgenerationsequencing
AT habeggerlukas iqseqintegratedisoformquantificationanalysisbasedonnextgenerationsequencing
AT sbonerandrea iqseqintegratedisoformquantificationanalysisbasedonnextgenerationsequencing
AT mcdermottdrew iqseqintegratedisoformquantificationanalysisbasedonnextgenerationsequencing
AT gersteinmark iqseqintegratedisoformquantificationanalysisbasedonnextgenerationsequencing