Cargando…
Efficient RNA isoform identification and quantification from RNA-Seq data with network flows
Motivation: Several state-of-the-art methods for isoform identification and quantification are based on [Formula: see text]-regularized regression, such as the Lasso. However, explicitly listing the—possibly exponentially—large set of candidate transcripts is intractable for genes with many exons. F...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4147886/ https://www.ncbi.nlm.nih.gov/pubmed/24813214 http://dx.doi.org/10.1093/bioinformatics/btu317 |
_version_ | 1782332530456788992 |
---|---|
author | Bernard, Elsa Jacob, Laurent Mairal, Julien Vert, Jean-Philippe |
author_facet | Bernard, Elsa Jacob, Laurent Mairal, Julien Vert, Jean-Philippe |
author_sort | Bernard, Elsa |
collection | PubMed |
description | Motivation: Several state-of-the-art methods for isoform identification and quantification are based on [Formula: see text]-regularized regression, such as the Lasso. However, explicitly listing the—possibly exponentially—large set of candidate transcripts is intractable for genes with many exons. For this reason, existing approaches using the [Formula: see text]-penalty are either restricted to genes with few exons or only run the regression algorithm on a small set of preselected isoforms. Results: We introduce a new technique called FlipFlop, which can efficiently tackle the sparse estimation problem on the full set of candidate isoforms by using network flow optimization. Our technique removes the need of a preselection step, leading to better isoform identification while keeping a low computational cost. Experiments with synthetic and real RNA-Seq data confirm that our approach is more accurate than alternative methods and one of the fastest available. Availability and implementation: Source code is freely available as an R package from the Bioconductor Web site (http://www.bioconductor.org/), and more information is available at http://cbio.ensmp.fr/flipflop. Contact: Jean-Philippe.Vert@mines.org Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-4147886 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-41478862014-09-02 Efficient RNA isoform identification and quantification from RNA-Seq data with network flows Bernard, Elsa Jacob, Laurent Mairal, Julien Vert, Jean-Philippe Bioinformatics Original Papers Motivation: Several state-of-the-art methods for isoform identification and quantification are based on [Formula: see text]-regularized regression, such as the Lasso. However, explicitly listing the—possibly exponentially—large set of candidate transcripts is intractable for genes with many exons. For this reason, existing approaches using the [Formula: see text]-penalty are either restricted to genes with few exons or only run the regression algorithm on a small set of preselected isoforms. Results: We introduce a new technique called FlipFlop, which can efficiently tackle the sparse estimation problem on the full set of candidate isoforms by using network flow optimization. Our technique removes the need of a preselection step, leading to better isoform identification while keeping a low computational cost. Experiments with synthetic and real RNA-Seq data confirm that our approach is more accurate than alternative methods and one of the fastest available. Availability and implementation: Source code is freely available as an R package from the Bioconductor Web site (http://www.bioconductor.org/), and more information is available at http://cbio.ensmp.fr/flipflop. Contact: Jean-Philippe.Vert@mines.org Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2014-09-01 2014-05-09 /pmc/articles/PMC4147886/ /pubmed/24813214 http://dx.doi.org/10.1093/bioinformatics/btu317 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Bernard, Elsa Jacob, Laurent Mairal, Julien Vert, Jean-Philippe Efficient RNA isoform identification and quantification from RNA-Seq data with network flows |
title | Efficient RNA isoform identification and quantification from RNA-Seq data with network flows |
title_full | Efficient RNA isoform identification and quantification from RNA-Seq data with network flows |
title_fullStr | Efficient RNA isoform identification and quantification from RNA-Seq data with network flows |
title_full_unstemmed | Efficient RNA isoform identification and quantification from RNA-Seq data with network flows |
title_short | Efficient RNA isoform identification and quantification from RNA-Seq data with network flows |
title_sort | efficient rna isoform identification and quantification from rna-seq data with network flows |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4147886/ https://www.ncbi.nlm.nih.gov/pubmed/24813214 http://dx.doi.org/10.1093/bioinformatics/btu317 |
work_keys_str_mv | AT bernardelsa efficientrnaisoformidentificationandquantificationfromrnaseqdatawithnetworkflows AT jacoblaurent efficientrnaisoformidentificationandquantificationfromrnaseqdatawithnetworkflows AT mairaljulien efficientrnaisoformidentificationandquantificationfromrnaseqdatawithnetworkflows AT vertjeanphilippe efficientrnaisoformidentificationandquantificationfromrnaseqdatawithnetworkflows |