Cargando…

CLASS2: accurate and efficient splice variant annotation from RNA-seq reads

Next generation sequencing of cellular RNA is making it possible to characterize genes and alternative splicing in unprecedented detail. However, designing bioinformatics tools to accurately capture splicing variation has proven difficult. Current programs can find major isoforms of a gene but miss...

Descripción completa

Detalles Bibliográficos
Autores principales: Song, Li, Sabunciyan, Sarven, Florea, Liliana
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4889935/
https://www.ncbi.nlm.nih.gov/pubmed/26975657
http://dx.doi.org/10.1093/nar/gkw158
_version_ 1782435037182951424
author Song, Li
Sabunciyan, Sarven
Florea, Liliana
author_facet Song, Li
Sabunciyan, Sarven
Florea, Liliana
author_sort Song, Li
collection PubMed
description Next generation sequencing of cellular RNA is making it possible to characterize genes and alternative splicing in unprecedented detail. However, designing bioinformatics tools to accurately capture splicing variation has proven difficult. Current programs can find major isoforms of a gene but miss lower abundance variants, or are sensitive but imprecise. CLASS2 is a novel open source tool for accurate genome-guided transcriptome assembly from RNA-seq reads based on the model of splice graph. An extension of our program CLASS, CLASS2 jointly optimizes read patterns and the number of supporting reads to score and prioritize transcripts, implemented in a novel, scalable and efficient dynamic programming algorithm. When compared against reference programs, CLASS2 had the best overall accuracy and could detect up to twice as many splicing events with precision similar to the best reference program. Notably, it was the only tool to produce consistently reliable transcript models for a wide range of applications and sequencing strategies, including ribosomal RNA-depleted samples. Lightweight and multi-threaded, CLASS2 requires <3GB RAM and can analyze a 350 million read set within hours, and can be widely applied to transcriptomics studies ranging from clinical RNA sequencing, to alternative splicing analyses, and to the annotation of new genomes.
format Online
Article
Text
id pubmed-4889935
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-48899352016-06-06 CLASS2: accurate and efficient splice variant annotation from RNA-seq reads Song, Li Sabunciyan, Sarven Florea, Liliana Nucleic Acids Res Methods Online Next generation sequencing of cellular RNA is making it possible to characterize genes and alternative splicing in unprecedented detail. However, designing bioinformatics tools to accurately capture splicing variation has proven difficult. Current programs can find major isoforms of a gene but miss lower abundance variants, or are sensitive but imprecise. CLASS2 is a novel open source tool for accurate genome-guided transcriptome assembly from RNA-seq reads based on the model of splice graph. An extension of our program CLASS, CLASS2 jointly optimizes read patterns and the number of supporting reads to score and prioritize transcripts, implemented in a novel, scalable and efficient dynamic programming algorithm. When compared against reference programs, CLASS2 had the best overall accuracy and could detect up to twice as many splicing events with precision similar to the best reference program. Notably, it was the only tool to produce consistently reliable transcript models for a wide range of applications and sequencing strategies, including ribosomal RNA-depleted samples. Lightweight and multi-threaded, CLASS2 requires <3GB RAM and can analyze a 350 million read set within hours, and can be widely applied to transcriptomics studies ranging from clinical RNA sequencing, to alternative splicing analyses, and to the annotation of new genomes. Oxford University Press 2016-06-02 2016-03-14 /pmc/articles/PMC4889935/ /pubmed/26975657 http://dx.doi.org/10.1093/nar/gkw158 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Song, Li
Sabunciyan, Sarven
Florea, Liliana
CLASS2: accurate and efficient splice variant annotation from RNA-seq reads
title CLASS2: accurate and efficient splice variant annotation from RNA-seq reads
title_full CLASS2: accurate and efficient splice variant annotation from RNA-seq reads
title_fullStr CLASS2: accurate and efficient splice variant annotation from RNA-seq reads
title_full_unstemmed CLASS2: accurate and efficient splice variant annotation from RNA-seq reads
title_short CLASS2: accurate and efficient splice variant annotation from RNA-seq reads
title_sort class2: accurate and efficient splice variant annotation from rna-seq reads
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4889935/
https://www.ncbi.nlm.nih.gov/pubmed/26975657
http://dx.doi.org/10.1093/nar/gkw158
work_keys_str_mv AT songli class2accurateandefficientsplicevariantannotationfromrnaseqreads
AT sabunciyansarven class2accurateandefficientsplicevariantannotationfromrnaseqreads
AT florealiliana class2accurateandefficientsplicevariantannotationfromrnaseqreads