Cargando…
CLASS2: accurate and efficient splice variant annotation from RNA-seq reads
Next generation sequencing of cellular RNA is making it possible to characterize genes and alternative splicing in unprecedented detail. However, designing bioinformatics tools to accurately capture splicing variation has proven difficult. Current programs can find major isoforms of a gene but miss...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4889935/ https://www.ncbi.nlm.nih.gov/pubmed/26975657 http://dx.doi.org/10.1093/nar/gkw158 |
_version_ | 1782435037182951424 |
---|---|
author | Song, Li Sabunciyan, Sarven Florea, Liliana |
author_facet | Song, Li Sabunciyan, Sarven Florea, Liliana |
author_sort | Song, Li |
collection | PubMed |
description | Next generation sequencing of cellular RNA is making it possible to characterize genes and alternative splicing in unprecedented detail. However, designing bioinformatics tools to accurately capture splicing variation has proven difficult. Current programs can find major isoforms of a gene but miss lower abundance variants, or are sensitive but imprecise. CLASS2 is a novel open source tool for accurate genome-guided transcriptome assembly from RNA-seq reads based on the model of splice graph. An extension of our program CLASS, CLASS2 jointly optimizes read patterns and the number of supporting reads to score and prioritize transcripts, implemented in a novel, scalable and efficient dynamic programming algorithm. When compared against reference programs, CLASS2 had the best overall accuracy and could detect up to twice as many splicing events with precision similar to the best reference program. Notably, it was the only tool to produce consistently reliable transcript models for a wide range of applications and sequencing strategies, including ribosomal RNA-depleted samples. Lightweight and multi-threaded, CLASS2 requires <3GB RAM and can analyze a 350 million read set within hours, and can be widely applied to transcriptomics studies ranging from clinical RNA sequencing, to alternative splicing analyses, and to the annotation of new genomes. |
format | Online Article Text |
id | pubmed-4889935 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-48899352016-06-06 CLASS2: accurate and efficient splice variant annotation from RNA-seq reads Song, Li Sabunciyan, Sarven Florea, Liliana Nucleic Acids Res Methods Online Next generation sequencing of cellular RNA is making it possible to characterize genes and alternative splicing in unprecedented detail. However, designing bioinformatics tools to accurately capture splicing variation has proven difficult. Current programs can find major isoforms of a gene but miss lower abundance variants, or are sensitive but imprecise. CLASS2 is a novel open source tool for accurate genome-guided transcriptome assembly from RNA-seq reads based on the model of splice graph. An extension of our program CLASS, CLASS2 jointly optimizes read patterns and the number of supporting reads to score and prioritize transcripts, implemented in a novel, scalable and efficient dynamic programming algorithm. When compared against reference programs, CLASS2 had the best overall accuracy and could detect up to twice as many splicing events with precision similar to the best reference program. Notably, it was the only tool to produce consistently reliable transcript models for a wide range of applications and sequencing strategies, including ribosomal RNA-depleted samples. Lightweight and multi-threaded, CLASS2 requires <3GB RAM and can analyze a 350 million read set within hours, and can be widely applied to transcriptomics studies ranging from clinical RNA sequencing, to alternative splicing analyses, and to the annotation of new genomes. Oxford University Press 2016-06-02 2016-03-14 /pmc/articles/PMC4889935/ /pubmed/26975657 http://dx.doi.org/10.1093/nar/gkw158 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Song, Li Sabunciyan, Sarven Florea, Liliana CLASS2: accurate and efficient splice variant annotation from RNA-seq reads |
title | CLASS2: accurate and efficient splice variant annotation from RNA-seq reads |
title_full | CLASS2: accurate and efficient splice variant annotation from RNA-seq reads |
title_fullStr | CLASS2: accurate and efficient splice variant annotation from RNA-seq reads |
title_full_unstemmed | CLASS2: accurate and efficient splice variant annotation from RNA-seq reads |
title_short | CLASS2: accurate and efficient splice variant annotation from RNA-seq reads |
title_sort | class2: accurate and efficient splice variant annotation from rna-seq reads |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4889935/ https://www.ncbi.nlm.nih.gov/pubmed/26975657 http://dx.doi.org/10.1093/nar/gkw158 |
work_keys_str_mv | AT songli class2accurateandefficientsplicevariantannotationfromrnaseqreads AT sabunciyansarven class2accurateandefficientsplicevariantannotationfromrnaseqreads AT florealiliana class2accurateandefficientsplicevariantannotationfromrnaseqreads |