iSeeRNA: identification of long intergenic non-coding RNA transcripts from transcriptome sequencing data

Bibliographic Details
Main Authors: Sun, Kun; Chen, Xiaona; Jiang, Peiyong; Song, Xiaofeng; Wang, Huating; Sun, Hao
Format: Online Article Text
Language: English
Published: BioMed Central 2013
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3582448/
https://www.ncbi.nlm.nih.gov/pubmed/23445546
http://dx.doi.org/10.1186/1471-2164-14-S2-S7
Description
Summary: BACKGROUND: Long intergenic non-coding RNAs (lincRNAs) are emerging as a novel class of non-coding RNAs and potent gene regulators. High-throughput RNA sequencing combined with de novo assembly promises large-scale discovery of novel transcripts. However, identifying lincRNAs among thousands of assembled transcripts remains challenging because they are difficult to separate from protein-coding transcripts (PCTs). RESULTS: We have implemented iSeeRNA, a support vector machine (SVM)-based classifier for the identification of lincRNAs. iSeeRNA performs better than other software. A publicly available web server for iSeeRNA is also provided for small datasets. CONCLUSIONS: iSeeRNA demonstrates high prediction accuracy and runs several orders of magnitude faster than other similar programs. It can be integrated into transcriptome data analysis pipelines or run as a web server, thus offering a valuable tool for lincRNA study.