Cargando…

CircMarker: a fast and accurate algorithm for circular RNA detection

BACKGROUND: While RNA is often created from linear splicing during transcription, recent studies have found that non-canonical splicing sometimes occurs. Non-canonical splicing joins 3’ and 5’ and forms the so-called circular RNA. It is now believed that circular RNA plays important biological roles...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Xin, Chu, Chong, Pei, Jingwen, Măndoiu, Ion, Wu, Yufeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6101086/
https://www.ncbi.nlm.nih.gov/pubmed/30367583
http://dx.doi.org/10.1186/s12864-018-4926-0
Descripción
Sumario:BACKGROUND: While RNA is often created from linear splicing during transcription, recent studies have found that non-canonical splicing sometimes occurs. Non-canonical splicing joins 3’ and 5’ and forms the so-called circular RNA. It is now believed that circular RNA plays important biological roles such as affecting susceptibility of some diseases. During the past several years, multiple experimental methods have been developed to enrich circular RNA while degrade linear RNA. Although several useful software tools for circular RNA detection have been developed as well, these tools are based on reads mapping may miss many circular RNA. Also, existing tools are slow for large data due to their dependence on reads mapping. METHOD: In this paper, we present a new computational approach, named CircMarker, based on k-mers rather than reads mapping for circular RNA detection. CircMarker takes advantage of transcriptome annotation files to create the k-mer table for circular RNA detection. RESULTS: Empirical results show that CircMarker outperforms existing tools in circular RNA detection on accuracy and efficiency in many simulated and real datasets. CONCLUSIONS: We develop a new circular RNA detection method called CircMarker based on k-mer analysis. Our results on both simulation data and real data demonstrate that CircMarker runs much faster and can find more circular RNA with higher consensus-based sensitivity and high accuracy ratio compared with existing tools. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-4926-0) contains supplementary material, which is available to authorized users.