Cargando…

CircMarker: a fast and accurate algorithm for circular RNA detection

BACKGROUND: While RNA is often created from linear splicing during transcription, recent studies have found that non-canonical splicing sometimes occurs. Non-canonical splicing joins 3’ and 5’ and forms the so-called circular RNA. It is now believed that circular RNA plays important biological roles...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Xin, Chu, Chong, Pei, Jingwen, Măndoiu, Ion, Wu, Yufeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6101086/
https://www.ncbi.nlm.nih.gov/pubmed/30367583
http://dx.doi.org/10.1186/s12864-018-4926-0
_version_ 1783348985460686848
author Li, Xin
Chu, Chong
Pei, Jingwen
Măndoiu, Ion
Wu, Yufeng
author_facet Li, Xin
Chu, Chong
Pei, Jingwen
Măndoiu, Ion
Wu, Yufeng
author_sort Li, Xin
collection PubMed
description BACKGROUND: While RNA is often created from linear splicing during transcription, recent studies have found that non-canonical splicing sometimes occurs. Non-canonical splicing joins 3’ and 5’ and forms the so-called circular RNA. It is now believed that circular RNA plays important biological roles such as affecting susceptibility of some diseases. During the past several years, multiple experimental methods have been developed to enrich circular RNA while degrade linear RNA. Although several useful software tools for circular RNA detection have been developed as well, these tools are based on reads mapping may miss many circular RNA. Also, existing tools are slow for large data due to their dependence on reads mapping. METHOD: In this paper, we present a new computational approach, named CircMarker, based on k-mers rather than reads mapping for circular RNA detection. CircMarker takes advantage of transcriptome annotation files to create the k-mer table for circular RNA detection. RESULTS: Empirical results show that CircMarker outperforms existing tools in circular RNA detection on accuracy and efficiency in many simulated and real datasets. CONCLUSIONS: We develop a new circular RNA detection method called CircMarker based on k-mer analysis. Our results on both simulation data and real data demonstrate that CircMarker runs much faster and can find more circular RNA with higher consensus-based sensitivity and high accuracy ratio compared with existing tools. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-4926-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6101086
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-61010862018-08-27 CircMarker: a fast and accurate algorithm for circular RNA detection Li, Xin Chu, Chong Pei, Jingwen Măndoiu, Ion Wu, Yufeng BMC Genomics Software BACKGROUND: While RNA is often created from linear splicing during transcription, recent studies have found that non-canonical splicing sometimes occurs. Non-canonical splicing joins 3’ and 5’ and forms the so-called circular RNA. It is now believed that circular RNA plays important biological roles such as affecting susceptibility of some diseases. During the past several years, multiple experimental methods have been developed to enrich circular RNA while degrade linear RNA. Although several useful software tools for circular RNA detection have been developed as well, these tools are based on reads mapping may miss many circular RNA. Also, existing tools are slow for large data due to their dependence on reads mapping. METHOD: In this paper, we present a new computational approach, named CircMarker, based on k-mers rather than reads mapping for circular RNA detection. CircMarker takes advantage of transcriptome annotation files to create the k-mer table for circular RNA detection. RESULTS: Empirical results show that CircMarker outperforms existing tools in circular RNA detection on accuracy and efficiency in many simulated and real datasets. CONCLUSIONS: We develop a new circular RNA detection method called CircMarker based on k-mer analysis. Our results on both simulation data and real data demonstrate that CircMarker runs much faster and can find more circular RNA with higher consensus-based sensitivity and high accuracy ratio compared with existing tools. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-4926-0) contains supplementary material, which is available to authorized users. BioMed Central 2018-08-13 /pmc/articles/PMC6101086/ /pubmed/30367583 http://dx.doi.org/10.1186/s12864-018-4926-0 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Li, Xin
Chu, Chong
Pei, Jingwen
Măndoiu, Ion
Wu, Yufeng
CircMarker: a fast and accurate algorithm for circular RNA detection
title CircMarker: a fast and accurate algorithm for circular RNA detection
title_full CircMarker: a fast and accurate algorithm for circular RNA detection
title_fullStr CircMarker: a fast and accurate algorithm for circular RNA detection
title_full_unstemmed CircMarker: a fast and accurate algorithm for circular RNA detection
title_short CircMarker: a fast and accurate algorithm for circular RNA detection
title_sort circmarker: a fast and accurate algorithm for circular rna detection
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6101086/
https://www.ncbi.nlm.nih.gov/pubmed/30367583
http://dx.doi.org/10.1186/s12864-018-4926-0
work_keys_str_mv AT lixin circmarkerafastandaccuratealgorithmforcircularrnadetection
AT chuchong circmarkerafastandaccuratealgorithmforcircularrnadetection
AT peijingwen circmarkerafastandaccuratealgorithmforcircularrnadetection
AT mandoiuion circmarkerafastandaccuratealgorithmforcircularrnadetection
AT wuyufeng circmarkerafastandaccuratealgorithmforcircularrnadetection