Cargando…
CircMarker: a fast and accurate algorithm for circular RNA detection
BACKGROUND: While RNA is often created from linear splicing during transcription, recent studies have found that non-canonical splicing sometimes occurs. Non-canonical splicing joins 3’ and 5’ and forms the so-called circular RNA. It is now believed that circular RNA plays important biological roles...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6101086/ https://www.ncbi.nlm.nih.gov/pubmed/30367583 http://dx.doi.org/10.1186/s12864-018-4926-0 |
_version_ | 1783348985460686848 |
---|---|
author | Li, Xin Chu, Chong Pei, Jingwen Măndoiu, Ion Wu, Yufeng |
author_facet | Li, Xin Chu, Chong Pei, Jingwen Măndoiu, Ion Wu, Yufeng |
author_sort | Li, Xin |
collection | PubMed |
description | BACKGROUND: While RNA is often created from linear splicing during transcription, recent studies have found that non-canonical splicing sometimes occurs. Non-canonical splicing joins 3’ and 5’ and forms the so-called circular RNA. It is now believed that circular RNA plays important biological roles such as affecting susceptibility of some diseases. During the past several years, multiple experimental methods have been developed to enrich circular RNA while degrade linear RNA. Although several useful software tools for circular RNA detection have been developed as well, these tools are based on reads mapping may miss many circular RNA. Also, existing tools are slow for large data due to their dependence on reads mapping. METHOD: In this paper, we present a new computational approach, named CircMarker, based on k-mers rather than reads mapping for circular RNA detection. CircMarker takes advantage of transcriptome annotation files to create the k-mer table for circular RNA detection. RESULTS: Empirical results show that CircMarker outperforms existing tools in circular RNA detection on accuracy and efficiency in many simulated and real datasets. CONCLUSIONS: We develop a new circular RNA detection method called CircMarker based on k-mer analysis. Our results on both simulation data and real data demonstrate that CircMarker runs much faster and can find more circular RNA with higher consensus-based sensitivity and high accuracy ratio compared with existing tools. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-4926-0) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6101086 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-61010862018-08-27 CircMarker: a fast and accurate algorithm for circular RNA detection Li, Xin Chu, Chong Pei, Jingwen Măndoiu, Ion Wu, Yufeng BMC Genomics Software BACKGROUND: While RNA is often created from linear splicing during transcription, recent studies have found that non-canonical splicing sometimes occurs. Non-canonical splicing joins 3’ and 5’ and forms the so-called circular RNA. It is now believed that circular RNA plays important biological roles such as affecting susceptibility of some diseases. During the past several years, multiple experimental methods have been developed to enrich circular RNA while degrade linear RNA. Although several useful software tools for circular RNA detection have been developed as well, these tools are based on reads mapping may miss many circular RNA. Also, existing tools are slow for large data due to their dependence on reads mapping. METHOD: In this paper, we present a new computational approach, named CircMarker, based on k-mers rather than reads mapping for circular RNA detection. CircMarker takes advantage of transcriptome annotation files to create the k-mer table for circular RNA detection. RESULTS: Empirical results show that CircMarker outperforms existing tools in circular RNA detection on accuracy and efficiency in many simulated and real datasets. CONCLUSIONS: We develop a new circular RNA detection method called CircMarker based on k-mer analysis. Our results on both simulation data and real data demonstrate that CircMarker runs much faster and can find more circular RNA with higher consensus-based sensitivity and high accuracy ratio compared with existing tools. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-4926-0) contains supplementary material, which is available to authorized users. BioMed Central 2018-08-13 /pmc/articles/PMC6101086/ /pubmed/30367583 http://dx.doi.org/10.1186/s12864-018-4926-0 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Li, Xin Chu, Chong Pei, Jingwen Măndoiu, Ion Wu, Yufeng CircMarker: a fast and accurate algorithm for circular RNA detection |
title | CircMarker: a fast and accurate algorithm for circular RNA detection |
title_full | CircMarker: a fast and accurate algorithm for circular RNA detection |
title_fullStr | CircMarker: a fast and accurate algorithm for circular RNA detection |
title_full_unstemmed | CircMarker: a fast and accurate algorithm for circular RNA detection |
title_short | CircMarker: a fast and accurate algorithm for circular RNA detection |
title_sort | circmarker: a fast and accurate algorithm for circular rna detection |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6101086/ https://www.ncbi.nlm.nih.gov/pubmed/30367583 http://dx.doi.org/10.1186/s12864-018-4926-0 |
work_keys_str_mv | AT lixin circmarkerafastandaccuratealgorithmforcircularrnadetection AT chuchong circmarkerafastandaccuratealgorithmforcircularrnadetection AT peijingwen circmarkerafastandaccuratealgorithmforcircularrnadetection AT mandoiuion circmarkerafastandaccuratealgorithmforcircularrnadetection AT wuyufeng circmarkerafastandaccuratealgorithmforcircularrnadetection |