Cargando…

Fcirc: A comprehensive pipeline for the exploration of fusion linear and circular RNAs

BACKGROUND: In cancer cells, fusion genes can produce linear and chimeric fusion-circular RNAs (f-circRNAs), which are functional in gene expression regulation and implicated in malignant transformation, cancer progression, and therapeutic resistance. For specific cancers, proteins encoded by fusion...

Descripción completa

Detalles Bibliográficos
Autores principales: Cai, Zhaoqing, Xue, Hongzhang, Xu, Yue, Köhler, Jens, Cheng, Xiaojie, Dai, Yao, Zheng, Jie, Wang, Haiyun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7259471/
https://www.ncbi.nlm.nih.gov/pubmed/32470133
http://dx.doi.org/10.1093/gigascience/giaa054
_version_ 1783540142902870016
author Cai, Zhaoqing
Xue, Hongzhang
Xu, Yue
Köhler, Jens
Cheng, Xiaojie
Dai, Yao
Zheng, Jie
Wang, Haiyun
author_facet Cai, Zhaoqing
Xue, Hongzhang
Xu, Yue
Köhler, Jens
Cheng, Xiaojie
Dai, Yao
Zheng, Jie
Wang, Haiyun
author_sort Cai, Zhaoqing
collection PubMed
description BACKGROUND: In cancer cells, fusion genes can produce linear and chimeric fusion-circular RNAs (f-circRNAs), which are functional in gene expression regulation and implicated in malignant transformation, cancer progression, and therapeutic resistance. For specific cancers, proteins encoded by fusion transcripts have been identified as innovative therapeutic targets (e.g., EML4-ALK). Even though RNA sequencing (RNA-Seq) technologies combined with existing bioinformatics approaches have enabled researchers to systematically identify fusion transcripts, specifically detecting f-circRNAs in cells remains challenging owing to their general sparsity and low abundance in cancer cells but also owing to imperfect computational methods. RESULTS: We developed the Python-based workflow “Fcirc” to identify fusion linear and f-circRNAs from RNA-Seq data with high specificity. We applied Fcirc to 3 different types of RNA-Seq data scenarios: (i) actual synthetic spike-in RNA-Seq data, (ii) simulated RNA-Seq data, and (iii) actual cancer cell–derived RNA-Seq data. Fcirc showed significant advantages over existing methods regarding both detection accuracy (i.e., precision, recall, F-measure) and computing performance (i.e., lower runtimes). CONCLUSION: Fcirc is a powerful and comprehensive Python-based pipeline to identify linear and circular RNA transcripts from known fusion events in RNA-Seq datasets with higher accuracy and shorter computing times compared with previously published algorithms. Fcirc empowers the research community to study the biology of fusion RNAs in cancer more effectively.
format Online
Article
Text
id pubmed-7259471
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-72594712020-06-03 Fcirc: A comprehensive pipeline for the exploration of fusion linear and circular RNAs Cai, Zhaoqing Xue, Hongzhang Xu, Yue Köhler, Jens Cheng, Xiaojie Dai, Yao Zheng, Jie Wang, Haiyun Gigascience Research BACKGROUND: In cancer cells, fusion genes can produce linear and chimeric fusion-circular RNAs (f-circRNAs), which are functional in gene expression regulation and implicated in malignant transformation, cancer progression, and therapeutic resistance. For specific cancers, proteins encoded by fusion transcripts have been identified as innovative therapeutic targets (e.g., EML4-ALK). Even though RNA sequencing (RNA-Seq) technologies combined with existing bioinformatics approaches have enabled researchers to systematically identify fusion transcripts, specifically detecting f-circRNAs in cells remains challenging owing to their general sparsity and low abundance in cancer cells but also owing to imperfect computational methods. RESULTS: We developed the Python-based workflow “Fcirc” to identify fusion linear and f-circRNAs from RNA-Seq data with high specificity. We applied Fcirc to 3 different types of RNA-Seq data scenarios: (i) actual synthetic spike-in RNA-Seq data, (ii) simulated RNA-Seq data, and (iii) actual cancer cell–derived RNA-Seq data. Fcirc showed significant advantages over existing methods regarding both detection accuracy (i.e., precision, recall, F-measure) and computing performance (i.e., lower runtimes). CONCLUSION: Fcirc is a powerful and comprehensive Python-based pipeline to identify linear and circular RNA transcripts from known fusion events in RNA-Seq datasets with higher accuracy and shorter computing times compared with previously published algorithms. Fcirc empowers the research community to study the biology of fusion RNAs in cancer more effectively. Oxford University Press 2020-05-29 /pmc/articles/PMC7259471/ /pubmed/32470133 http://dx.doi.org/10.1093/gigascience/giaa054 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Cai, Zhaoqing
Xue, Hongzhang
Xu, Yue
Köhler, Jens
Cheng, Xiaojie
Dai, Yao
Zheng, Jie
Wang, Haiyun
Fcirc: A comprehensive pipeline for the exploration of fusion linear and circular RNAs
title Fcirc: A comprehensive pipeline for the exploration of fusion linear and circular RNAs
title_full Fcirc: A comprehensive pipeline for the exploration of fusion linear and circular RNAs
title_fullStr Fcirc: A comprehensive pipeline for the exploration of fusion linear and circular RNAs
title_full_unstemmed Fcirc: A comprehensive pipeline for the exploration of fusion linear and circular RNAs
title_short Fcirc: A comprehensive pipeline for the exploration of fusion linear and circular RNAs
title_sort fcirc: a comprehensive pipeline for the exploration of fusion linear and circular rnas
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7259471/
https://www.ncbi.nlm.nih.gov/pubmed/32470133
http://dx.doi.org/10.1093/gigascience/giaa054
work_keys_str_mv AT caizhaoqing fcircacomprehensivepipelinefortheexplorationoffusionlinearandcircularrnas
AT xuehongzhang fcircacomprehensivepipelinefortheexplorationoffusionlinearandcircularrnas
AT xuyue fcircacomprehensivepipelinefortheexplorationoffusionlinearandcircularrnas
AT kohlerjens fcircacomprehensivepipelinefortheexplorationoffusionlinearandcircularrnas
AT chengxiaojie fcircacomprehensivepipelinefortheexplorationoffusionlinearandcircularrnas
AT daiyao fcircacomprehensivepipelinefortheexplorationoffusionlinearandcircularrnas
AT zhengjie fcircacomprehensivepipelinefortheexplorationoffusionlinearandcircularrnas
AT wanghaiyun fcircacomprehensivepipelinefortheexplorationoffusionlinearandcircularrnas