Cargando…
Fcirc: A comprehensive pipeline for the exploration of fusion linear and circular RNAs
BACKGROUND: In cancer cells, fusion genes can produce linear and chimeric fusion-circular RNAs (f-circRNAs), which are functional in gene expression regulation and implicated in malignant transformation, cancer progression, and therapeutic resistance. For specific cancers, proteins encoded by fusion...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7259471/ https://www.ncbi.nlm.nih.gov/pubmed/32470133 http://dx.doi.org/10.1093/gigascience/giaa054 |
_version_ | 1783540142902870016 |
---|---|
author | Cai, Zhaoqing Xue, Hongzhang Xu, Yue Köhler, Jens Cheng, Xiaojie Dai, Yao Zheng, Jie Wang, Haiyun |
author_facet | Cai, Zhaoqing Xue, Hongzhang Xu, Yue Köhler, Jens Cheng, Xiaojie Dai, Yao Zheng, Jie Wang, Haiyun |
author_sort | Cai, Zhaoqing |
collection | PubMed |
description | BACKGROUND: In cancer cells, fusion genes can produce linear and chimeric fusion-circular RNAs (f-circRNAs), which are functional in gene expression regulation and implicated in malignant transformation, cancer progression, and therapeutic resistance. For specific cancers, proteins encoded by fusion transcripts have been identified as innovative therapeutic targets (e.g., EML4-ALK). Even though RNA sequencing (RNA-Seq) technologies combined with existing bioinformatics approaches have enabled researchers to systematically identify fusion transcripts, specifically detecting f-circRNAs in cells remains challenging owing to their general sparsity and low abundance in cancer cells but also owing to imperfect computational methods. RESULTS: We developed the Python-based workflow “Fcirc” to identify fusion linear and f-circRNAs from RNA-Seq data with high specificity. We applied Fcirc to 3 different types of RNA-Seq data scenarios: (i) actual synthetic spike-in RNA-Seq data, (ii) simulated RNA-Seq data, and (iii) actual cancer cell–derived RNA-Seq data. Fcirc showed significant advantages over existing methods regarding both detection accuracy (i.e., precision, recall, F-measure) and computing performance (i.e., lower runtimes). CONCLUSION: Fcirc is a powerful and comprehensive Python-based pipeline to identify linear and circular RNA transcripts from known fusion events in RNA-Seq datasets with higher accuracy and shorter computing times compared with previously published algorithms. Fcirc empowers the research community to study the biology of fusion RNAs in cancer more effectively. |
format | Online Article Text |
id | pubmed-7259471 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-72594712020-06-03 Fcirc: A comprehensive pipeline for the exploration of fusion linear and circular RNAs Cai, Zhaoqing Xue, Hongzhang Xu, Yue Köhler, Jens Cheng, Xiaojie Dai, Yao Zheng, Jie Wang, Haiyun Gigascience Research BACKGROUND: In cancer cells, fusion genes can produce linear and chimeric fusion-circular RNAs (f-circRNAs), which are functional in gene expression regulation and implicated in malignant transformation, cancer progression, and therapeutic resistance. For specific cancers, proteins encoded by fusion transcripts have been identified as innovative therapeutic targets (e.g., EML4-ALK). Even though RNA sequencing (RNA-Seq) technologies combined with existing bioinformatics approaches have enabled researchers to systematically identify fusion transcripts, specifically detecting f-circRNAs in cells remains challenging owing to their general sparsity and low abundance in cancer cells but also owing to imperfect computational methods. RESULTS: We developed the Python-based workflow “Fcirc” to identify fusion linear and f-circRNAs from RNA-Seq data with high specificity. We applied Fcirc to 3 different types of RNA-Seq data scenarios: (i) actual synthetic spike-in RNA-Seq data, (ii) simulated RNA-Seq data, and (iii) actual cancer cell–derived RNA-Seq data. Fcirc showed significant advantages over existing methods regarding both detection accuracy (i.e., precision, recall, F-measure) and computing performance (i.e., lower runtimes). CONCLUSION: Fcirc is a powerful and comprehensive Python-based pipeline to identify linear and circular RNA transcripts from known fusion events in RNA-Seq datasets with higher accuracy and shorter computing times compared with previously published algorithms. Fcirc empowers the research community to study the biology of fusion RNAs in cancer more effectively. Oxford University Press 2020-05-29 /pmc/articles/PMC7259471/ /pubmed/32470133 http://dx.doi.org/10.1093/gigascience/giaa054 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Cai, Zhaoqing Xue, Hongzhang Xu, Yue Köhler, Jens Cheng, Xiaojie Dai, Yao Zheng, Jie Wang, Haiyun Fcirc: A comprehensive pipeline for the exploration of fusion linear and circular RNAs |
title | Fcirc: A comprehensive pipeline for the exploration of fusion linear and circular RNAs |
title_full | Fcirc: A comprehensive pipeline for the exploration of fusion linear and circular RNAs |
title_fullStr | Fcirc: A comprehensive pipeline for the exploration of fusion linear and circular RNAs |
title_full_unstemmed | Fcirc: A comprehensive pipeline for the exploration of fusion linear and circular RNAs |
title_short | Fcirc: A comprehensive pipeline for the exploration of fusion linear and circular RNAs |
title_sort | fcirc: a comprehensive pipeline for the exploration of fusion linear and circular rnas |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7259471/ https://www.ncbi.nlm.nih.gov/pubmed/32470133 http://dx.doi.org/10.1093/gigascience/giaa054 |
work_keys_str_mv | AT caizhaoqing fcircacomprehensivepipelinefortheexplorationoffusionlinearandcircularrnas AT xuehongzhang fcircacomprehensivepipelinefortheexplorationoffusionlinearandcircularrnas AT xuyue fcircacomprehensivepipelinefortheexplorationoffusionlinearandcircularrnas AT kohlerjens fcircacomprehensivepipelinefortheexplorationoffusionlinearandcircularrnas AT chengxiaojie fcircacomprehensivepipelinefortheexplorationoffusionlinearandcircularrnas AT daiyao fcircacomprehensivepipelinefortheexplorationoffusionlinearandcircularrnas AT zhengjie fcircacomprehensivepipelinefortheexplorationoffusionlinearandcircularrnas AT wanghaiyun fcircacomprehensivepipelinefortheexplorationoffusionlinearandcircularrnas |