Cargando…

GFusion: an Effective Algorithm to Identify Fusion Genes from Cancer RNA-Seq Data

Fusion gene derived from genomic rearrangement plays a key role in cancer initiation. The discovery of novel gene fusions may be of significant importance in cancer diagnosis and treatment. Meanwhile, next generation sequencing technology provide a sensitive and efficient way to identify gene fusion...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Jian, Chen, Qi, Wu, Jing, Han, Ping, Song, Xiaofeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5537242/
https://www.ncbi.nlm.nih.gov/pubmed/28761119
http://dx.doi.org/10.1038/s41598-017-07070-6
_version_ 1783254132697595904
author Zhao, Jian
Chen, Qi
Wu, Jing
Han, Ping
Song, Xiaofeng
author_facet Zhao, Jian
Chen, Qi
Wu, Jing
Han, Ping
Song, Xiaofeng
author_sort Zhao, Jian
collection PubMed
description Fusion gene derived from genomic rearrangement plays a key role in cancer initiation. The discovery of novel gene fusions may be of significant importance in cancer diagnosis and treatment. Meanwhile, next generation sequencing technology provide a sensitive and efficient way to identify gene fusions in genomic levels. However, there are still many challenges and limitations remaining in the existing methods which only rely on unmapped reads or discordant alignment fragments. In this work we have developed GFusion, a novel method using RNA-Seq data, to identify the fusion genes. This pipeline performs multiple alignments and strict filtering algorithm to improve sensitivity and reduce the false positive rate. GFusion successfully detected 34 from 43 previously reported fusions in four cancer datasets. We also demonstrated the effectiveness of GFusion using 24 million 76 bp paired-end reads simulation data which contains 42 artificial fusion genes, among which GFusion successfully discovered 37 fusion genes. Compared with existing methods, GFusion presented higher sensitivity and lower false positive rate. The GFusion pipeline can be accessed freely for non-commercial purposes at: https://github.com/xiaofengsong/GFusion.
format Online
Article
Text
id pubmed-5537242
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-55372422017-08-03 GFusion: an Effective Algorithm to Identify Fusion Genes from Cancer RNA-Seq Data Zhao, Jian Chen, Qi Wu, Jing Han, Ping Song, Xiaofeng Sci Rep Article Fusion gene derived from genomic rearrangement plays a key role in cancer initiation. The discovery of novel gene fusions may be of significant importance in cancer diagnosis and treatment. Meanwhile, next generation sequencing technology provide a sensitive and efficient way to identify gene fusions in genomic levels. However, there are still many challenges and limitations remaining in the existing methods which only rely on unmapped reads or discordant alignment fragments. In this work we have developed GFusion, a novel method using RNA-Seq data, to identify the fusion genes. This pipeline performs multiple alignments and strict filtering algorithm to improve sensitivity and reduce the false positive rate. GFusion successfully detected 34 from 43 previously reported fusions in four cancer datasets. We also demonstrated the effectiveness of GFusion using 24 million 76 bp paired-end reads simulation data which contains 42 artificial fusion genes, among which GFusion successfully discovered 37 fusion genes. Compared with existing methods, GFusion presented higher sensitivity and lower false positive rate. The GFusion pipeline can be accessed freely for non-commercial purposes at: https://github.com/xiaofengsong/GFusion. Nature Publishing Group UK 2017-07-31 /pmc/articles/PMC5537242/ /pubmed/28761119 http://dx.doi.org/10.1038/s41598-017-07070-6 Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Zhao, Jian
Chen, Qi
Wu, Jing
Han, Ping
Song, Xiaofeng
GFusion: an Effective Algorithm to Identify Fusion Genes from Cancer RNA-Seq Data
title GFusion: an Effective Algorithm to Identify Fusion Genes from Cancer RNA-Seq Data
title_full GFusion: an Effective Algorithm to Identify Fusion Genes from Cancer RNA-Seq Data
title_fullStr GFusion: an Effective Algorithm to Identify Fusion Genes from Cancer RNA-Seq Data
title_full_unstemmed GFusion: an Effective Algorithm to Identify Fusion Genes from Cancer RNA-Seq Data
title_short GFusion: an Effective Algorithm to Identify Fusion Genes from Cancer RNA-Seq Data
title_sort gfusion: an effective algorithm to identify fusion genes from cancer rna-seq data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5537242/
https://www.ncbi.nlm.nih.gov/pubmed/28761119
http://dx.doi.org/10.1038/s41598-017-07070-6
work_keys_str_mv AT zhaojian gfusionaneffectivealgorithmtoidentifyfusiongenesfromcancerrnaseqdata
AT chenqi gfusionaneffectivealgorithmtoidentifyfusiongenesfromcancerrnaseqdata
AT wujing gfusionaneffectivealgorithmtoidentifyfusiongenesfromcancerrnaseqdata
AT hanping gfusionaneffectivealgorithmtoidentifyfusiongenesfromcancerrnaseqdata
AT songxiaofeng gfusionaneffectivealgorithmtoidentifyfusiongenesfromcancerrnaseqdata