Cargando…
GFusion: an Effective Algorithm to Identify Fusion Genes from Cancer RNA-Seq Data
Fusion gene derived from genomic rearrangement plays a key role in cancer initiation. The discovery of novel gene fusions may be of significant importance in cancer diagnosis and treatment. Meanwhile, next generation sequencing technology provide a sensitive and efficient way to identify gene fusion...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5537242/ https://www.ncbi.nlm.nih.gov/pubmed/28761119 http://dx.doi.org/10.1038/s41598-017-07070-6 |
_version_ | 1783254132697595904 |
---|---|
author | Zhao, Jian Chen, Qi Wu, Jing Han, Ping Song, Xiaofeng |
author_facet | Zhao, Jian Chen, Qi Wu, Jing Han, Ping Song, Xiaofeng |
author_sort | Zhao, Jian |
collection | PubMed |
description | Fusion gene derived from genomic rearrangement plays a key role in cancer initiation. The discovery of novel gene fusions may be of significant importance in cancer diagnosis and treatment. Meanwhile, next generation sequencing technology provide a sensitive and efficient way to identify gene fusions in genomic levels. However, there are still many challenges and limitations remaining in the existing methods which only rely on unmapped reads or discordant alignment fragments. In this work we have developed GFusion, a novel method using RNA-Seq data, to identify the fusion genes. This pipeline performs multiple alignments and strict filtering algorithm to improve sensitivity and reduce the false positive rate. GFusion successfully detected 34 from 43 previously reported fusions in four cancer datasets. We also demonstrated the effectiveness of GFusion using 24 million 76 bp paired-end reads simulation data which contains 42 artificial fusion genes, among which GFusion successfully discovered 37 fusion genes. Compared with existing methods, GFusion presented higher sensitivity and lower false positive rate. The GFusion pipeline can be accessed freely for non-commercial purposes at: https://github.com/xiaofengsong/GFusion. |
format | Online Article Text |
id | pubmed-5537242 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-55372422017-08-03 GFusion: an Effective Algorithm to Identify Fusion Genes from Cancer RNA-Seq Data Zhao, Jian Chen, Qi Wu, Jing Han, Ping Song, Xiaofeng Sci Rep Article Fusion gene derived from genomic rearrangement plays a key role in cancer initiation. The discovery of novel gene fusions may be of significant importance in cancer diagnosis and treatment. Meanwhile, next generation sequencing technology provide a sensitive and efficient way to identify gene fusions in genomic levels. However, there are still many challenges and limitations remaining in the existing methods which only rely on unmapped reads or discordant alignment fragments. In this work we have developed GFusion, a novel method using RNA-Seq data, to identify the fusion genes. This pipeline performs multiple alignments and strict filtering algorithm to improve sensitivity and reduce the false positive rate. GFusion successfully detected 34 from 43 previously reported fusions in four cancer datasets. We also demonstrated the effectiveness of GFusion using 24 million 76 bp paired-end reads simulation data which contains 42 artificial fusion genes, among which GFusion successfully discovered 37 fusion genes. Compared with existing methods, GFusion presented higher sensitivity and lower false positive rate. The GFusion pipeline can be accessed freely for non-commercial purposes at: https://github.com/xiaofengsong/GFusion. Nature Publishing Group UK 2017-07-31 /pmc/articles/PMC5537242/ /pubmed/28761119 http://dx.doi.org/10.1038/s41598-017-07070-6 Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Zhao, Jian Chen, Qi Wu, Jing Han, Ping Song, Xiaofeng GFusion: an Effective Algorithm to Identify Fusion Genes from Cancer RNA-Seq Data |
title | GFusion: an Effective Algorithm to Identify Fusion Genes from Cancer RNA-Seq Data |
title_full | GFusion: an Effective Algorithm to Identify Fusion Genes from Cancer RNA-Seq Data |
title_fullStr | GFusion: an Effective Algorithm to Identify Fusion Genes from Cancer RNA-Seq Data |
title_full_unstemmed | GFusion: an Effective Algorithm to Identify Fusion Genes from Cancer RNA-Seq Data |
title_short | GFusion: an Effective Algorithm to Identify Fusion Genes from Cancer RNA-Seq Data |
title_sort | gfusion: an effective algorithm to identify fusion genes from cancer rna-seq data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5537242/ https://www.ncbi.nlm.nih.gov/pubmed/28761119 http://dx.doi.org/10.1038/s41598-017-07070-6 |
work_keys_str_mv | AT zhaojian gfusionaneffectivealgorithmtoidentifyfusiongenesfromcancerrnaseqdata AT chenqi gfusionaneffectivealgorithmtoidentifyfusiongenesfromcancerrnaseqdata AT wujing gfusionaneffectivealgorithmtoidentifyfusiongenesfromcancerrnaseqdata AT hanping gfusionaneffectivealgorithmtoidentifyfusiongenesfromcancerrnaseqdata AT songxiaofeng gfusionaneffectivealgorithmtoidentifyfusiongenesfromcancerrnaseqdata |