Cargando…

deFuse: An Algorithm for Gene Fusion Discovery in Tumor RNA-Seq Data

Gene fusions created by somatic genomic rearrangements are known to play an important role in the onset and development of some cancers, such as lymphomas and sarcomas. RNA-Seq (whole transcriptome shotgun sequencing) is proving to be a useful tool for the discovery of novel gene fusions in cancer t...

Descripción completa

Detalles Bibliográficos
Autores principales: McPherson, Andrew, Hormozdiari, Fereydoun, Zayed, Abdalnasser, Giuliany, Ryan, Ha, Gavin, Sun, Mark G. F., Griffith, Malachi, Heravi Moussavi, Alireza, Senz, Janine, Melnyk, Nataliya, Pacheco, Marina, Marra, Marco A., Hirst, Martin, Nielsen, Torsten O., Sahinalp, S. Cenk, Huntsman, David, Shah, Sohrab P.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3098195/
https://www.ncbi.nlm.nih.gov/pubmed/21625565
http://dx.doi.org/10.1371/journal.pcbi.1001138
_version_ 1782203933883629568
author McPherson, Andrew
Hormozdiari, Fereydoun
Zayed, Abdalnasser
Giuliany, Ryan
Ha, Gavin
Sun, Mark G. F.
Griffith, Malachi
Heravi Moussavi, Alireza
Senz, Janine
Melnyk, Nataliya
Pacheco, Marina
Marra, Marco A.
Hirst, Martin
Nielsen, Torsten O.
Sahinalp, S. Cenk
Huntsman, David
Shah, Sohrab P.
author_facet McPherson, Andrew
Hormozdiari, Fereydoun
Zayed, Abdalnasser
Giuliany, Ryan
Ha, Gavin
Sun, Mark G. F.
Griffith, Malachi
Heravi Moussavi, Alireza
Senz, Janine
Melnyk, Nataliya
Pacheco, Marina
Marra, Marco A.
Hirst, Martin
Nielsen, Torsten O.
Sahinalp, S. Cenk
Huntsman, David
Shah, Sohrab P.
author_sort McPherson, Andrew
collection PubMed
description Gene fusions created by somatic genomic rearrangements are known to play an important role in the onset and development of some cancers, such as lymphomas and sarcomas. RNA-Seq (whole transcriptome shotgun sequencing) is proving to be a useful tool for the discovery of novel gene fusions in cancer transcriptomes. However, algorithmic methods for the discovery of gene fusions using RNA-Seq data remain underdeveloped. We have developed deFuse, a novel computational method for fusion discovery in tumor RNA-Seq data. Unlike existing methods that use only unique best-hit alignments and consider only fusion boundaries at the ends of known exons, deFuse considers all alignments and all possible locations for fusion boundaries. As a result, deFuse is able to identify fusion sequences with demonstrably better sensitivity than previous approaches. To increase the specificity of our approach, we curated a list of 60 true positive and 61 true negative fusion sequences (as confirmed by RT-PCR), and have trained an adaboost classifier on 11 novel features of the sequence data. The resulting classifier has an estimated value of 0.91 for the area under the ROC curve. We have used deFuse to discover gene fusions in 40 ovarian tumor samples, one ovarian cancer cell line, and three sarcoma samples. We report herein the first gene fusions discovered in ovarian cancer. We conclude that gene fusions are not infrequent events in ovarian cancer and that these events have the potential to substantially alter the expression patterns of the genes involved; gene fusions should therefore be considered in efforts to comprehensively characterize the mutational profiles of ovarian cancer transcriptomes.
format Text
id pubmed-3098195
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-30981952011-05-27 deFuse: An Algorithm for Gene Fusion Discovery in Tumor RNA-Seq Data McPherson, Andrew Hormozdiari, Fereydoun Zayed, Abdalnasser Giuliany, Ryan Ha, Gavin Sun, Mark G. F. Griffith, Malachi Heravi Moussavi, Alireza Senz, Janine Melnyk, Nataliya Pacheco, Marina Marra, Marco A. Hirst, Martin Nielsen, Torsten O. Sahinalp, S. Cenk Huntsman, David Shah, Sohrab P. PLoS Comput Biol Research Article Gene fusions created by somatic genomic rearrangements are known to play an important role in the onset and development of some cancers, such as lymphomas and sarcomas. RNA-Seq (whole transcriptome shotgun sequencing) is proving to be a useful tool for the discovery of novel gene fusions in cancer transcriptomes. However, algorithmic methods for the discovery of gene fusions using RNA-Seq data remain underdeveloped. We have developed deFuse, a novel computational method for fusion discovery in tumor RNA-Seq data. Unlike existing methods that use only unique best-hit alignments and consider only fusion boundaries at the ends of known exons, deFuse considers all alignments and all possible locations for fusion boundaries. As a result, deFuse is able to identify fusion sequences with demonstrably better sensitivity than previous approaches. To increase the specificity of our approach, we curated a list of 60 true positive and 61 true negative fusion sequences (as confirmed by RT-PCR), and have trained an adaboost classifier on 11 novel features of the sequence data. The resulting classifier has an estimated value of 0.91 for the area under the ROC curve. We have used deFuse to discover gene fusions in 40 ovarian tumor samples, one ovarian cancer cell line, and three sarcoma samples. We report herein the first gene fusions discovered in ovarian cancer. We conclude that gene fusions are not infrequent events in ovarian cancer and that these events have the potential to substantially alter the expression patterns of the genes involved; gene fusions should therefore be considered in efforts to comprehensively characterize the mutational profiles of ovarian cancer transcriptomes. Public Library of Science 2011-05-19 /pmc/articles/PMC3098195/ /pubmed/21625565 http://dx.doi.org/10.1371/journal.pcbi.1001138 Text en McPherson et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
McPherson, Andrew
Hormozdiari, Fereydoun
Zayed, Abdalnasser
Giuliany, Ryan
Ha, Gavin
Sun, Mark G. F.
Griffith, Malachi
Heravi Moussavi, Alireza
Senz, Janine
Melnyk, Nataliya
Pacheco, Marina
Marra, Marco A.
Hirst, Martin
Nielsen, Torsten O.
Sahinalp, S. Cenk
Huntsman, David
Shah, Sohrab P.
deFuse: An Algorithm for Gene Fusion Discovery in Tumor RNA-Seq Data
title deFuse: An Algorithm for Gene Fusion Discovery in Tumor RNA-Seq Data
title_full deFuse: An Algorithm for Gene Fusion Discovery in Tumor RNA-Seq Data
title_fullStr deFuse: An Algorithm for Gene Fusion Discovery in Tumor RNA-Seq Data
title_full_unstemmed deFuse: An Algorithm for Gene Fusion Discovery in Tumor RNA-Seq Data
title_short deFuse: An Algorithm for Gene Fusion Discovery in Tumor RNA-Seq Data
title_sort defuse: an algorithm for gene fusion discovery in tumor rna-seq data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3098195/
https://www.ncbi.nlm.nih.gov/pubmed/21625565
http://dx.doi.org/10.1371/journal.pcbi.1001138
work_keys_str_mv AT mcphersonandrew defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT hormozdiarifereydoun defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT zayedabdalnasser defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT giulianyryan defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT hagavin defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT sunmarkgf defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT griffithmalachi defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT heravimoussavialireza defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT senzjanine defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT melnyknataliya defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT pachecomarina defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT marramarcoa defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT hirstmartin defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT nielsentorsteno defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT sahinalpscenk defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT huntsmandavid defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT shahsohrabp defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata