Cargando…

Statistical algorithms improve accuracy of gene fusion detection

Gene fusions are known to play critical roles in tumor pathogenesis. Yet, sensitive and specific algorithms to detect gene fusions in cancer do not currently exist. In this paper, we present a new statistical algorithm, MACHETE (Mismatched Alignment CHimEra Tracking Engine), which achieves highly se...

Descripción completa

Detalles Bibliográficos
Autores principales: Hsieh, Gillian, Bierman, Rob, Szabo, Linda, Lee, Alex Gia, Freeman, Donald E., Watson, Nathaniel, Sweet-Cordero, E. Alejandro, Salzman, Julia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5737606/
https://www.ncbi.nlm.nih.gov/pubmed/28541529
http://dx.doi.org/10.1093/nar/gkx453
_version_ 1783287545592807424
author Hsieh, Gillian
Bierman, Rob
Szabo, Linda
Lee, Alex Gia
Freeman, Donald E.
Watson, Nathaniel
Sweet-Cordero, E. Alejandro
Salzman, Julia
author_facet Hsieh, Gillian
Bierman, Rob
Szabo, Linda
Lee, Alex Gia
Freeman, Donald E.
Watson, Nathaniel
Sweet-Cordero, E. Alejandro
Salzman, Julia
author_sort Hsieh, Gillian
collection PubMed
description Gene fusions are known to play critical roles in tumor pathogenesis. Yet, sensitive and specific algorithms to detect gene fusions in cancer do not currently exist. In this paper, we present a new statistical algorithm, MACHETE (Mismatched Alignment CHimEra Tracking Engine), which achieves highly sensitive and specific detection of gene fusions from RNA-Seq data, including the highest Positive Predictive Value (PPV) compared to the current state-of-the-art, as assessed in simulated data. We show that the best performing published algorithms either find large numbers of fusions in negative control data or suffer from low sensitivity detecting known driving fusions in gold standard settings, such as EWSR1-FLI1. As proof of principle that MACHETE discovers novel gene fusions with high accuracy in vivo, we mined public data to discover and subsequently PCR validate novel gene fusions missed by other algorithms in the ovarian cancer cell line OVCAR3. These results highlight the gains in accuracy achieved by introducing statistical models into fusion detection, and pave the way for unbiased discovery of potentially driving and druggable gene fusions in primary tumors.
format Online
Article
Text
id pubmed-5737606
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-57376062018-01-04 Statistical algorithms improve accuracy of gene fusion detection Hsieh, Gillian Bierman, Rob Szabo, Linda Lee, Alex Gia Freeman, Donald E. Watson, Nathaniel Sweet-Cordero, E. Alejandro Salzman, Julia Nucleic Acids Res Methods Online Gene fusions are known to play critical roles in tumor pathogenesis. Yet, sensitive and specific algorithms to detect gene fusions in cancer do not currently exist. In this paper, we present a new statistical algorithm, MACHETE (Mismatched Alignment CHimEra Tracking Engine), which achieves highly sensitive and specific detection of gene fusions from RNA-Seq data, including the highest Positive Predictive Value (PPV) compared to the current state-of-the-art, as assessed in simulated data. We show that the best performing published algorithms either find large numbers of fusions in negative control data or suffer from low sensitivity detecting known driving fusions in gold standard settings, such as EWSR1-FLI1. As proof of principle that MACHETE discovers novel gene fusions with high accuracy in vivo, we mined public data to discover and subsequently PCR validate novel gene fusions missed by other algorithms in the ovarian cancer cell line OVCAR3. These results highlight the gains in accuracy achieved by introducing statistical models into fusion detection, and pave the way for unbiased discovery of potentially driving and druggable gene fusions in primary tumors. Oxford University Press 2017-07-27 2017-05-24 /pmc/articles/PMC5737606/ /pubmed/28541529 http://dx.doi.org/10.1093/nar/gkx453 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Hsieh, Gillian
Bierman, Rob
Szabo, Linda
Lee, Alex Gia
Freeman, Donald E.
Watson, Nathaniel
Sweet-Cordero, E. Alejandro
Salzman, Julia
Statistical algorithms improve accuracy of gene fusion detection
title Statistical algorithms improve accuracy of gene fusion detection
title_full Statistical algorithms improve accuracy of gene fusion detection
title_fullStr Statistical algorithms improve accuracy of gene fusion detection
title_full_unstemmed Statistical algorithms improve accuracy of gene fusion detection
title_short Statistical algorithms improve accuracy of gene fusion detection
title_sort statistical algorithms improve accuracy of gene fusion detection
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5737606/
https://www.ncbi.nlm.nih.gov/pubmed/28541529
http://dx.doi.org/10.1093/nar/gkx453
work_keys_str_mv AT hsiehgillian statisticalalgorithmsimproveaccuracyofgenefusiondetection
AT biermanrob statisticalalgorithmsimproveaccuracyofgenefusiondetection
AT szabolinda statisticalalgorithmsimproveaccuracyofgenefusiondetection
AT leealexgia statisticalalgorithmsimproveaccuracyofgenefusiondetection
AT freemandonalde statisticalalgorithmsimproveaccuracyofgenefusiondetection
AT watsonnathaniel statisticalalgorithmsimproveaccuracyofgenefusiondetection
AT sweetcorderoealejandro statisticalalgorithmsimproveaccuracyofgenefusiondetection
AT salzmanjulia statisticalalgorithmsimproveaccuracyofgenefusiondetection