Cargando…
Statistical algorithms improve accuracy of gene fusion detection
Gene fusions are known to play critical roles in tumor pathogenesis. Yet, sensitive and specific algorithms to detect gene fusions in cancer do not currently exist. In this paper, we present a new statistical algorithm, MACHETE (Mismatched Alignment CHimEra Tracking Engine), which achieves highly se...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5737606/ https://www.ncbi.nlm.nih.gov/pubmed/28541529 http://dx.doi.org/10.1093/nar/gkx453 |
_version_ | 1783287545592807424 |
---|---|
author | Hsieh, Gillian Bierman, Rob Szabo, Linda Lee, Alex Gia Freeman, Donald E. Watson, Nathaniel Sweet-Cordero, E. Alejandro Salzman, Julia |
author_facet | Hsieh, Gillian Bierman, Rob Szabo, Linda Lee, Alex Gia Freeman, Donald E. Watson, Nathaniel Sweet-Cordero, E. Alejandro Salzman, Julia |
author_sort | Hsieh, Gillian |
collection | PubMed |
description | Gene fusions are known to play critical roles in tumor pathogenesis. Yet, sensitive and specific algorithms to detect gene fusions in cancer do not currently exist. In this paper, we present a new statistical algorithm, MACHETE (Mismatched Alignment CHimEra Tracking Engine), which achieves highly sensitive and specific detection of gene fusions from RNA-Seq data, including the highest Positive Predictive Value (PPV) compared to the current state-of-the-art, as assessed in simulated data. We show that the best performing published algorithms either find large numbers of fusions in negative control data or suffer from low sensitivity detecting known driving fusions in gold standard settings, such as EWSR1-FLI1. As proof of principle that MACHETE discovers novel gene fusions with high accuracy in vivo, we mined public data to discover and subsequently PCR validate novel gene fusions missed by other algorithms in the ovarian cancer cell line OVCAR3. These results highlight the gains in accuracy achieved by introducing statistical models into fusion detection, and pave the way for unbiased discovery of potentially driving and druggable gene fusions in primary tumors. |
format | Online Article Text |
id | pubmed-5737606 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-57376062018-01-04 Statistical algorithms improve accuracy of gene fusion detection Hsieh, Gillian Bierman, Rob Szabo, Linda Lee, Alex Gia Freeman, Donald E. Watson, Nathaniel Sweet-Cordero, E. Alejandro Salzman, Julia Nucleic Acids Res Methods Online Gene fusions are known to play critical roles in tumor pathogenesis. Yet, sensitive and specific algorithms to detect gene fusions in cancer do not currently exist. In this paper, we present a new statistical algorithm, MACHETE (Mismatched Alignment CHimEra Tracking Engine), which achieves highly sensitive and specific detection of gene fusions from RNA-Seq data, including the highest Positive Predictive Value (PPV) compared to the current state-of-the-art, as assessed in simulated data. We show that the best performing published algorithms either find large numbers of fusions in negative control data or suffer from low sensitivity detecting known driving fusions in gold standard settings, such as EWSR1-FLI1. As proof of principle that MACHETE discovers novel gene fusions with high accuracy in vivo, we mined public data to discover and subsequently PCR validate novel gene fusions missed by other algorithms in the ovarian cancer cell line OVCAR3. These results highlight the gains in accuracy achieved by introducing statistical models into fusion detection, and pave the way for unbiased discovery of potentially driving and druggable gene fusions in primary tumors. Oxford University Press 2017-07-27 2017-05-24 /pmc/articles/PMC5737606/ /pubmed/28541529 http://dx.doi.org/10.1093/nar/gkx453 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methods Online Hsieh, Gillian Bierman, Rob Szabo, Linda Lee, Alex Gia Freeman, Donald E. Watson, Nathaniel Sweet-Cordero, E. Alejandro Salzman, Julia Statistical algorithms improve accuracy of gene fusion detection |
title | Statistical algorithms improve accuracy of gene fusion detection |
title_full | Statistical algorithms improve accuracy of gene fusion detection |
title_fullStr | Statistical algorithms improve accuracy of gene fusion detection |
title_full_unstemmed | Statistical algorithms improve accuracy of gene fusion detection |
title_short | Statistical algorithms improve accuracy of gene fusion detection |
title_sort | statistical algorithms improve accuracy of gene fusion detection |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5737606/ https://www.ncbi.nlm.nih.gov/pubmed/28541529 http://dx.doi.org/10.1093/nar/gkx453 |
work_keys_str_mv | AT hsiehgillian statisticalalgorithmsimproveaccuracyofgenefusiondetection AT biermanrob statisticalalgorithmsimproveaccuracyofgenefusiondetection AT szabolinda statisticalalgorithmsimproveaccuracyofgenefusiondetection AT leealexgia statisticalalgorithmsimproveaccuracyofgenefusiondetection AT freemandonalde statisticalalgorithmsimproveaccuracyofgenefusiondetection AT watsonnathaniel statisticalalgorithmsimproveaccuracyofgenefusiondetection AT sweetcorderoealejandro statisticalalgorithmsimproveaccuracyofgenefusiondetection AT salzmanjulia statisticalalgorithmsimproveaccuracyofgenefusiondetection |