Cargando…

Comprehensive evaluation of fusion transcript detection algorithms and a meta-caller to combine top performing methods in paired-end RNA-seq data

Background: Fusion transcripts are formed by either fusion genes (DNA level) or trans-splicing events (RNA level). They have been recognized as a promising tool for diagnosing, subtyping and treating cancers. RNA-seq has become a precise and efficient standard for genome-wide screening of such aberr...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Silvia, Tsai, Wei-Hsiang, Ding, Ying, Chen, Rui, Fang, Zhou, Huo, Zhiguang, Kim, SungHwan, Ma, Tianzhou, Chang, Ting-Yu, Priedigkeit, Nolan Michael, Lee, Adrian V., Luo, Jianhua, Wang, Hsei-Wei, Chung, I-Fang, Tseng, George C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4797269/
https://www.ncbi.nlm.nih.gov/pubmed/26582927
http://dx.doi.org/10.1093/nar/gkv1234
_version_ 1782421923920084992
author Liu, Silvia
Tsai, Wei-Hsiang
Ding, Ying
Chen, Rui
Fang, Zhou
Huo, Zhiguang
Kim, SungHwan
Ma, Tianzhou
Chang, Ting-Yu
Priedigkeit, Nolan Michael
Lee, Adrian V.
Luo, Jianhua
Wang, Hsei-Wei
Chung, I-Fang
Tseng, George C.
author_facet Liu, Silvia
Tsai, Wei-Hsiang
Ding, Ying
Chen, Rui
Fang, Zhou
Huo, Zhiguang
Kim, SungHwan
Ma, Tianzhou
Chang, Ting-Yu
Priedigkeit, Nolan Michael
Lee, Adrian V.
Luo, Jianhua
Wang, Hsei-Wei
Chung, I-Fang
Tseng, George C.
author_sort Liu, Silvia
collection PubMed
description Background: Fusion transcripts are formed by either fusion genes (DNA level) or trans-splicing events (RNA level). They have been recognized as a promising tool for diagnosing, subtyping and treating cancers. RNA-seq has become a precise and efficient standard for genome-wide screening of such aberration events. Many fusion transcript detection algorithms have been developed for paired-end RNA-seq data but their performance has not been comprehensively evaluated to guide practitioners. In this paper, we evaluated 15 popular algorithms by their precision and recall trade-off, accuracy of supporting reads and computational cost. We further combine top-performing methods for improved ensemble detection. Results: Fifteen fusion transcript detection tools were compared using three synthetic data sets under different coverage, read length, insert size and background noise, and three real data sets with selected experimental validations. No single method dominantly performed the best but SOAPfuse generally performed well, followed by FusionCatcher and JAFFA. We further demonstrated the potential of a meta-caller algorithm by combining top performing methods to re-prioritize candidate fusion transcripts with high confidence that can be followed by experimental validation. Conclusion: Our result provides insightful recommendations when applying individual tool or combining top performers to identify fusion transcript candidates.
format Online
Article
Text
id pubmed-4797269
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-47972692016-03-21 Comprehensive evaluation of fusion transcript detection algorithms and a meta-caller to combine top performing methods in paired-end RNA-seq data Liu, Silvia Tsai, Wei-Hsiang Ding, Ying Chen, Rui Fang, Zhou Huo, Zhiguang Kim, SungHwan Ma, Tianzhou Chang, Ting-Yu Priedigkeit, Nolan Michael Lee, Adrian V. Luo, Jianhua Wang, Hsei-Wei Chung, I-Fang Tseng, George C. Nucleic Acids Res Methods Online Background: Fusion transcripts are formed by either fusion genes (DNA level) or trans-splicing events (RNA level). They have been recognized as a promising tool for diagnosing, subtyping and treating cancers. RNA-seq has become a precise and efficient standard for genome-wide screening of such aberration events. Many fusion transcript detection algorithms have been developed for paired-end RNA-seq data but their performance has not been comprehensively evaluated to guide practitioners. In this paper, we evaluated 15 popular algorithms by their precision and recall trade-off, accuracy of supporting reads and computational cost. We further combine top-performing methods for improved ensemble detection. Results: Fifteen fusion transcript detection tools were compared using three synthetic data sets under different coverage, read length, insert size and background noise, and three real data sets with selected experimental validations. No single method dominantly performed the best but SOAPfuse generally performed well, followed by FusionCatcher and JAFFA. We further demonstrated the potential of a meta-caller algorithm by combining top performing methods to re-prioritize candidate fusion transcripts with high confidence that can be followed by experimental validation. Conclusion: Our result provides insightful recommendations when applying individual tool or combining top performers to identify fusion transcript candidates. Oxford University Press 2016-03-18 2015-11-17 /pmc/articles/PMC4797269/ /pubmed/26582927 http://dx.doi.org/10.1093/nar/gkv1234 Text en © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Liu, Silvia
Tsai, Wei-Hsiang
Ding, Ying
Chen, Rui
Fang, Zhou
Huo, Zhiguang
Kim, SungHwan
Ma, Tianzhou
Chang, Ting-Yu
Priedigkeit, Nolan Michael
Lee, Adrian V.
Luo, Jianhua
Wang, Hsei-Wei
Chung, I-Fang
Tseng, George C.
Comprehensive evaluation of fusion transcript detection algorithms and a meta-caller to combine top performing methods in paired-end RNA-seq data
title Comprehensive evaluation of fusion transcript detection algorithms and a meta-caller to combine top performing methods in paired-end RNA-seq data
title_full Comprehensive evaluation of fusion transcript detection algorithms and a meta-caller to combine top performing methods in paired-end RNA-seq data
title_fullStr Comprehensive evaluation of fusion transcript detection algorithms and a meta-caller to combine top performing methods in paired-end RNA-seq data
title_full_unstemmed Comprehensive evaluation of fusion transcript detection algorithms and a meta-caller to combine top performing methods in paired-end RNA-seq data
title_short Comprehensive evaluation of fusion transcript detection algorithms and a meta-caller to combine top performing methods in paired-end RNA-seq data
title_sort comprehensive evaluation of fusion transcript detection algorithms and a meta-caller to combine top performing methods in paired-end rna-seq data
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4797269/
https://www.ncbi.nlm.nih.gov/pubmed/26582927
http://dx.doi.org/10.1093/nar/gkv1234
work_keys_str_mv AT liusilvia comprehensiveevaluationoffusiontranscriptdetectionalgorithmsandametacallertocombinetopperformingmethodsinpairedendrnaseqdata
AT tsaiweihsiang comprehensiveevaluationoffusiontranscriptdetectionalgorithmsandametacallertocombinetopperformingmethodsinpairedendrnaseqdata
AT dingying comprehensiveevaluationoffusiontranscriptdetectionalgorithmsandametacallertocombinetopperformingmethodsinpairedendrnaseqdata
AT chenrui comprehensiveevaluationoffusiontranscriptdetectionalgorithmsandametacallertocombinetopperformingmethodsinpairedendrnaseqdata
AT fangzhou comprehensiveevaluationoffusiontranscriptdetectionalgorithmsandametacallertocombinetopperformingmethodsinpairedendrnaseqdata
AT huozhiguang comprehensiveevaluationoffusiontranscriptdetectionalgorithmsandametacallertocombinetopperformingmethodsinpairedendrnaseqdata
AT kimsunghwan comprehensiveevaluationoffusiontranscriptdetectionalgorithmsandametacallertocombinetopperformingmethodsinpairedendrnaseqdata
AT matianzhou comprehensiveevaluationoffusiontranscriptdetectionalgorithmsandametacallertocombinetopperformingmethodsinpairedendrnaseqdata
AT changtingyu comprehensiveevaluationoffusiontranscriptdetectionalgorithmsandametacallertocombinetopperformingmethodsinpairedendrnaseqdata
AT priedigkeitnolanmichael comprehensiveevaluationoffusiontranscriptdetectionalgorithmsandametacallertocombinetopperformingmethodsinpairedendrnaseqdata
AT leeadrianv comprehensiveevaluationoffusiontranscriptdetectionalgorithmsandametacallertocombinetopperformingmethodsinpairedendrnaseqdata
AT luojianhua comprehensiveevaluationoffusiontranscriptdetectionalgorithmsandametacallertocombinetopperformingmethodsinpairedendrnaseqdata
AT wanghseiwei comprehensiveevaluationoffusiontranscriptdetectionalgorithmsandametacallertocombinetopperformingmethodsinpairedendrnaseqdata
AT chungifang comprehensiveevaluationoffusiontranscriptdetectionalgorithmsandametacallertocombinetopperformingmethodsinpairedendrnaseqdata
AT tsenggeorgec comprehensiveevaluationoffusiontranscriptdetectionalgorithmsandametacallertocombinetopperformingmethodsinpairedendrnaseqdata