Cargando…

Reproducible Bioinformatics Analysis Workflows for Detecting IGH Gene Fusions in B-Cell Acute Lymphoblastic Leukaemia Patients

SIMPLE SUMMARY: B-cell acute lymphoblastic leukaemia (B-ALL) is a haematological malignancy driven by diverse genomic alterations, the most common being gene fusions, which can be detected via transcriptomic analysis. However, detecting gene fusions involving the Immunoglobulin Heavy Chain (IGH) reg...

Descripción completa

Detalles Bibliográficos
Autores principales: Thomson, Ashlee J., Rehn, Jacqueline A., Heatley, Susan L., Eadie, Laura N., Page, Elyse C., Schutz, Caitlin, McClure, Barbara J., Sutton, Rosemary, Dalla-Pozza, Luciano, Moore, Andrew S., Greenwood, Matthew, Kotecha, Rishi S., Fong, Chun Y., Yong, Agnes S. M., Yeung, David T., Breen, James, White, Deborah L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10571859/
https://www.ncbi.nlm.nih.gov/pubmed/37835427
http://dx.doi.org/10.3390/cancers15194731
_version_ 1785120100268900352
author Thomson, Ashlee J.
Rehn, Jacqueline A.
Heatley, Susan L.
Eadie, Laura N.
Page, Elyse C.
Schutz, Caitlin
McClure, Barbara J.
Sutton, Rosemary
Dalla-Pozza, Luciano
Moore, Andrew S.
Greenwood, Matthew
Kotecha, Rishi S.
Fong, Chun Y.
Yong, Agnes S. M.
Yeung, David T.
Breen, James
White, Deborah L.
author_facet Thomson, Ashlee J.
Rehn, Jacqueline A.
Heatley, Susan L.
Eadie, Laura N.
Page, Elyse C.
Schutz, Caitlin
McClure, Barbara J.
Sutton, Rosemary
Dalla-Pozza, Luciano
Moore, Andrew S.
Greenwood, Matthew
Kotecha, Rishi S.
Fong, Chun Y.
Yong, Agnes S. M.
Yeung, David T.
Breen, James
White, Deborah L.
author_sort Thomson, Ashlee J.
collection PubMed
description SIMPLE SUMMARY: B-cell acute lymphoblastic leukaemia (B-ALL) is a haematological malignancy driven by diverse genomic alterations, the most common being gene fusions, which can be detected via transcriptomic analysis. However, detecting gene fusions involving the Immunoglobulin Heavy Chain (IGH) region can be challenging due to its hyper variability. Our aim was to develop a workflow containing the algorithms FusionCatcher, Arriba, and STAR-Fusion, to achieve best-case sensitivity for IGH gene fusion detection. We analysed 35 patient samples harbouring IGH gene fusions and assessed the detection rates for each caller, before optimising the parameters to enhance sensitivity for IGH fusions. FusionCatcher and Arriba outperformed STAR-Fusion; however, by adjusting specific filtering parameters, we were able to improve STAR-Fusion’s performance. This analysis highlights the importance of filtering optimization for IGH gene fusion events, offering alternative workflows for difficult-to-detect high-risk B-ALL alterations. ABSTRACT: B-cell acute lymphoblastic leukaemia (B-ALL) is characterised by diverse genomic alterations, the most frequent being gene fusions detected via transcriptomic analysis (mRNA-seq). Due to its hypervariable nature, gene fusions involving the Immunoglobulin Heavy Chain (IGH) locus can be difficult to detect with standard gene fusion calling algorithms and significant computational resources and analysis times are required. We aimed to optimize a gene fusion calling workflow to achieve best-case sensitivity for IGH gene fusion detection. Using Nextflow, we developed a simplified workflow containing the algorithms FusionCatcher, Arriba, and STAR-Fusion. We analysed samples from 35 patients harbouring IGH fusions (IGH::CRLF2 n = 17, IGH::DUX4 n = 15, IGH::EPOR n = 3) and assessed the detection rates for each caller, before optimizing the parameters to enhance sensitivity for IGH fusions. Initial results showed that FusionCatcher and Arriba outperformed STAR-Fusion (85–89% vs. 29% of IGH fusions reported). We found that extensive filtering in STAR-Fusion hindered IGH reporting. By adjusting specific filtering steps (e.g., read support, fusion fragments per million total reads), we achieved a 94% reporting rate for IGH fusions with STAR-Fusion. This analysis highlights the importance of filtering optimization for IGH gene fusion events, offering alternative workflows for difficult-to-detect high-risk B-ALL subtypes.
format Online
Article
Text
id pubmed-10571859
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-105718592023-10-14 Reproducible Bioinformatics Analysis Workflows for Detecting IGH Gene Fusions in B-Cell Acute Lymphoblastic Leukaemia Patients Thomson, Ashlee J. Rehn, Jacqueline A. Heatley, Susan L. Eadie, Laura N. Page, Elyse C. Schutz, Caitlin McClure, Barbara J. Sutton, Rosemary Dalla-Pozza, Luciano Moore, Andrew S. Greenwood, Matthew Kotecha, Rishi S. Fong, Chun Y. Yong, Agnes S. M. Yeung, David T. Breen, James White, Deborah L. Cancers (Basel) Article SIMPLE SUMMARY: B-cell acute lymphoblastic leukaemia (B-ALL) is a haematological malignancy driven by diverse genomic alterations, the most common being gene fusions, which can be detected via transcriptomic analysis. However, detecting gene fusions involving the Immunoglobulin Heavy Chain (IGH) region can be challenging due to its hyper variability. Our aim was to develop a workflow containing the algorithms FusionCatcher, Arriba, and STAR-Fusion, to achieve best-case sensitivity for IGH gene fusion detection. We analysed 35 patient samples harbouring IGH gene fusions and assessed the detection rates for each caller, before optimising the parameters to enhance sensitivity for IGH fusions. FusionCatcher and Arriba outperformed STAR-Fusion; however, by adjusting specific filtering parameters, we were able to improve STAR-Fusion’s performance. This analysis highlights the importance of filtering optimization for IGH gene fusion events, offering alternative workflows for difficult-to-detect high-risk B-ALL alterations. ABSTRACT: B-cell acute lymphoblastic leukaemia (B-ALL) is characterised by diverse genomic alterations, the most frequent being gene fusions detected via transcriptomic analysis (mRNA-seq). Due to its hypervariable nature, gene fusions involving the Immunoglobulin Heavy Chain (IGH) locus can be difficult to detect with standard gene fusion calling algorithms and significant computational resources and analysis times are required. We aimed to optimize a gene fusion calling workflow to achieve best-case sensitivity for IGH gene fusion detection. Using Nextflow, we developed a simplified workflow containing the algorithms FusionCatcher, Arriba, and STAR-Fusion. We analysed samples from 35 patients harbouring IGH fusions (IGH::CRLF2 n = 17, IGH::DUX4 n = 15, IGH::EPOR n = 3) and assessed the detection rates for each caller, before optimizing the parameters to enhance sensitivity for IGH fusions. Initial results showed that FusionCatcher and Arriba outperformed STAR-Fusion (85–89% vs. 29% of IGH fusions reported). We found that extensive filtering in STAR-Fusion hindered IGH reporting. By adjusting specific filtering steps (e.g., read support, fusion fragments per million total reads), we achieved a 94% reporting rate for IGH fusions with STAR-Fusion. This analysis highlights the importance of filtering optimization for IGH gene fusion events, offering alternative workflows for difficult-to-detect high-risk B-ALL subtypes. MDPI 2023-09-26 /pmc/articles/PMC10571859/ /pubmed/37835427 http://dx.doi.org/10.3390/cancers15194731 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Thomson, Ashlee J.
Rehn, Jacqueline A.
Heatley, Susan L.
Eadie, Laura N.
Page, Elyse C.
Schutz, Caitlin
McClure, Barbara J.
Sutton, Rosemary
Dalla-Pozza, Luciano
Moore, Andrew S.
Greenwood, Matthew
Kotecha, Rishi S.
Fong, Chun Y.
Yong, Agnes S. M.
Yeung, David T.
Breen, James
White, Deborah L.
Reproducible Bioinformatics Analysis Workflows for Detecting IGH Gene Fusions in B-Cell Acute Lymphoblastic Leukaemia Patients
title Reproducible Bioinformatics Analysis Workflows for Detecting IGH Gene Fusions in B-Cell Acute Lymphoblastic Leukaemia Patients
title_full Reproducible Bioinformatics Analysis Workflows for Detecting IGH Gene Fusions in B-Cell Acute Lymphoblastic Leukaemia Patients
title_fullStr Reproducible Bioinformatics Analysis Workflows for Detecting IGH Gene Fusions in B-Cell Acute Lymphoblastic Leukaemia Patients
title_full_unstemmed Reproducible Bioinformatics Analysis Workflows for Detecting IGH Gene Fusions in B-Cell Acute Lymphoblastic Leukaemia Patients
title_short Reproducible Bioinformatics Analysis Workflows for Detecting IGH Gene Fusions in B-Cell Acute Lymphoblastic Leukaemia Patients
title_sort reproducible bioinformatics analysis workflows for detecting igh gene fusions in b-cell acute lymphoblastic leukaemia patients
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10571859/
https://www.ncbi.nlm.nih.gov/pubmed/37835427
http://dx.doi.org/10.3390/cancers15194731
work_keys_str_mv AT thomsonashleej reproduciblebioinformaticsanalysisworkflowsfordetectingighgenefusionsinbcellacutelymphoblasticleukaemiapatients
AT rehnjacquelinea reproduciblebioinformaticsanalysisworkflowsfordetectingighgenefusionsinbcellacutelymphoblasticleukaemiapatients
AT heatleysusanl reproduciblebioinformaticsanalysisworkflowsfordetectingighgenefusionsinbcellacutelymphoblasticleukaemiapatients
AT eadielauran reproduciblebioinformaticsanalysisworkflowsfordetectingighgenefusionsinbcellacutelymphoblasticleukaemiapatients
AT pageelysec reproduciblebioinformaticsanalysisworkflowsfordetectingighgenefusionsinbcellacutelymphoblasticleukaemiapatients
AT schutzcaitlin reproduciblebioinformaticsanalysisworkflowsfordetectingighgenefusionsinbcellacutelymphoblasticleukaemiapatients
AT mcclurebarbaraj reproduciblebioinformaticsanalysisworkflowsfordetectingighgenefusionsinbcellacutelymphoblasticleukaemiapatients
AT suttonrosemary reproduciblebioinformaticsanalysisworkflowsfordetectingighgenefusionsinbcellacutelymphoblasticleukaemiapatients
AT dallapozzaluciano reproduciblebioinformaticsanalysisworkflowsfordetectingighgenefusionsinbcellacutelymphoblasticleukaemiapatients
AT mooreandrews reproduciblebioinformaticsanalysisworkflowsfordetectingighgenefusionsinbcellacutelymphoblasticleukaemiapatients
AT greenwoodmatthew reproduciblebioinformaticsanalysisworkflowsfordetectingighgenefusionsinbcellacutelymphoblasticleukaemiapatients
AT kotecharishis reproduciblebioinformaticsanalysisworkflowsfordetectingighgenefusionsinbcellacutelymphoblasticleukaemiapatients
AT fongchuny reproduciblebioinformaticsanalysisworkflowsfordetectingighgenefusionsinbcellacutelymphoblasticleukaemiapatients
AT yongagnessm reproduciblebioinformaticsanalysisworkflowsfordetectingighgenefusionsinbcellacutelymphoblasticleukaemiapatients
AT yeungdavidt reproduciblebioinformaticsanalysisworkflowsfordetectingighgenefusionsinbcellacutelymphoblasticleukaemiapatients
AT breenjames reproduciblebioinformaticsanalysisworkflowsfordetectingighgenefusionsinbcellacutelymphoblasticleukaemiapatients
AT whitedeborahl reproduciblebioinformaticsanalysisworkflowsfordetectingighgenefusionsinbcellacutelymphoblasticleukaemiapatients