Cargando…

Multi-Cohort Transcriptomic Subtyping of B-Cell Acute Lymphoblastic Leukemia

RNA sequencing provides a snapshot of the functional consequences of genomic lesions that drive acute lymphoblastic leukemia (ALL). The aims of this study were to elucidate diagnostic associations (via machine learning) between mRNA-seq profiles, independently verify ALL lesions and develop easy-to-...

Descripción completa

Detalles Bibliográficos
Autores principales: Mäkinen, Ville-Petteri, Rehn, Jacqueline, Breen, James, Yeung, David, White, Deborah L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9099612/
https://www.ncbi.nlm.nih.gov/pubmed/35562965
http://dx.doi.org/10.3390/ijms23094574
_version_ 1784706649083084800
author Mäkinen, Ville-Petteri
Rehn, Jacqueline
Breen, James
Yeung, David
White, Deborah L.
author_facet Mäkinen, Ville-Petteri
Rehn, Jacqueline
Breen, James
Yeung, David
White, Deborah L.
author_sort Mäkinen, Ville-Petteri
collection PubMed
description RNA sequencing provides a snapshot of the functional consequences of genomic lesions that drive acute lymphoblastic leukemia (ALL). The aims of this study were to elucidate diagnostic associations (via machine learning) between mRNA-seq profiles, independently verify ALL lesions and develop easy-to-interpret transcriptome-wide biomarkers for ALL subtyping in the clinical setting. A training dataset of 1279 ALL patients from six North American cohorts was used for developing machine learning models. Results were validated in 767 patients from Australia with a quality control dataset across 31 tissues from 1160 non-ALL donors. A novel batch correction method was introduced and applied to adjust for cohort differences. Out of 18,503 genes with usable expression, 11,830 (64%) were confounded by cohort effects and excluded. Six ALL subtypes (ETV6::RUNX1, KMT2A, DUX4, PAX5 P80R, TCF3::PBX1, ZNF384) that covered 32% of patients were robustly detected by mRNA-seq (positive predictive value ≥ 87%). Five other frequent subtypes (CRLF2, hypodiploid, hyperdiploid, PAX5 alterations and Ph-positive) were distinguishable in 40% of patients at lower accuracy (52% ≤ positive predictive value ≤ 73%). Based on these findings, we introduce the Allspice R package to predict ALL subtypes and driver genes from unadjusted mRNA-seq read counts as encountered in real-world settings. Two examples of Allspice applied to previously unseen ALL patient samples with atypical lesions are included.
format Online
Article
Text
id pubmed-9099612
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-90996122022-05-14 Multi-Cohort Transcriptomic Subtyping of B-Cell Acute Lymphoblastic Leukemia Mäkinen, Ville-Petteri Rehn, Jacqueline Breen, James Yeung, David White, Deborah L. Int J Mol Sci Article RNA sequencing provides a snapshot of the functional consequences of genomic lesions that drive acute lymphoblastic leukemia (ALL). The aims of this study were to elucidate diagnostic associations (via machine learning) between mRNA-seq profiles, independently verify ALL lesions and develop easy-to-interpret transcriptome-wide biomarkers for ALL subtyping in the clinical setting. A training dataset of 1279 ALL patients from six North American cohorts was used for developing machine learning models. Results were validated in 767 patients from Australia with a quality control dataset across 31 tissues from 1160 non-ALL donors. A novel batch correction method was introduced and applied to adjust for cohort differences. Out of 18,503 genes with usable expression, 11,830 (64%) were confounded by cohort effects and excluded. Six ALL subtypes (ETV6::RUNX1, KMT2A, DUX4, PAX5 P80R, TCF3::PBX1, ZNF384) that covered 32% of patients were robustly detected by mRNA-seq (positive predictive value ≥ 87%). Five other frequent subtypes (CRLF2, hypodiploid, hyperdiploid, PAX5 alterations and Ph-positive) were distinguishable in 40% of patients at lower accuracy (52% ≤ positive predictive value ≤ 73%). Based on these findings, we introduce the Allspice R package to predict ALL subtypes and driver genes from unadjusted mRNA-seq read counts as encountered in real-world settings. Two examples of Allspice applied to previously unseen ALL patient samples with atypical lesions are included. MDPI 2022-04-20 /pmc/articles/PMC9099612/ /pubmed/35562965 http://dx.doi.org/10.3390/ijms23094574 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Mäkinen, Ville-Petteri
Rehn, Jacqueline
Breen, James
Yeung, David
White, Deborah L.
Multi-Cohort Transcriptomic Subtyping of B-Cell Acute Lymphoblastic Leukemia
title Multi-Cohort Transcriptomic Subtyping of B-Cell Acute Lymphoblastic Leukemia
title_full Multi-Cohort Transcriptomic Subtyping of B-Cell Acute Lymphoblastic Leukemia
title_fullStr Multi-Cohort Transcriptomic Subtyping of B-Cell Acute Lymphoblastic Leukemia
title_full_unstemmed Multi-Cohort Transcriptomic Subtyping of B-Cell Acute Lymphoblastic Leukemia
title_short Multi-Cohort Transcriptomic Subtyping of B-Cell Acute Lymphoblastic Leukemia
title_sort multi-cohort transcriptomic subtyping of b-cell acute lymphoblastic leukemia
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9099612/
https://www.ncbi.nlm.nih.gov/pubmed/35562965
http://dx.doi.org/10.3390/ijms23094574
work_keys_str_mv AT makinenvillepetteri multicohorttranscriptomicsubtypingofbcellacutelymphoblasticleukemia
AT rehnjacqueline multicohorttranscriptomicsubtypingofbcellacutelymphoblasticleukemia
AT breenjames multicohorttranscriptomicsubtypingofbcellacutelymphoblasticleukemia
AT yeungdavid multicohorttranscriptomicsubtypingofbcellacutelymphoblasticleukemia
AT whitedeborahl multicohorttranscriptomicsubtypingofbcellacutelymphoblasticleukemia