Cargando…

A cell-level discriminative neural network model for diagnosis of blood cancers

MOTIVATION: Precise identification of cancer cells in patient samples is essential for accurate diagnosis and clinical monitoring but has been a significant challenge in machine learning approaches for cancer precision medicine. In most scenarios, training data are only available with disease annota...

Descripción completa

Detalles Bibliográficos
Autores principales: Robles, Edgar E, Jin, Ye, Smyth, Padhraic, Scheuermann, Richard H, Bui, Jack D, Wang, Huan-You, Oak, Jean, Qian, Yu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10563151/
https://www.ncbi.nlm.nih.gov/pubmed/37756695
http://dx.doi.org/10.1093/bioinformatics/btad585
_version_ 1785118277794529280
author Robles, Edgar E
Jin, Ye
Smyth, Padhraic
Scheuermann, Richard H
Bui, Jack D
Wang, Huan-You
Oak, Jean
Qian, Yu
author_facet Robles, Edgar E
Jin, Ye
Smyth, Padhraic
Scheuermann, Richard H
Bui, Jack D
Wang, Huan-You
Oak, Jean
Qian, Yu
author_sort Robles, Edgar E
collection PubMed
description MOTIVATION: Precise identification of cancer cells in patient samples is essential for accurate diagnosis and clinical monitoring but has been a significant challenge in machine learning approaches for cancer precision medicine. In most scenarios, training data are only available with disease annotation at the subject or sample level. Traditional approaches separate the classification process into multiple steps that are optimized independently. Recent methods either focus on predicting sample-level diagnosis without identifying individual pathologic cells or are less effective for identifying heterogeneous cancer cell phenotypes. RESULTS: We developed a generalized end-to-end differentiable model, the Cell Scoring Neural Network (CSNN), which takes sample-level training data and predicts the diagnosis of the testing samples and the identity of the diagnostic cells in the sample, simultaneously. The cell-level density differences between samples are linked to the sample diagnosis, which allows the probabilities of individual cells being diagnostic to be calculated using backpropagation. We applied CSNN to two independent clinical flow cytometry datasets for leukemia diagnosis. In both qualitative and quantitative assessments, CSNN outperformed preexisting neural network modeling approaches for both cancer diagnosis and cell-level classification. Post hoc decision trees and 2D dot plots were generated for interpretation of the identified cancer cells, showing that the identified cell phenotypes match the cancer endotypes observed clinically in patient cohorts. Independent data clustering analysis confirmed the identified cancer cell populations. AVAILABILITY AND IMPLEMENTATION: The source code of CSNN and datasets used in the experiments are publicly available on GitHub (http://github.com/erobl/csnn). Raw FCS files can be downloaded from FlowRepository (ID: FR-FCM-Z6YK).
format Online
Article
Text
id pubmed-10563151
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-105631512023-10-11 A cell-level discriminative neural network model for diagnosis of blood cancers Robles, Edgar E Jin, Ye Smyth, Padhraic Scheuermann, Richard H Bui, Jack D Wang, Huan-You Oak, Jean Qian, Yu Bioinformatics Original Paper MOTIVATION: Precise identification of cancer cells in patient samples is essential for accurate diagnosis and clinical monitoring but has been a significant challenge in machine learning approaches for cancer precision medicine. In most scenarios, training data are only available with disease annotation at the subject or sample level. Traditional approaches separate the classification process into multiple steps that are optimized independently. Recent methods either focus on predicting sample-level diagnosis without identifying individual pathologic cells or are less effective for identifying heterogeneous cancer cell phenotypes. RESULTS: We developed a generalized end-to-end differentiable model, the Cell Scoring Neural Network (CSNN), which takes sample-level training data and predicts the diagnosis of the testing samples and the identity of the diagnostic cells in the sample, simultaneously. The cell-level density differences between samples are linked to the sample diagnosis, which allows the probabilities of individual cells being diagnostic to be calculated using backpropagation. We applied CSNN to two independent clinical flow cytometry datasets for leukemia diagnosis. In both qualitative and quantitative assessments, CSNN outperformed preexisting neural network modeling approaches for both cancer diagnosis and cell-level classification. Post hoc decision trees and 2D dot plots were generated for interpretation of the identified cancer cells, showing that the identified cell phenotypes match the cancer endotypes observed clinically in patient cohorts. Independent data clustering analysis confirmed the identified cancer cell populations. AVAILABILITY AND IMPLEMENTATION: The source code of CSNN and datasets used in the experiments are publicly available on GitHub (http://github.com/erobl/csnn). Raw FCS files can be downloaded from FlowRepository (ID: FR-FCM-Z6YK). Oxford University Press 2023-09-26 /pmc/articles/PMC10563151/ /pubmed/37756695 http://dx.doi.org/10.1093/bioinformatics/btad585 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Robles, Edgar E
Jin, Ye
Smyth, Padhraic
Scheuermann, Richard H
Bui, Jack D
Wang, Huan-You
Oak, Jean
Qian, Yu
A cell-level discriminative neural network model for diagnosis of blood cancers
title A cell-level discriminative neural network model for diagnosis of blood cancers
title_full A cell-level discriminative neural network model for diagnosis of blood cancers
title_fullStr A cell-level discriminative neural network model for diagnosis of blood cancers
title_full_unstemmed A cell-level discriminative neural network model for diagnosis of blood cancers
title_short A cell-level discriminative neural network model for diagnosis of blood cancers
title_sort cell-level discriminative neural network model for diagnosis of blood cancers
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10563151/
https://www.ncbi.nlm.nih.gov/pubmed/37756695
http://dx.doi.org/10.1093/bioinformatics/btad585
work_keys_str_mv AT roblesedgare acellleveldiscriminativeneuralnetworkmodelfordiagnosisofbloodcancers
AT jinye acellleveldiscriminativeneuralnetworkmodelfordiagnosisofbloodcancers
AT smythpadhraic acellleveldiscriminativeneuralnetworkmodelfordiagnosisofbloodcancers
AT scheuermannrichardh acellleveldiscriminativeneuralnetworkmodelfordiagnosisofbloodcancers
AT buijackd acellleveldiscriminativeneuralnetworkmodelfordiagnosisofbloodcancers
AT wanghuanyou acellleveldiscriminativeneuralnetworkmodelfordiagnosisofbloodcancers
AT oakjean acellleveldiscriminativeneuralnetworkmodelfordiagnosisofbloodcancers
AT qianyu acellleveldiscriminativeneuralnetworkmodelfordiagnosisofbloodcancers
AT roblesedgare cellleveldiscriminativeneuralnetworkmodelfordiagnosisofbloodcancers
AT jinye cellleveldiscriminativeneuralnetworkmodelfordiagnosisofbloodcancers
AT smythpadhraic cellleveldiscriminativeneuralnetworkmodelfordiagnosisofbloodcancers
AT scheuermannrichardh cellleveldiscriminativeneuralnetworkmodelfordiagnosisofbloodcancers
AT buijackd cellleveldiscriminativeneuralnetworkmodelfordiagnosisofbloodcancers
AT wanghuanyou cellleveldiscriminativeneuralnetworkmodelfordiagnosisofbloodcancers
AT oakjean cellleveldiscriminativeneuralnetworkmodelfordiagnosisofbloodcancers
AT qianyu cellleveldiscriminativeneuralnetworkmodelfordiagnosisofbloodcancers