Cargando…

Machine learning-based improvement of MDS-CBC score brings platelets into the limelight to optimize smear review in the hematology laboratory

BACKGROUND: Myelodysplastic syndromes (MDS) are clonal hematopoietic diseases of the elderly characterized by chronic cytopenias, ineffective and dysplastic haematopoiesis, recurrent genetic abnormalities and increased risk of progression to acute myeloid leukemia. A challenge of routine laboratory...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Jaja, Lemaire, Pierre, Mathis, Stéphanie, Ronez, Emily, Clauser, Sylvain, Jondeau, Katayoun, Fenaux, Pierre, Adès, Lionel, Bardet, Valérie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9464379/
https://www.ncbi.nlm.nih.gov/pubmed/36088307
http://dx.doi.org/10.1186/s12885-022-10059-8
_version_ 1784787568775135232
author Zhu, Jaja
Lemaire, Pierre
Mathis, Stéphanie
Ronez, Emily
Clauser, Sylvain
Jondeau, Katayoun
Fenaux, Pierre
Adès, Lionel
Bardet, Valérie
author_facet Zhu, Jaja
Lemaire, Pierre
Mathis, Stéphanie
Ronez, Emily
Clauser, Sylvain
Jondeau, Katayoun
Fenaux, Pierre
Adès, Lionel
Bardet, Valérie
author_sort Zhu, Jaja
collection PubMed
description BACKGROUND: Myelodysplastic syndromes (MDS) are clonal hematopoietic diseases of the elderly characterized by chronic cytopenias, ineffective and dysplastic haematopoiesis, recurrent genetic abnormalities and increased risk of progression to acute myeloid leukemia. A challenge of routine laboratory Complete Blood Counts (CBC) is to correctly identify MDS patients while simultaneously avoiding excess smear reviews. To optimize smear review, the latest generations of hematology analyzers provide new cell population data (CPD) parameters with an increased ability to screen MDS, among which the previously described MDS-CBC Score, based on Absolute Neutrophil Count (ANC), structural neutrophil dispersion (Ne-WX) and mean corpuscular volume (MCV). Ne-WX is increased in the presence of hypogranulated/degranulated neutrophils, a hallmark of dysplasia in the context of MDS or chronic myelomonocytic leukemia. Ne-WX and MCV are CPD derived from leukocytes and red blood cells, therefore the MDS-CBC score does not include any platelet-derived CPD. We asked whether this score could be improved by adding the immature platelet fraction (IPF), a CPD used as a surrogate marker of dysplastic thrombopoiesis. METHODS: Here, we studied a cohort of more than 500 individuals with cytopenias, including 168 MDS patients. In a first step, we used Breiman’s random forests algorithm, a machine-learning approach, to identify the most relevant parameters for MDS prediction. We then designed Classification And Regression Trees (CART) to evaluate, using resampling, the effect of model tuning parameters on performance and choose the “optimal” model across these parameters. RESULTS: Using random forests algorithm, we identified Ne-WX and IPF as the strongest discriminatory predictors, explaining 37 and 33% of diagnoses respectively. To obtain “simplified” trees, which could be easily implemented into laboratory middlewares, we designed CART combining MDS-CBC score and IPF. Optimal results were obtained using a MDS-CBC score threshold equal to 0.23, and an IPF threshold equal to 3%. CONCLUSIONS: We propose an extended MDS-CBC score, including CPD from the three myeloid lineages, to improve MDS diagnosis on routine laboratory CBCs and optimize smear reviews. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12885-022-10059-8.
format Online
Article
Text
id pubmed-9464379
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-94643792022-09-12 Machine learning-based improvement of MDS-CBC score brings platelets into the limelight to optimize smear review in the hematology laboratory Zhu, Jaja Lemaire, Pierre Mathis, Stéphanie Ronez, Emily Clauser, Sylvain Jondeau, Katayoun Fenaux, Pierre Adès, Lionel Bardet, Valérie BMC Cancer Research BACKGROUND: Myelodysplastic syndromes (MDS) are clonal hematopoietic diseases of the elderly characterized by chronic cytopenias, ineffective and dysplastic haematopoiesis, recurrent genetic abnormalities and increased risk of progression to acute myeloid leukemia. A challenge of routine laboratory Complete Blood Counts (CBC) is to correctly identify MDS patients while simultaneously avoiding excess smear reviews. To optimize smear review, the latest generations of hematology analyzers provide new cell population data (CPD) parameters with an increased ability to screen MDS, among which the previously described MDS-CBC Score, based on Absolute Neutrophil Count (ANC), structural neutrophil dispersion (Ne-WX) and mean corpuscular volume (MCV). Ne-WX is increased in the presence of hypogranulated/degranulated neutrophils, a hallmark of dysplasia in the context of MDS or chronic myelomonocytic leukemia. Ne-WX and MCV are CPD derived from leukocytes and red blood cells, therefore the MDS-CBC score does not include any platelet-derived CPD. We asked whether this score could be improved by adding the immature platelet fraction (IPF), a CPD used as a surrogate marker of dysplastic thrombopoiesis. METHODS: Here, we studied a cohort of more than 500 individuals with cytopenias, including 168 MDS patients. In a first step, we used Breiman’s random forests algorithm, a machine-learning approach, to identify the most relevant parameters for MDS prediction. We then designed Classification And Regression Trees (CART) to evaluate, using resampling, the effect of model tuning parameters on performance and choose the “optimal” model across these parameters. RESULTS: Using random forests algorithm, we identified Ne-WX and IPF as the strongest discriminatory predictors, explaining 37 and 33% of diagnoses respectively. To obtain “simplified” trees, which could be easily implemented into laboratory middlewares, we designed CART combining MDS-CBC score and IPF. Optimal results were obtained using a MDS-CBC score threshold equal to 0.23, and an IPF threshold equal to 3%. CONCLUSIONS: We propose an extended MDS-CBC score, including CPD from the three myeloid lineages, to improve MDS diagnosis on routine laboratory CBCs and optimize smear reviews. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12885-022-10059-8. BioMed Central 2022-09-10 /pmc/articles/PMC9464379/ /pubmed/36088307 http://dx.doi.org/10.1186/s12885-022-10059-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Zhu, Jaja
Lemaire, Pierre
Mathis, Stéphanie
Ronez, Emily
Clauser, Sylvain
Jondeau, Katayoun
Fenaux, Pierre
Adès, Lionel
Bardet, Valérie
Machine learning-based improvement of MDS-CBC score brings platelets into the limelight to optimize smear review in the hematology laboratory
title Machine learning-based improvement of MDS-CBC score brings platelets into the limelight to optimize smear review in the hematology laboratory
title_full Machine learning-based improvement of MDS-CBC score brings platelets into the limelight to optimize smear review in the hematology laboratory
title_fullStr Machine learning-based improvement of MDS-CBC score brings platelets into the limelight to optimize smear review in the hematology laboratory
title_full_unstemmed Machine learning-based improvement of MDS-CBC score brings platelets into the limelight to optimize smear review in the hematology laboratory
title_short Machine learning-based improvement of MDS-CBC score brings platelets into the limelight to optimize smear review in the hematology laboratory
title_sort machine learning-based improvement of mds-cbc score brings platelets into the limelight to optimize smear review in the hematology laboratory
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9464379/
https://www.ncbi.nlm.nih.gov/pubmed/36088307
http://dx.doi.org/10.1186/s12885-022-10059-8
work_keys_str_mv AT zhujaja machinelearningbasedimprovementofmdscbcscorebringsplateletsintothelimelighttooptimizesmearreviewinthehematologylaboratory
AT lemairepierre machinelearningbasedimprovementofmdscbcscorebringsplateletsintothelimelighttooptimizesmearreviewinthehematologylaboratory
AT mathisstephanie machinelearningbasedimprovementofmdscbcscorebringsplateletsintothelimelighttooptimizesmearreviewinthehematologylaboratory
AT ronezemily machinelearningbasedimprovementofmdscbcscorebringsplateletsintothelimelighttooptimizesmearreviewinthehematologylaboratory
AT clausersylvain machinelearningbasedimprovementofmdscbcscorebringsplateletsintothelimelighttooptimizesmearreviewinthehematologylaboratory
AT jondeaukatayoun machinelearningbasedimprovementofmdscbcscorebringsplateletsintothelimelighttooptimizesmearreviewinthehematologylaboratory
AT fenauxpierre machinelearningbasedimprovementofmdscbcscorebringsplateletsintothelimelighttooptimizesmearreviewinthehematologylaboratory
AT adeslionel machinelearningbasedimprovementofmdscbcscorebringsplateletsintothelimelighttooptimizesmearreviewinthehematologylaboratory
AT bardetvalerie machinelearningbasedimprovementofmdscbcscorebringsplateletsintothelimelighttooptimizesmearreviewinthehematologylaboratory