Cargando…

Classifying flow cytometry data using Bayesian analysis helps to distinguish ALS patients from healthy controls

INTRODUCTION: Given its wide availability and cost-effectiveness, multidimensional flow cytometry (mFC) became a core method in the field of immunology allowing for the analysis of a broad range of individual cells providing insights into cell subset composition, cellular behavior, and cell-to-cell...

Descripción completa

Detalles Bibliográficos
Autores principales: Räuber, Saskia, Nelke, Christopher, Schroeter, Christina B., Barman, Sumanta, Pawlitzki, Marc, Ingwersen, Jens, Akgün, Katja, Günther, Rene, Garza, Alejandra P., Marggraf, Michaela, Dunay, Ildiko Rita, Schreiber, Stefanie, Vielhaber, Stefan, Ziemssen, Tjalf, Melzer, Nico, Ruck, Tobias, Meuth, Sven G., Herty, Michael
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10434536/
https://www.ncbi.nlm.nih.gov/pubmed/37600819
http://dx.doi.org/10.3389/fimmu.2023.1198860
_version_ 1785091916407242752
author Räuber, Saskia
Nelke, Christopher
Schroeter, Christina B.
Barman, Sumanta
Pawlitzki, Marc
Ingwersen, Jens
Akgün, Katja
Günther, Rene
Garza, Alejandra P.
Marggraf, Michaela
Dunay, Ildiko Rita
Schreiber, Stefanie
Vielhaber, Stefan
Ziemssen, Tjalf
Melzer, Nico
Ruck, Tobias
Meuth, Sven G.
Herty, Michael
author_facet Räuber, Saskia
Nelke, Christopher
Schroeter, Christina B.
Barman, Sumanta
Pawlitzki, Marc
Ingwersen, Jens
Akgün, Katja
Günther, Rene
Garza, Alejandra P.
Marggraf, Michaela
Dunay, Ildiko Rita
Schreiber, Stefanie
Vielhaber, Stefan
Ziemssen, Tjalf
Melzer, Nico
Ruck, Tobias
Meuth, Sven G.
Herty, Michael
author_sort Räuber, Saskia
collection PubMed
description INTRODUCTION: Given its wide availability and cost-effectiveness, multidimensional flow cytometry (mFC) became a core method in the field of immunology allowing for the analysis of a broad range of individual cells providing insights into cell subset composition, cellular behavior, and cell-to-cell interactions. Formerly, the analysis of mFC data solely relied on manual gating strategies. With the advent of novel computational approaches, (semi-)automated gating strategies and analysis tools complemented manual approaches. METHODS: Using Bayesian network analysis, we developed a mathematical model for the dependencies of different obtained mFC markers. The algorithm creates a Bayesian network that is a HC tree when including raw, ungated mFC data of a randomly selected healthy control cohort (HC). The HC tree is used to classify whether the observed marker distribution (either patients with amyotrophic lateral sclerosis (ALS) or HC) is predicted. The relative number of cells where the probability q is equal to zero is calculated reflecting the similarity in the marker distribution between a randomly chosen mFC file (ALS or HC) and the HC tree. RESULTS: Including peripheral blood mFC data from 68 ALS and 35 HC, the algorithm could correctly identify 64/68 ALS cases. Tuning of parameters revealed that the combination of 7 markers, 200 bins, and 20 patients achieved the highest AUC on a significance level of p < 0.0001. The markers CD4 and CD38 showed the highest zero probability. We successfully validated our approach by including a second, independent ALS and HC cohort (55 ALS and 30 HC). In this case, all ALS were correctly identified and side scatter and CD20 yielded the highest zero probability. Finally, both datasets were analyzed by the commercially available algorithm ‘Citrus’, which indicated superior ability of Bayesian network analysis when including raw, ungated mFC data. DISCUSSION: Bayesian network analysis might present a novel approach for classifying mFC data, which does not rely on reduction techniques, thus, allowing to retain information on the entire dataset. Future studies will have to assess the performance when discriminating clinically relevant differential diagnoses to evaluate the complementary diagnostic benefit of Bayesian network analysis to the clinical routine workup.
format Online
Article
Text
id pubmed-10434536
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-104345362023-08-18 Classifying flow cytometry data using Bayesian analysis helps to distinguish ALS patients from healthy controls Räuber, Saskia Nelke, Christopher Schroeter, Christina B. Barman, Sumanta Pawlitzki, Marc Ingwersen, Jens Akgün, Katja Günther, Rene Garza, Alejandra P. Marggraf, Michaela Dunay, Ildiko Rita Schreiber, Stefanie Vielhaber, Stefan Ziemssen, Tjalf Melzer, Nico Ruck, Tobias Meuth, Sven G. Herty, Michael Front Immunol Immunology INTRODUCTION: Given its wide availability and cost-effectiveness, multidimensional flow cytometry (mFC) became a core method in the field of immunology allowing for the analysis of a broad range of individual cells providing insights into cell subset composition, cellular behavior, and cell-to-cell interactions. Formerly, the analysis of mFC data solely relied on manual gating strategies. With the advent of novel computational approaches, (semi-)automated gating strategies and analysis tools complemented manual approaches. METHODS: Using Bayesian network analysis, we developed a mathematical model for the dependencies of different obtained mFC markers. The algorithm creates a Bayesian network that is a HC tree when including raw, ungated mFC data of a randomly selected healthy control cohort (HC). The HC tree is used to classify whether the observed marker distribution (either patients with amyotrophic lateral sclerosis (ALS) or HC) is predicted. The relative number of cells where the probability q is equal to zero is calculated reflecting the similarity in the marker distribution between a randomly chosen mFC file (ALS or HC) and the HC tree. RESULTS: Including peripheral blood mFC data from 68 ALS and 35 HC, the algorithm could correctly identify 64/68 ALS cases. Tuning of parameters revealed that the combination of 7 markers, 200 bins, and 20 patients achieved the highest AUC on a significance level of p < 0.0001. The markers CD4 and CD38 showed the highest zero probability. We successfully validated our approach by including a second, independent ALS and HC cohort (55 ALS and 30 HC). In this case, all ALS were correctly identified and side scatter and CD20 yielded the highest zero probability. Finally, both datasets were analyzed by the commercially available algorithm ‘Citrus’, which indicated superior ability of Bayesian network analysis when including raw, ungated mFC data. DISCUSSION: Bayesian network analysis might present a novel approach for classifying mFC data, which does not rely on reduction techniques, thus, allowing to retain information on the entire dataset. Future studies will have to assess the performance when discriminating clinically relevant differential diagnoses to evaluate the complementary diagnostic benefit of Bayesian network analysis to the clinical routine workup. Frontiers Media S.A. 2023-08-01 /pmc/articles/PMC10434536/ /pubmed/37600819 http://dx.doi.org/10.3389/fimmu.2023.1198860 Text en Copyright © 2023 Räuber, Nelke, Schroeter, Barman, Pawlitzki, Ingwersen, Akgün, Günther, Garza, Marggraf, Dunay, Schreiber, Vielhaber, Ziemssen, Melzer, Ruck, Meuth and Herty https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Immunology
Räuber, Saskia
Nelke, Christopher
Schroeter, Christina B.
Barman, Sumanta
Pawlitzki, Marc
Ingwersen, Jens
Akgün, Katja
Günther, Rene
Garza, Alejandra P.
Marggraf, Michaela
Dunay, Ildiko Rita
Schreiber, Stefanie
Vielhaber, Stefan
Ziemssen, Tjalf
Melzer, Nico
Ruck, Tobias
Meuth, Sven G.
Herty, Michael
Classifying flow cytometry data using Bayesian analysis helps to distinguish ALS patients from healthy controls
title Classifying flow cytometry data using Bayesian analysis helps to distinguish ALS patients from healthy controls
title_full Classifying flow cytometry data using Bayesian analysis helps to distinguish ALS patients from healthy controls
title_fullStr Classifying flow cytometry data using Bayesian analysis helps to distinguish ALS patients from healthy controls
title_full_unstemmed Classifying flow cytometry data using Bayesian analysis helps to distinguish ALS patients from healthy controls
title_short Classifying flow cytometry data using Bayesian analysis helps to distinguish ALS patients from healthy controls
title_sort classifying flow cytometry data using bayesian analysis helps to distinguish als patients from healthy controls
topic Immunology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10434536/
https://www.ncbi.nlm.nih.gov/pubmed/37600819
http://dx.doi.org/10.3389/fimmu.2023.1198860
work_keys_str_mv AT raubersaskia classifyingflowcytometrydatausingbayesiananalysishelpstodistinguishalspatientsfromhealthycontrols
AT nelkechristopher classifyingflowcytometrydatausingbayesiananalysishelpstodistinguishalspatientsfromhealthycontrols
AT schroeterchristinab classifyingflowcytometrydatausingbayesiananalysishelpstodistinguishalspatientsfromhealthycontrols
AT barmansumanta classifyingflowcytometrydatausingbayesiananalysishelpstodistinguishalspatientsfromhealthycontrols
AT pawlitzkimarc classifyingflowcytometrydatausingbayesiananalysishelpstodistinguishalspatientsfromhealthycontrols
AT ingwersenjens classifyingflowcytometrydatausingbayesiananalysishelpstodistinguishalspatientsfromhealthycontrols
AT akgunkatja classifyingflowcytometrydatausingbayesiananalysishelpstodistinguishalspatientsfromhealthycontrols
AT guntherrene classifyingflowcytometrydatausingbayesiananalysishelpstodistinguishalspatientsfromhealthycontrols
AT garzaalejandrap classifyingflowcytometrydatausingbayesiananalysishelpstodistinguishalspatientsfromhealthycontrols
AT marggrafmichaela classifyingflowcytometrydatausingbayesiananalysishelpstodistinguishalspatientsfromhealthycontrols
AT dunayildikorita classifyingflowcytometrydatausingbayesiananalysishelpstodistinguishalspatientsfromhealthycontrols
AT schreiberstefanie classifyingflowcytometrydatausingbayesiananalysishelpstodistinguishalspatientsfromhealthycontrols
AT vielhaberstefan classifyingflowcytometrydatausingbayesiananalysishelpstodistinguishalspatientsfromhealthycontrols
AT ziemssentjalf classifyingflowcytometrydatausingbayesiananalysishelpstodistinguishalspatientsfromhealthycontrols
AT melzernico classifyingflowcytometrydatausingbayesiananalysishelpstodistinguishalspatientsfromhealthycontrols
AT rucktobias classifyingflowcytometrydatausingbayesiananalysishelpstodistinguishalspatientsfromhealthycontrols
AT meuthsveng classifyingflowcytometrydatausingbayesiananalysishelpstodistinguishalspatientsfromhealthycontrols
AT hertymichael classifyingflowcytometrydatausingbayesiananalysishelpstodistinguishalspatientsfromhealthycontrols