Cargando…
Development of diagnostic algorithm using machine learning for distinguishing between active tuberculosis and latent tuberculosis infection
BACKGROUND: The discrimination between active tuberculosis (ATB) and latent tuberculosis infection (LTBI) remains challenging. The present study aims to investigate the value of diagnostic models established by machine learning based on multiple laboratory data for distinguishing Mycobacterium tuber...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9798640/ https://www.ncbi.nlm.nih.gov/pubmed/36581808 http://dx.doi.org/10.1186/s12879-022-07954-7 |
_version_ | 1784860945901682688 |
---|---|
author | Luo, Ying Xue, Ying Liu, Wei Song, Huijuan Huang, Yi Tang, Guoxing Wang, Feng Wang, Qi Cai, Yimin Sun, Ziyong |
author_facet | Luo, Ying Xue, Ying Liu, Wei Song, Huijuan Huang, Yi Tang, Guoxing Wang, Feng Wang, Qi Cai, Yimin Sun, Ziyong |
author_sort | Luo, Ying |
collection | PubMed |
description | BACKGROUND: The discrimination between active tuberculosis (ATB) and latent tuberculosis infection (LTBI) remains challenging. The present study aims to investigate the value of diagnostic models established by machine learning based on multiple laboratory data for distinguishing Mycobacterium tuberculosis (Mtb) infection status. METHODS: T-SPOT, lymphocyte characteristic detection, and routine laboratory tests were performed on participants. Diagnostic models were built according to various algorithms. RESULTS: A total of 892 participants (468 ATB and 424 LTBI) and another 263 participants (125 ATB and 138 LTBI), were respectively enrolled at Tongji Hospital (discovery cohort) and Sino-French New City Hospital (validation cohort). Receiver operating characteristic (ROC) curve analysis showed that the value of individual indicator for differentiating ATB from LTBI was limited (area under the ROC curve (AUC) < 0.8). A total of 28 models were successfully established using machine learning. Among them, the AUCs of 25 models were more than 0.9 in test set. It was found that conditional random forests (cforest) model, based on the implementation of the random forest and bagging ensemble algorithms utilizing conditional inference trees as base learners, presented best discriminative power in segregating ATB from LTBI. Specially, cforest model presented an AUC of 0.978, with the sensitivity of 93.39% and the specificity of 91.18%. Mtb-specific response represented by early secreted antigenic target 6 (ESAT-6) and culture filtrate protein 10 (CFP-10) spot-forming cell (SFC) in T-SPOT assay, as well as global adaptive immunity assessed by CD4 cell IFN-γ secretion, CD8 cell IFN-γ secretion, and CD4 cell number, were found to contribute greatly to the cforest model. Superior performance obtained in the discovery cohort was further confirmed in the validation cohort. The sensitivity and specificity of cforest model in validation set were 92.80% and 89.86%, respectively. CONCLUSIONS: Cforest model developed upon machine learning could serve as a valuable and prospective tool for identifying Mtb infection status. The present study provided a novel and viable idea for realizing the clinical diagnostic application of the combination of machine learning and laboratory findings. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12879-022-07954-7. |
format | Online Article Text |
id | pubmed-9798640 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-97986402022-12-30 Development of diagnostic algorithm using machine learning for distinguishing between active tuberculosis and latent tuberculosis infection Luo, Ying Xue, Ying Liu, Wei Song, Huijuan Huang, Yi Tang, Guoxing Wang, Feng Wang, Qi Cai, Yimin Sun, Ziyong BMC Infect Dis Research BACKGROUND: The discrimination between active tuberculosis (ATB) and latent tuberculosis infection (LTBI) remains challenging. The present study aims to investigate the value of diagnostic models established by machine learning based on multiple laboratory data for distinguishing Mycobacterium tuberculosis (Mtb) infection status. METHODS: T-SPOT, lymphocyte characteristic detection, and routine laboratory tests were performed on participants. Diagnostic models were built according to various algorithms. RESULTS: A total of 892 participants (468 ATB and 424 LTBI) and another 263 participants (125 ATB and 138 LTBI), were respectively enrolled at Tongji Hospital (discovery cohort) and Sino-French New City Hospital (validation cohort). Receiver operating characteristic (ROC) curve analysis showed that the value of individual indicator for differentiating ATB from LTBI was limited (area under the ROC curve (AUC) < 0.8). A total of 28 models were successfully established using machine learning. Among them, the AUCs of 25 models were more than 0.9 in test set. It was found that conditional random forests (cforest) model, based on the implementation of the random forest and bagging ensemble algorithms utilizing conditional inference trees as base learners, presented best discriminative power in segregating ATB from LTBI. Specially, cforest model presented an AUC of 0.978, with the sensitivity of 93.39% and the specificity of 91.18%. Mtb-specific response represented by early secreted antigenic target 6 (ESAT-6) and culture filtrate protein 10 (CFP-10) spot-forming cell (SFC) in T-SPOT assay, as well as global adaptive immunity assessed by CD4 cell IFN-γ secretion, CD8 cell IFN-γ secretion, and CD4 cell number, were found to contribute greatly to the cforest model. Superior performance obtained in the discovery cohort was further confirmed in the validation cohort. The sensitivity and specificity of cforest model in validation set were 92.80% and 89.86%, respectively. CONCLUSIONS: Cforest model developed upon machine learning could serve as a valuable and prospective tool for identifying Mtb infection status. The present study provided a novel and viable idea for realizing the clinical diagnostic application of the combination of machine learning and laboratory findings. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12879-022-07954-7. BioMed Central 2022-12-29 /pmc/articles/PMC9798640/ /pubmed/36581808 http://dx.doi.org/10.1186/s12879-022-07954-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Luo, Ying Xue, Ying Liu, Wei Song, Huijuan Huang, Yi Tang, Guoxing Wang, Feng Wang, Qi Cai, Yimin Sun, Ziyong Development of diagnostic algorithm using machine learning for distinguishing between active tuberculosis and latent tuberculosis infection |
title | Development of diagnostic algorithm using machine learning for distinguishing between active tuberculosis and latent tuberculosis infection |
title_full | Development of diagnostic algorithm using machine learning for distinguishing between active tuberculosis and latent tuberculosis infection |
title_fullStr | Development of diagnostic algorithm using machine learning for distinguishing between active tuberculosis and latent tuberculosis infection |
title_full_unstemmed | Development of diagnostic algorithm using machine learning for distinguishing between active tuberculosis and latent tuberculosis infection |
title_short | Development of diagnostic algorithm using machine learning for distinguishing between active tuberculosis and latent tuberculosis infection |
title_sort | development of diagnostic algorithm using machine learning for distinguishing between active tuberculosis and latent tuberculosis infection |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9798640/ https://www.ncbi.nlm.nih.gov/pubmed/36581808 http://dx.doi.org/10.1186/s12879-022-07954-7 |
work_keys_str_mv | AT luoying developmentofdiagnosticalgorithmusingmachinelearningfordistinguishingbetweenactivetuberculosisandlatenttuberculosisinfection AT xueying developmentofdiagnosticalgorithmusingmachinelearningfordistinguishingbetweenactivetuberculosisandlatenttuberculosisinfection AT liuwei developmentofdiagnosticalgorithmusingmachinelearningfordistinguishingbetweenactivetuberculosisandlatenttuberculosisinfection AT songhuijuan developmentofdiagnosticalgorithmusingmachinelearningfordistinguishingbetweenactivetuberculosisandlatenttuberculosisinfection AT huangyi developmentofdiagnosticalgorithmusingmachinelearningfordistinguishingbetweenactivetuberculosisandlatenttuberculosisinfection AT tangguoxing developmentofdiagnosticalgorithmusingmachinelearningfordistinguishingbetweenactivetuberculosisandlatenttuberculosisinfection AT wangfeng developmentofdiagnosticalgorithmusingmachinelearningfordistinguishingbetweenactivetuberculosisandlatenttuberculosisinfection AT wangqi developmentofdiagnosticalgorithmusingmachinelearningfordistinguishingbetweenactivetuberculosisandlatenttuberculosisinfection AT caiyimin developmentofdiagnosticalgorithmusingmachinelearningfordistinguishingbetweenactivetuberculosisandlatenttuberculosisinfection AT sunziyong developmentofdiagnosticalgorithmusingmachinelearningfordistinguishingbetweenactivetuberculosisandlatenttuberculosisinfection |