Cargando…
Explainable machine learning methods and respiratory oscillometry for the diagnosis of respiratory abnormalities in sarcoidosis
BACKGROUND: In this work, we developed many machine learning classifiers to assist in diagnosing respiratory changes associated with sarcoidosis, based on results from the Forced Oscillation Technique (FOT), a non-invasive method used to assess pulmonary mechanics. In addition to accurate results, t...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9583465/ https://www.ncbi.nlm.nih.gov/pubmed/36266674 http://dx.doi.org/10.1186/s12911-022-02021-2 |
_version_ | 1784813077270626304 |
---|---|
author | de Lima, Allan Danilo Lopes, Agnaldo J. do Amaral, Jorge Luis Machado de Melo, Pedro Lopes |
author_facet | de Lima, Allan Danilo Lopes, Agnaldo J. do Amaral, Jorge Luis Machado de Melo, Pedro Lopes |
author_sort | de Lima, Allan Danilo |
collection | PubMed |
description | BACKGROUND: In this work, we developed many machine learning classifiers to assist in diagnosing respiratory changes associated with sarcoidosis, based on results from the Forced Oscillation Technique (FOT), a non-invasive method used to assess pulmonary mechanics. In addition to accurate results, there is a particular interest in their interpretability and explainability, so we used Genetic Programming since the classification is made with intelligible expressions and we also evaluate the feature importance in different experiments to find the more discriminative features. METHODOLOGY/PRINCIPAL FINDINGS: We used genetic programming in its traditional tree form and a grammar-based form. To check if interpretable results are competitive, we compared their performance to K-Nearest Neighbors, Support Vector Machine, AdaBoost, Random Forest, LightGBM, XGBoost, Decision Trees and Logistic Regressor. We also performed experiments with fuzzy features and tested a feature selection technique to bring even more interpretability. The data used to feed the classifiers come from the FOT exams in 72 individuals, of which 25 were healthy, and 47 were diagnosed with sarcoidosis. Among the latter, 24 showed normal conditions by spirometry, and 23 showed respiratory changes. The results achieved high accuracy (AUC > 0.90) in two analyses performed (controls vs. individuals with sarcoidosis and normal spirometry and controls vs. individuals with sarcoidosis and altered spirometry). Genetic Programming and Grammatical Evolution were particularly beneficial because they provide intelligible expressions to make the classification. The observation of which features were selected most frequently also brought explainability to the study of sarcoidosis. CONCLUSIONS: The proposed system may provide decision support for clinicians when they are struggling to give a confirmed clinical diagnosis. Clinicians may reference the prediction results and make better decisions, improving the productivity of pulmonary function services by AI-assisted workflow. |
format | Online Article Text |
id | pubmed-9583465 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-95834652022-10-21 Explainable machine learning methods and respiratory oscillometry for the diagnosis of respiratory abnormalities in sarcoidosis de Lima, Allan Danilo Lopes, Agnaldo J. do Amaral, Jorge Luis Machado de Melo, Pedro Lopes BMC Med Inform Decis Mak Research BACKGROUND: In this work, we developed many machine learning classifiers to assist in diagnosing respiratory changes associated with sarcoidosis, based on results from the Forced Oscillation Technique (FOT), a non-invasive method used to assess pulmonary mechanics. In addition to accurate results, there is a particular interest in their interpretability and explainability, so we used Genetic Programming since the classification is made with intelligible expressions and we also evaluate the feature importance in different experiments to find the more discriminative features. METHODOLOGY/PRINCIPAL FINDINGS: We used genetic programming in its traditional tree form and a grammar-based form. To check if interpretable results are competitive, we compared their performance to K-Nearest Neighbors, Support Vector Machine, AdaBoost, Random Forest, LightGBM, XGBoost, Decision Trees and Logistic Regressor. We also performed experiments with fuzzy features and tested a feature selection technique to bring even more interpretability. The data used to feed the classifiers come from the FOT exams in 72 individuals, of which 25 were healthy, and 47 were diagnosed with sarcoidosis. Among the latter, 24 showed normal conditions by spirometry, and 23 showed respiratory changes. The results achieved high accuracy (AUC > 0.90) in two analyses performed (controls vs. individuals with sarcoidosis and normal spirometry and controls vs. individuals with sarcoidosis and altered spirometry). Genetic Programming and Grammatical Evolution were particularly beneficial because they provide intelligible expressions to make the classification. The observation of which features were selected most frequently also brought explainability to the study of sarcoidosis. CONCLUSIONS: The proposed system may provide decision support for clinicians when they are struggling to give a confirmed clinical diagnosis. Clinicians may reference the prediction results and make better decisions, improving the productivity of pulmonary function services by AI-assisted workflow. BioMed Central 2022-10-20 /pmc/articles/PMC9583465/ /pubmed/36266674 http://dx.doi.org/10.1186/s12911-022-02021-2 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research de Lima, Allan Danilo Lopes, Agnaldo J. do Amaral, Jorge Luis Machado de Melo, Pedro Lopes Explainable machine learning methods and respiratory oscillometry for the diagnosis of respiratory abnormalities in sarcoidosis |
title | Explainable machine learning methods and respiratory oscillometry for the diagnosis of respiratory abnormalities in sarcoidosis |
title_full | Explainable machine learning methods and respiratory oscillometry for the diagnosis of respiratory abnormalities in sarcoidosis |
title_fullStr | Explainable machine learning methods and respiratory oscillometry for the diagnosis of respiratory abnormalities in sarcoidosis |
title_full_unstemmed | Explainable machine learning methods and respiratory oscillometry for the diagnosis of respiratory abnormalities in sarcoidosis |
title_short | Explainable machine learning methods and respiratory oscillometry for the diagnosis of respiratory abnormalities in sarcoidosis |
title_sort | explainable machine learning methods and respiratory oscillometry for the diagnosis of respiratory abnormalities in sarcoidosis |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9583465/ https://www.ncbi.nlm.nih.gov/pubmed/36266674 http://dx.doi.org/10.1186/s12911-022-02021-2 |
work_keys_str_mv | AT delimaallandanilo explainablemachinelearningmethodsandrespiratoryoscillometryforthediagnosisofrespiratoryabnormalitiesinsarcoidosis AT lopesagnaldoj explainablemachinelearningmethodsandrespiratoryoscillometryforthediagnosisofrespiratoryabnormalitiesinsarcoidosis AT doamaraljorgeluismachado explainablemachinelearningmethodsandrespiratoryoscillometryforthediagnosisofrespiratoryabnormalitiesinsarcoidosis AT demelopedrolopes explainablemachinelearningmethodsandrespiratoryoscillometryforthediagnosisofrespiratoryabnormalitiesinsarcoidosis |