Cargando…

Machine learning-based prediction model for responses of bDMARDs in patients with rheumatoid arthritis and ankylosing spondylitis

BACKGROUND: Few studies on rheumatoid arthritis (RA) have generated machine learning models to predict biologic disease-modifying antirheumatic drugs (bDMARDs) responses; however, these studies included insufficient analysis on important features. Moreover, machine learning is yet to be used to pred...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Seulkee, Kang, Seonyoung, Eun, Yeonghee, Won, Hong-Hee, Kim, Hyungjin, Lee, Jaejoon, Koh, Eun-Mi, Cha, Hoon-Suk
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8501710/
https://www.ncbi.nlm.nih.gov/pubmed/34627335
http://dx.doi.org/10.1186/s13075-021-02635-3
_version_ 1784580743602634752
author Lee, Seulkee
Kang, Seonyoung
Eun, Yeonghee
Won, Hong-Hee
Kim, Hyungjin
Lee, Jaejoon
Koh, Eun-Mi
Cha, Hoon-Suk
author_facet Lee, Seulkee
Kang, Seonyoung
Eun, Yeonghee
Won, Hong-Hee
Kim, Hyungjin
Lee, Jaejoon
Koh, Eun-Mi
Cha, Hoon-Suk
author_sort Lee, Seulkee
collection PubMed
description BACKGROUND: Few studies on rheumatoid arthritis (RA) have generated machine learning models to predict biologic disease-modifying antirheumatic drugs (bDMARDs) responses; however, these studies included insufficient analysis on important features. Moreover, machine learning is yet to be used to predict bDMARD responses in ankylosing spondylitis (AS). Thus, in this study, machine learning was used to predict such responses in RA and AS patients. METHODS: Data were retrieved from the Korean College of Rheumatology Biologics therapy (KOBIO) registry. The number of RA and AS patients in the training dataset were 625 and 611, respectively. We prepared independent test datasets that did not participate in any process of generating machine learning models. Baseline clinical characteristics were used as input features. Responders were defined as those who met the ACR 20% improvement response criteria (ACR20) and ASAS 20% improvement response criteria (ASAS20) in RA and AS, respectively, at the first follow-up. Multiple machine learning methods, including random forest (RF-method), were used to generate models to predict bDMARD responses, and we compared them with the logistic regression model. RESULTS: The RF-method model had superior prediction performance to logistic regression model (accuracy: 0.726 [95% confidence interval (CI): 0.725–0.730] vs. 0.689 [0.606–0.717], area under curve (AUC) of the receiver operating characteristic curve (ROC) 0.638 [0.576–0.658] vs. 0.565 [0.493–0.605], F1 score 0.841 [0.837–0.843] vs. 0.803 [0.732–0.828], AUC of the precision-recall curve 0.808 [0.763–0.829] vs. 0.754 [0.714–0.789]) with independent test datasets in patients with RA. However, machine learning and logistic regression exhibited similar prediction performance in AS patients. Furthermore, the patient self-reporting scales, which are patient global assessment of disease activity (PtGA) in RA and Bath Ankylosing Spondylitis Functional Index (BASFI) in AS, were revealed as the most important features in both diseases. CONCLUSIONS: RF-method exhibited superior prediction performance for responses of bDMARDs to a conventional statistical method, i.e., logistic regression, in RA patients. In contrast, despite the comparable size of the dataset, machine learning did not outperform in AS patients. The most important features of both diseases, according to feature importance analysis were patient self-reporting scales. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13075-021-02635-3.
format Online
Article
Text
id pubmed-8501710
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-85017102021-10-20 Machine learning-based prediction model for responses of bDMARDs in patients with rheumatoid arthritis and ankylosing spondylitis Lee, Seulkee Kang, Seonyoung Eun, Yeonghee Won, Hong-Hee Kim, Hyungjin Lee, Jaejoon Koh, Eun-Mi Cha, Hoon-Suk Arthritis Res Ther Research Article BACKGROUND: Few studies on rheumatoid arthritis (RA) have generated machine learning models to predict biologic disease-modifying antirheumatic drugs (bDMARDs) responses; however, these studies included insufficient analysis on important features. Moreover, machine learning is yet to be used to predict bDMARD responses in ankylosing spondylitis (AS). Thus, in this study, machine learning was used to predict such responses in RA and AS patients. METHODS: Data were retrieved from the Korean College of Rheumatology Biologics therapy (KOBIO) registry. The number of RA and AS patients in the training dataset were 625 and 611, respectively. We prepared independent test datasets that did not participate in any process of generating machine learning models. Baseline clinical characteristics were used as input features. Responders were defined as those who met the ACR 20% improvement response criteria (ACR20) and ASAS 20% improvement response criteria (ASAS20) in RA and AS, respectively, at the first follow-up. Multiple machine learning methods, including random forest (RF-method), were used to generate models to predict bDMARD responses, and we compared them with the logistic regression model. RESULTS: The RF-method model had superior prediction performance to logistic regression model (accuracy: 0.726 [95% confidence interval (CI): 0.725–0.730] vs. 0.689 [0.606–0.717], area under curve (AUC) of the receiver operating characteristic curve (ROC) 0.638 [0.576–0.658] vs. 0.565 [0.493–0.605], F1 score 0.841 [0.837–0.843] vs. 0.803 [0.732–0.828], AUC of the precision-recall curve 0.808 [0.763–0.829] vs. 0.754 [0.714–0.789]) with independent test datasets in patients with RA. However, machine learning and logistic regression exhibited similar prediction performance in AS patients. Furthermore, the patient self-reporting scales, which are patient global assessment of disease activity (PtGA) in RA and Bath Ankylosing Spondylitis Functional Index (BASFI) in AS, were revealed as the most important features in both diseases. CONCLUSIONS: RF-method exhibited superior prediction performance for responses of bDMARDs to a conventional statistical method, i.e., logistic regression, in RA patients. In contrast, despite the comparable size of the dataset, machine learning did not outperform in AS patients. The most important features of both diseases, according to feature importance analysis were patient self-reporting scales. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13075-021-02635-3. BioMed Central 2021-10-09 2021 /pmc/articles/PMC8501710/ /pubmed/34627335 http://dx.doi.org/10.1186/s13075-021-02635-3 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Lee, Seulkee
Kang, Seonyoung
Eun, Yeonghee
Won, Hong-Hee
Kim, Hyungjin
Lee, Jaejoon
Koh, Eun-Mi
Cha, Hoon-Suk
Machine learning-based prediction model for responses of bDMARDs in patients with rheumatoid arthritis and ankylosing spondylitis
title Machine learning-based prediction model for responses of bDMARDs in patients with rheumatoid arthritis and ankylosing spondylitis
title_full Machine learning-based prediction model for responses of bDMARDs in patients with rheumatoid arthritis and ankylosing spondylitis
title_fullStr Machine learning-based prediction model for responses of bDMARDs in patients with rheumatoid arthritis and ankylosing spondylitis
title_full_unstemmed Machine learning-based prediction model for responses of bDMARDs in patients with rheumatoid arthritis and ankylosing spondylitis
title_short Machine learning-based prediction model for responses of bDMARDs in patients with rheumatoid arthritis and ankylosing spondylitis
title_sort machine learning-based prediction model for responses of bdmards in patients with rheumatoid arthritis and ankylosing spondylitis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8501710/
https://www.ncbi.nlm.nih.gov/pubmed/34627335
http://dx.doi.org/10.1186/s13075-021-02635-3
work_keys_str_mv AT leeseulkee machinelearningbasedpredictionmodelforresponsesofbdmardsinpatientswithrheumatoidarthritisandankylosingspondylitis
AT kangseonyoung machinelearningbasedpredictionmodelforresponsesofbdmardsinpatientswithrheumatoidarthritisandankylosingspondylitis
AT eunyeonghee machinelearningbasedpredictionmodelforresponsesofbdmardsinpatientswithrheumatoidarthritisandankylosingspondylitis
AT wonhonghee machinelearningbasedpredictionmodelforresponsesofbdmardsinpatientswithrheumatoidarthritisandankylosingspondylitis
AT kimhyungjin machinelearningbasedpredictionmodelforresponsesofbdmardsinpatientswithrheumatoidarthritisandankylosingspondylitis
AT leejaejoon machinelearningbasedpredictionmodelforresponsesofbdmardsinpatientswithrheumatoidarthritisandankylosingspondylitis
AT koheunmi machinelearningbasedpredictionmodelforresponsesofbdmardsinpatientswithrheumatoidarthritisandankylosingspondylitis
AT chahoonsuk machinelearningbasedpredictionmodelforresponsesofbdmardsinpatientswithrheumatoidarthritisandankylosingspondylitis