Cargando…

Assessing clusters of comorbidities in rheumatoid arthritis: a machine learning approach

BACKGROUND: Comorbid conditions are very common in rheumatoid arthritis (RA) and several prior studies have clustered them using machine learning (ML). We applied various ML algorithms to compare the clusters of comorbidities derived and to assess the value of the clusters for predicting future clin...

Descripción completa

Detalles Bibliográficos
Autores principales: Solomon, Daniel H., Guan, Hongshu, Johansson, Fredrik D., Santacroce, Leah, Malley, Wendi, Guo, Lin, Litman, Heather
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10664370/
https://www.ncbi.nlm.nih.gov/pubmed/37993918
http://dx.doi.org/10.1186/s13075-023-03191-8
_version_ 1785148723527942144
author Solomon, Daniel H.
Guan, Hongshu
Johansson, Fredrik D.
Santacroce, Leah
Malley, Wendi
Guo, Lin
Litman, Heather
author_facet Solomon, Daniel H.
Guan, Hongshu
Johansson, Fredrik D.
Santacroce, Leah
Malley, Wendi
Guo, Lin
Litman, Heather
author_sort Solomon, Daniel H.
collection PubMed
description BACKGROUND: Comorbid conditions are very common in rheumatoid arthritis (RA) and several prior studies have clustered them using machine learning (ML). We applied various ML algorithms to compare the clusters of comorbidities derived and to assess the value of the clusters for predicting future clinical outcomes. METHODS: A large US-based RA registry, CorEvitas, was used to identify patients for the analysis. We assessed the presence of 24 comorbidities, and ML was used to derive clusters of patients with given comorbidities. K-mode, K-mean, regression-based, and hierarchical clustering were used. To assess the value of these clusters, we compared clusters across different ML algorithms in clinical outcome models predicting clinical disease activity index (CDAI) and health assessment questionnaire (HAQ-DI). We used data from the first 3 years of the 6-year study period to derive clusters and assess time-averaged values for CDAI and HAQ-DI during the latter 3 years. Model fit was assessed via adjusted R(2) and root mean square error for a series of models that included clusters from ML clustering and each of the 24 comorbidities separately. RESULTS: 11,883 patients with RA were included who had longitudinal data over 6 years. At baseline, patients were on average 59 (SD 12) years of age, 77% were women, CDAI was 11.3 (SD 11.9, moderate disease activity), HAQ-DI was 0.32 (SD 0.42), and disease duration was 10.8 (SD 9.9) years. During the 6 years of follow-up, the percentage of patients with various comorbidities increased. Using five clusters produced by each of the ML algorithms, multivariable regression models with time-averaged CDAI as an outcome found that the ML-derived comorbidity clusters produced similarly strong models as models with each of the 24 separate comorbidities entered individually. The same patterns were observed for HAQ-DI. CONCLUSIONS: Clustering comorbidities using ML algorithms is not computationally complex but often results in clusters that are difficult to interpret from a clinical standpoint. While ML clustering is useful for modeling multi-omics, using clusters to predict clinical outcomes produces models with a similar fit as those with individual comorbidities. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13075-023-03191-8.
format Online
Article
Text
id pubmed-10664370
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-106643702023-11-22 Assessing clusters of comorbidities in rheumatoid arthritis: a machine learning approach Solomon, Daniel H. Guan, Hongshu Johansson, Fredrik D. Santacroce, Leah Malley, Wendi Guo, Lin Litman, Heather Arthritis Res Ther Research BACKGROUND: Comorbid conditions are very common in rheumatoid arthritis (RA) and several prior studies have clustered them using machine learning (ML). We applied various ML algorithms to compare the clusters of comorbidities derived and to assess the value of the clusters for predicting future clinical outcomes. METHODS: A large US-based RA registry, CorEvitas, was used to identify patients for the analysis. We assessed the presence of 24 comorbidities, and ML was used to derive clusters of patients with given comorbidities. K-mode, K-mean, regression-based, and hierarchical clustering were used. To assess the value of these clusters, we compared clusters across different ML algorithms in clinical outcome models predicting clinical disease activity index (CDAI) and health assessment questionnaire (HAQ-DI). We used data from the first 3 years of the 6-year study period to derive clusters and assess time-averaged values for CDAI and HAQ-DI during the latter 3 years. Model fit was assessed via adjusted R(2) and root mean square error for a series of models that included clusters from ML clustering and each of the 24 comorbidities separately. RESULTS: 11,883 patients with RA were included who had longitudinal data over 6 years. At baseline, patients were on average 59 (SD 12) years of age, 77% were women, CDAI was 11.3 (SD 11.9, moderate disease activity), HAQ-DI was 0.32 (SD 0.42), and disease duration was 10.8 (SD 9.9) years. During the 6 years of follow-up, the percentage of patients with various comorbidities increased. Using five clusters produced by each of the ML algorithms, multivariable regression models with time-averaged CDAI as an outcome found that the ML-derived comorbidity clusters produced similarly strong models as models with each of the 24 separate comorbidities entered individually. The same patterns were observed for HAQ-DI. CONCLUSIONS: Clustering comorbidities using ML algorithms is not computationally complex but often results in clusters that are difficult to interpret from a clinical standpoint. While ML clustering is useful for modeling multi-omics, using clusters to predict clinical outcomes produces models with a similar fit as those with individual comorbidities. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13075-023-03191-8. BioMed Central 2023-11-22 2023 /pmc/articles/PMC10664370/ /pubmed/37993918 http://dx.doi.org/10.1186/s13075-023-03191-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Solomon, Daniel H.
Guan, Hongshu
Johansson, Fredrik D.
Santacroce, Leah
Malley, Wendi
Guo, Lin
Litman, Heather
Assessing clusters of comorbidities in rheumatoid arthritis: a machine learning approach
title Assessing clusters of comorbidities in rheumatoid arthritis: a machine learning approach
title_full Assessing clusters of comorbidities in rheumatoid arthritis: a machine learning approach
title_fullStr Assessing clusters of comorbidities in rheumatoid arthritis: a machine learning approach
title_full_unstemmed Assessing clusters of comorbidities in rheumatoid arthritis: a machine learning approach
title_short Assessing clusters of comorbidities in rheumatoid arthritis: a machine learning approach
title_sort assessing clusters of comorbidities in rheumatoid arthritis: a machine learning approach
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10664370/
https://www.ncbi.nlm.nih.gov/pubmed/37993918
http://dx.doi.org/10.1186/s13075-023-03191-8
work_keys_str_mv AT solomondanielh assessingclustersofcomorbiditiesinrheumatoidarthritisamachinelearningapproach
AT guanhongshu assessingclustersofcomorbiditiesinrheumatoidarthritisamachinelearningapproach
AT johanssonfredrikd assessingclustersofcomorbiditiesinrheumatoidarthritisamachinelearningapproach
AT santacroceleah assessingclustersofcomorbiditiesinrheumatoidarthritisamachinelearningapproach
AT malleywendi assessingclustersofcomorbiditiesinrheumatoidarthritisamachinelearningapproach
AT guolin assessingclustersofcomorbiditiesinrheumatoidarthritisamachinelearningapproach
AT litmanheather assessingclustersofcomorbiditiesinrheumatoidarthritisamachinelearningapproach