Cargando…
Identification and prediction of difficult-to-treat rheumatoid arthritis patients in structured and unstructured routine care data: results from a hackathon
BACKGROUND: The new concept of difficult-to-treat rheumatoid arthritis (D2T RA) refers to RA patients who remain symptomatic after several lines of treatment, resulting in a high patient and economic burden. During a hackathon, we aimed to identify and predict D2T RA patients in structured and unstr...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8265126/ https://www.ncbi.nlm.nih.gov/pubmed/34238346 http://dx.doi.org/10.1186/s13075-021-02560-5 |
_version_ | 1783719707690401792 |
---|---|
author | Messelink, Marianne A. Roodenrijs, Nadia M. T. van Es, Bram Hulsbergen-Veelken, Cornelia A. R. Jong, Sebastiaan Overmars, L. Malin Reteig, Leon C. Tan, Sander C. Tauber, Tjebbe van Laar, Jacob M. Welsing, Paco M. J. Haitjema, Saskia |
author_facet | Messelink, Marianne A. Roodenrijs, Nadia M. T. van Es, Bram Hulsbergen-Veelken, Cornelia A. R. Jong, Sebastiaan Overmars, L. Malin Reteig, Leon C. Tan, Sander C. Tauber, Tjebbe van Laar, Jacob M. Welsing, Paco M. J. Haitjema, Saskia |
author_sort | Messelink, Marianne A. |
collection | PubMed |
description | BACKGROUND: The new concept of difficult-to-treat rheumatoid arthritis (D2T RA) refers to RA patients who remain symptomatic after several lines of treatment, resulting in a high patient and economic burden. During a hackathon, we aimed to identify and predict D2T RA patients in structured and unstructured routine care data. METHODS: Routine care data of 1873 RA patients were extracted from the Utrecht Patient Oriented Database. Data from a previous cross-sectional study, in which 152 RA patients were clinically classified as either D2T or non-D2T, served as a validation set. Machine learning techniques, text mining, and feature importance analyses were performed to identify and predict D2T RA patients based on structured and unstructured routine care data. RESULTS: We identified 123 potentially new D2T RA patients by applying the D2T RA definition in structured and unstructured routine care data. Additionally, we developed a D2T RA identification model derived from a feature importance analysis of all available structured data (AUC-ROC 0.88 (95% CI 0.82–0.94)), and we demonstrated the potential of longitudinal hematological data to differentiate D2T from non-D2T RA patients using supervised dimension reduction. Lastly, using data up to the time of starting the first biological treatment, we predicted future development of D2TRA (AUC-ROC 0.73 (95% CI 0.71–0.75)). CONCLUSIONS: During this hackathon, we have demonstrated the potential of different techniques for the identification and prediction of D2T RA patients in structured as well as unstructured routine care data. The results are promising and should be optimized and validated in future research. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13075-021-02560-5. |
format | Online Article Text |
id | pubmed-8265126 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-82651262021-07-08 Identification and prediction of difficult-to-treat rheumatoid arthritis patients in structured and unstructured routine care data: results from a hackathon Messelink, Marianne A. Roodenrijs, Nadia M. T. van Es, Bram Hulsbergen-Veelken, Cornelia A. R. Jong, Sebastiaan Overmars, L. Malin Reteig, Leon C. Tan, Sander C. Tauber, Tjebbe van Laar, Jacob M. Welsing, Paco M. J. Haitjema, Saskia Arthritis Res Ther Research Article BACKGROUND: The new concept of difficult-to-treat rheumatoid arthritis (D2T RA) refers to RA patients who remain symptomatic after several lines of treatment, resulting in a high patient and economic burden. During a hackathon, we aimed to identify and predict D2T RA patients in structured and unstructured routine care data. METHODS: Routine care data of 1873 RA patients were extracted from the Utrecht Patient Oriented Database. Data from a previous cross-sectional study, in which 152 RA patients were clinically classified as either D2T or non-D2T, served as a validation set. Machine learning techniques, text mining, and feature importance analyses were performed to identify and predict D2T RA patients based on structured and unstructured routine care data. RESULTS: We identified 123 potentially new D2T RA patients by applying the D2T RA definition in structured and unstructured routine care data. Additionally, we developed a D2T RA identification model derived from a feature importance analysis of all available structured data (AUC-ROC 0.88 (95% CI 0.82–0.94)), and we demonstrated the potential of longitudinal hematological data to differentiate D2T from non-D2T RA patients using supervised dimension reduction. Lastly, using data up to the time of starting the first biological treatment, we predicted future development of D2TRA (AUC-ROC 0.73 (95% CI 0.71–0.75)). CONCLUSIONS: During this hackathon, we have demonstrated the potential of different techniques for the identification and prediction of D2T RA patients in structured as well as unstructured routine care data. The results are promising and should be optimized and validated in future research. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13075-021-02560-5. BioMed Central 2021-07-08 2021 /pmc/articles/PMC8265126/ /pubmed/34238346 http://dx.doi.org/10.1186/s13075-021-02560-5 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Messelink, Marianne A. Roodenrijs, Nadia M. T. van Es, Bram Hulsbergen-Veelken, Cornelia A. R. Jong, Sebastiaan Overmars, L. Malin Reteig, Leon C. Tan, Sander C. Tauber, Tjebbe van Laar, Jacob M. Welsing, Paco M. J. Haitjema, Saskia Identification and prediction of difficult-to-treat rheumatoid arthritis patients in structured and unstructured routine care data: results from a hackathon |
title | Identification and prediction of difficult-to-treat rheumatoid arthritis patients in structured and unstructured routine care data: results from a hackathon |
title_full | Identification and prediction of difficult-to-treat rheumatoid arthritis patients in structured and unstructured routine care data: results from a hackathon |
title_fullStr | Identification and prediction of difficult-to-treat rheumatoid arthritis patients in structured and unstructured routine care data: results from a hackathon |
title_full_unstemmed | Identification and prediction of difficult-to-treat rheumatoid arthritis patients in structured and unstructured routine care data: results from a hackathon |
title_short | Identification and prediction of difficult-to-treat rheumatoid arthritis patients in structured and unstructured routine care data: results from a hackathon |
title_sort | identification and prediction of difficult-to-treat rheumatoid arthritis patients in structured and unstructured routine care data: results from a hackathon |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8265126/ https://www.ncbi.nlm.nih.gov/pubmed/34238346 http://dx.doi.org/10.1186/s13075-021-02560-5 |
work_keys_str_mv | AT messelinkmariannea identificationandpredictionofdifficulttotreatrheumatoidarthritispatientsinstructuredandunstructuredroutinecaredataresultsfromahackathon AT roodenrijsnadiamt identificationandpredictionofdifficulttotreatrheumatoidarthritispatientsinstructuredandunstructuredroutinecaredataresultsfromahackathon AT vanesbram identificationandpredictionofdifficulttotreatrheumatoidarthritispatientsinstructuredandunstructuredroutinecaredataresultsfromahackathon AT hulsbergenveelkencorneliaar identificationandpredictionofdifficulttotreatrheumatoidarthritispatientsinstructuredandunstructuredroutinecaredataresultsfromahackathon AT jongsebastiaan identificationandpredictionofdifficulttotreatrheumatoidarthritispatientsinstructuredandunstructuredroutinecaredataresultsfromahackathon AT overmarslmalin identificationandpredictionofdifficulttotreatrheumatoidarthritispatientsinstructuredandunstructuredroutinecaredataresultsfromahackathon AT reteigleonc identificationandpredictionofdifficulttotreatrheumatoidarthritispatientsinstructuredandunstructuredroutinecaredataresultsfromahackathon AT tansanderc identificationandpredictionofdifficulttotreatrheumatoidarthritispatientsinstructuredandunstructuredroutinecaredataresultsfromahackathon AT taubertjebbe identificationandpredictionofdifficulttotreatrheumatoidarthritispatientsinstructuredandunstructuredroutinecaredataresultsfromahackathon AT vanlaarjacobm identificationandpredictionofdifficulttotreatrheumatoidarthritispatientsinstructuredandunstructuredroutinecaredataresultsfromahackathon AT welsingpacomj identificationandpredictionofdifficulttotreatrheumatoidarthritispatientsinstructuredandunstructuredroutinecaredataresultsfromahackathon AT haitjemasaskia identificationandpredictionofdifficulttotreatrheumatoidarthritispatientsinstructuredandunstructuredroutinecaredataresultsfromahackathon |