Cargando…
Models solely using claims-based administrative data are poor predictors of rheumatoid arthritis disease activity
BACKGROUND: This study developed and validated a claims-based statistical model to predict rheumatoid arthritis (RA) disease activity, measured by the 28-joint count Disease Activity Score (DAS28). METHOD: Veterans enrolled in the Veterans Affairs Rheumatoid Arthritis (VARA) registry with one year o...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5422885/ https://www.ncbi.nlm.nih.gov/pubmed/28482933 http://dx.doi.org/10.1186/s13075-017-1294-0 |
_version_ | 1783234850445066240 |
---|---|
author | Sauer, Brian C. Teng, Chia-Chen Accortt, Neil A. Burningham, Zachary Collier, David Trivedi, Mona Cannon, Grant W. |
author_facet | Sauer, Brian C. Teng, Chia-Chen Accortt, Neil A. Burningham, Zachary Collier, David Trivedi, Mona Cannon, Grant W. |
author_sort | Sauer, Brian C. |
collection | PubMed |
description | BACKGROUND: This study developed and validated a claims-based statistical model to predict rheumatoid arthritis (RA) disease activity, measured by the 28-joint count Disease Activity Score (DAS28). METHOD: Veterans enrolled in the Veterans Affairs Rheumatoid Arthritis (VARA) registry with one year of data available for review before being assessed by the DAS28, were studied. Three models were developed based on initial selection of variables for analyses. The first model was based on clinically defined variables, the second leveraged grouping systems for high dimensional data and the third approach prescreened all possible predictors based on a significant bivariate association with the DAS28. The least absolute shrinkage and selection operator (LASSO) with fivefold cross-validation was used for variable selection and model development. Models were also compared for patients with <5 years to those ≥5 years of RA disease. Classification accuracy was examined for remission (DAS28 < 2.6) and for low (2.6–3.1), moderate (3.2–5.1) and high (>5.1) activity. RESULTS: There were 1582 Veterans who fulfilled inclusion criteria. The adjusted r-square for the three models tested ranged from 0.221 to 0.223. The models performed slightly better for patients with <5 years of RA disease than for patients with ≥5 years of RA disease. Correct classification of DAS28 categories ranged from 39.9% to 40.5% for the three models. CONCLUSION: The multiple models tested showed weak overall predictive accuracy in measuring DAS28. The models performed poorly at predicting patients with remission and high disease activity. Future research should investigate components of disease activity measures directly from medical records and incorporate additional laboratory and other clinical data. |
format | Online Article Text |
id | pubmed-5422885 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-54228852017-05-12 Models solely using claims-based administrative data are poor predictors of rheumatoid arthritis disease activity Sauer, Brian C. Teng, Chia-Chen Accortt, Neil A. Burningham, Zachary Collier, David Trivedi, Mona Cannon, Grant W. Arthritis Res Ther Research Article BACKGROUND: This study developed and validated a claims-based statistical model to predict rheumatoid arthritis (RA) disease activity, measured by the 28-joint count Disease Activity Score (DAS28). METHOD: Veterans enrolled in the Veterans Affairs Rheumatoid Arthritis (VARA) registry with one year of data available for review before being assessed by the DAS28, were studied. Three models were developed based on initial selection of variables for analyses. The first model was based on clinically defined variables, the second leveraged grouping systems for high dimensional data and the third approach prescreened all possible predictors based on a significant bivariate association with the DAS28. The least absolute shrinkage and selection operator (LASSO) with fivefold cross-validation was used for variable selection and model development. Models were also compared for patients with <5 years to those ≥5 years of RA disease. Classification accuracy was examined for remission (DAS28 < 2.6) and for low (2.6–3.1), moderate (3.2–5.1) and high (>5.1) activity. RESULTS: There were 1582 Veterans who fulfilled inclusion criteria. The adjusted r-square for the three models tested ranged from 0.221 to 0.223. The models performed slightly better for patients with <5 years of RA disease than for patients with ≥5 years of RA disease. Correct classification of DAS28 categories ranged from 39.9% to 40.5% for the three models. CONCLUSION: The multiple models tested showed weak overall predictive accuracy in measuring DAS28. The models performed poorly at predicting patients with remission and high disease activity. Future research should investigate components of disease activity measures directly from medical records and incorporate additional laboratory and other clinical data. BioMed Central 2017-05-08 2017 /pmc/articles/PMC5422885/ /pubmed/28482933 http://dx.doi.org/10.1186/s13075-017-1294-0 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Sauer, Brian C. Teng, Chia-Chen Accortt, Neil A. Burningham, Zachary Collier, David Trivedi, Mona Cannon, Grant W. Models solely using claims-based administrative data are poor predictors of rheumatoid arthritis disease activity |
title | Models solely using claims-based administrative data are poor predictors of rheumatoid arthritis disease activity |
title_full | Models solely using claims-based administrative data are poor predictors of rheumatoid arthritis disease activity |
title_fullStr | Models solely using claims-based administrative data are poor predictors of rheumatoid arthritis disease activity |
title_full_unstemmed | Models solely using claims-based administrative data are poor predictors of rheumatoid arthritis disease activity |
title_short | Models solely using claims-based administrative data are poor predictors of rheumatoid arthritis disease activity |
title_sort | models solely using claims-based administrative data are poor predictors of rheumatoid arthritis disease activity |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5422885/ https://www.ncbi.nlm.nih.gov/pubmed/28482933 http://dx.doi.org/10.1186/s13075-017-1294-0 |
work_keys_str_mv | AT sauerbrianc modelssolelyusingclaimsbasedadministrativedataarepoorpredictorsofrheumatoidarthritisdiseaseactivity AT tengchiachen modelssolelyusingclaimsbasedadministrativedataarepoorpredictorsofrheumatoidarthritisdiseaseactivity AT accorttneila modelssolelyusingclaimsbasedadministrativedataarepoorpredictorsofrheumatoidarthritisdiseaseactivity AT burninghamzachary modelssolelyusingclaimsbasedadministrativedataarepoorpredictorsofrheumatoidarthritisdiseaseactivity AT collierdavid modelssolelyusingclaimsbasedadministrativedataarepoorpredictorsofrheumatoidarthritisdiseaseactivity AT trivedimona modelssolelyusingclaimsbasedadministrativedataarepoorpredictorsofrheumatoidarthritisdiseaseactivity AT cannongrantw modelssolelyusingclaimsbasedadministrativedataarepoorpredictorsofrheumatoidarthritisdiseaseactivity |