Cargando…

Exploring the use of machine learning for risk adjustment: A comparison of standard and penalized linear regression models in predicting health care costs in older adults

BACKGROUND: Payers and providers still primarily use ordinary least squares (OLS) to estimate expected economic and clinical outcomes for risk adjustment purposes. Penalized linear regression represents a practical and incremental step forward that provides transparency and interpretability within t...

Descripción completa

Detalles Bibliográficos
Autores principales: Kan, Hong J., Kharrazi, Hadi, Chang, Hsien-Yen, Bodycombe, Dave, Lemke, Klaus, Weiner, Jonathan P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6402678/
https://www.ncbi.nlm.nih.gov/pubmed/30840682
http://dx.doi.org/10.1371/journal.pone.0213258
_version_ 1783400444897263616
author Kan, Hong J.
Kharrazi, Hadi
Chang, Hsien-Yen
Bodycombe, Dave
Lemke, Klaus
Weiner, Jonathan P.
author_facet Kan, Hong J.
Kharrazi, Hadi
Chang, Hsien-Yen
Bodycombe, Dave
Lemke, Klaus
Weiner, Jonathan P.
author_sort Kan, Hong J.
collection PubMed
description BACKGROUND: Payers and providers still primarily use ordinary least squares (OLS) to estimate expected economic and clinical outcomes for risk adjustment purposes. Penalized linear regression represents a practical and incremental step forward that provides transparency and interpretability within the familiar regression framework. This study conducted an in-depth comparison of prediction performance of standard and penalized linear regression in predicting future health care costs in older adults. METHODS AND FINDINGS: This retrospective cohort study included 81,106 Medicare Advantage patients with 5 years of continuous medical and pharmacy insurance from 2009 to 2013. Total health care costs in 2013 were predicted with comorbidity indicators from 2009 to 2012. Using 2012 predictors only, OLS performed poorly (e.g., R(2) = 16.3%) compared to penalized linear regression models (R(2) ranging from 16.8 to 16.9%); using 2009–2012 predictors, the gap in prediction performance increased (R(2):15.0% versus 18.0–18.2%). OLS with a reduced set of predictors selected by lasso showed improved performance (R(2) = 16.6% with 2012 predictors, 17.4% with 2009–2012 predictors) relative to OLS without variable selection but still lagged behind the prediction performance of penalized regression. Lasso regression consistently generated prediction ratios closer to 1 across different levels of predicted risk compared to other models. CONCLUSIONS: This study demonstrated the advantages of using transparent and easy-to-interpret penalized linear regression for predicting future health care costs in older adults relative to standard linear regression. Penalized regression showed better performance than OLS in predicting health care costs. Applying penalized regression to longitudinal data increased prediction accuracy. Lasso regression in particular showed superior prediction ratios across low and high levels of predicted risk. Health care insurers, providers and policy makers may benefit from adopting penalized regression such as lasso regression for cost prediction to improve risk adjustment and population health management and thus better address the underlying needs and risk of the populations they serve.
format Online
Article
Text
id pubmed-6402678
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-64026782019-03-17 Exploring the use of machine learning for risk adjustment: A comparison of standard and penalized linear regression models in predicting health care costs in older adults Kan, Hong J. Kharrazi, Hadi Chang, Hsien-Yen Bodycombe, Dave Lemke, Klaus Weiner, Jonathan P. PLoS One Research Article BACKGROUND: Payers and providers still primarily use ordinary least squares (OLS) to estimate expected economic and clinical outcomes for risk adjustment purposes. Penalized linear regression represents a practical and incremental step forward that provides transparency and interpretability within the familiar regression framework. This study conducted an in-depth comparison of prediction performance of standard and penalized linear regression in predicting future health care costs in older adults. METHODS AND FINDINGS: This retrospective cohort study included 81,106 Medicare Advantage patients with 5 years of continuous medical and pharmacy insurance from 2009 to 2013. Total health care costs in 2013 were predicted with comorbidity indicators from 2009 to 2012. Using 2012 predictors only, OLS performed poorly (e.g., R(2) = 16.3%) compared to penalized linear regression models (R(2) ranging from 16.8 to 16.9%); using 2009–2012 predictors, the gap in prediction performance increased (R(2):15.0% versus 18.0–18.2%). OLS with a reduced set of predictors selected by lasso showed improved performance (R(2) = 16.6% with 2012 predictors, 17.4% with 2009–2012 predictors) relative to OLS without variable selection but still lagged behind the prediction performance of penalized regression. Lasso regression consistently generated prediction ratios closer to 1 across different levels of predicted risk compared to other models. CONCLUSIONS: This study demonstrated the advantages of using transparent and easy-to-interpret penalized linear regression for predicting future health care costs in older adults relative to standard linear regression. Penalized regression showed better performance than OLS in predicting health care costs. Applying penalized regression to longitudinal data increased prediction accuracy. Lasso regression in particular showed superior prediction ratios across low and high levels of predicted risk. Health care insurers, providers and policy makers may benefit from adopting penalized regression such as lasso regression for cost prediction to improve risk adjustment and population health management and thus better address the underlying needs and risk of the populations they serve. Public Library of Science 2019-03-06 /pmc/articles/PMC6402678/ /pubmed/30840682 http://dx.doi.org/10.1371/journal.pone.0213258 Text en © 2019 Kan et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Kan, Hong J.
Kharrazi, Hadi
Chang, Hsien-Yen
Bodycombe, Dave
Lemke, Klaus
Weiner, Jonathan P.
Exploring the use of machine learning for risk adjustment: A comparison of standard and penalized linear regression models in predicting health care costs in older adults
title Exploring the use of machine learning for risk adjustment: A comparison of standard and penalized linear regression models in predicting health care costs in older adults
title_full Exploring the use of machine learning for risk adjustment: A comparison of standard and penalized linear regression models in predicting health care costs in older adults
title_fullStr Exploring the use of machine learning for risk adjustment: A comparison of standard and penalized linear regression models in predicting health care costs in older adults
title_full_unstemmed Exploring the use of machine learning for risk adjustment: A comparison of standard and penalized linear regression models in predicting health care costs in older adults
title_short Exploring the use of machine learning for risk adjustment: A comparison of standard and penalized linear regression models in predicting health care costs in older adults
title_sort exploring the use of machine learning for risk adjustment: a comparison of standard and penalized linear regression models in predicting health care costs in older adults
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6402678/
https://www.ncbi.nlm.nih.gov/pubmed/30840682
http://dx.doi.org/10.1371/journal.pone.0213258
work_keys_str_mv AT kanhongj exploringtheuseofmachinelearningforriskadjustmentacomparisonofstandardandpenalizedlinearregressionmodelsinpredictinghealthcarecostsinolderadults
AT kharrazihadi exploringtheuseofmachinelearningforriskadjustmentacomparisonofstandardandpenalizedlinearregressionmodelsinpredictinghealthcarecostsinolderadults
AT changhsienyen exploringtheuseofmachinelearningforriskadjustmentacomparisonofstandardandpenalizedlinearregressionmodelsinpredictinghealthcarecostsinolderadults
AT bodycombedave exploringtheuseofmachinelearningforriskadjustmentacomparisonofstandardandpenalizedlinearregressionmodelsinpredictinghealthcarecostsinolderadults
AT lemkeklaus exploringtheuseofmachinelearningforriskadjustmentacomparisonofstandardandpenalizedlinearregressionmodelsinpredictinghealthcarecostsinolderadults
AT weinerjonathanp exploringtheuseofmachinelearningforriskadjustmentacomparisonofstandardandpenalizedlinearregressionmodelsinpredictinghealthcarecostsinolderadults