Cargando…

Feature Selection for Health Care Costs Prediction Using Weighted Evidential Regression †

Although many authors have highlighted the importance of predicting people’s health costs to improve healthcare budget management, most of them do not address the frequent need to know the reasons behind this prediction, i.e., knowing the factors that influence this prediction. This knowledge allows...

Descripción completa

Detalles Bibliográficos
Autores principales: Panay, Belisario, Baloian, Nelson, Pino, José A., Peñafiel, Sergio, Sanson, Horacio, Bersano, Nicolas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7472302/
https://www.ncbi.nlm.nih.gov/pubmed/32781680
http://dx.doi.org/10.3390/s20164392
_version_ 1783578957570899968
author Panay, Belisario
Baloian, Nelson
Pino, José A.
Peñafiel, Sergio
Sanson, Horacio
Bersano, Nicolas
author_facet Panay, Belisario
Baloian, Nelson
Pino, José A.
Peñafiel, Sergio
Sanson, Horacio
Bersano, Nicolas
author_sort Panay, Belisario
collection PubMed
description Although many authors have highlighted the importance of predicting people’s health costs to improve healthcare budget management, most of them do not address the frequent need to know the reasons behind this prediction, i.e., knowing the factors that influence this prediction. This knowledge allows avoiding arbitrariness or people’s discrimination. However, many times the black box methods (that is, those that do not allow this analysis, e.g., methods based on deep learning techniques) are more accurate than those that allow an interpretation of the results. For this reason, in this work, we intend to develop a method that can achieve similar returns as those obtained with black box methods for the problem of predicting health costs, but at the same time it allows the interpretation of the results. This interpretable regression method is based on the Dempster-Shafer theory using Evidential Regression (EVREG) and a discount function based on the contribution of each dimension. The method “learns” the optimal weights for each feature using a gradient descent technique. The method also uses the nearest k-neighbor algorithm to accelerate calculations. It is possible to select the most relevant features for predicting a patient’s health care costs using this approach and the transparency of the Evidential Regression model. We can obtain a reason for a prediction with a k-NN approach. We used the Japanese health records at Tsuyama Chuo Hospital to test our method, which included medical examinations, test results, and billing information from 2013 to 2018. We compared our model to methods based on an Artificial Neural Network, Gradient Boosting, Regression Tree and Weighted k-Nearest Neighbors. Our results showed that our transparent model performed like the Artificial Neural Network and Gradient Boosting with an [Formula: see text] of [Formula: see text].
format Online
Article
Text
id pubmed-7472302
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-74723022020-09-04 Feature Selection for Health Care Costs Prediction Using Weighted Evidential Regression † Panay, Belisario Baloian, Nelson Pino, José A. Peñafiel, Sergio Sanson, Horacio Bersano, Nicolas Sensors (Basel) Article Although many authors have highlighted the importance of predicting people’s health costs to improve healthcare budget management, most of them do not address the frequent need to know the reasons behind this prediction, i.e., knowing the factors that influence this prediction. This knowledge allows avoiding arbitrariness or people’s discrimination. However, many times the black box methods (that is, those that do not allow this analysis, e.g., methods based on deep learning techniques) are more accurate than those that allow an interpretation of the results. For this reason, in this work, we intend to develop a method that can achieve similar returns as those obtained with black box methods for the problem of predicting health costs, but at the same time it allows the interpretation of the results. This interpretable regression method is based on the Dempster-Shafer theory using Evidential Regression (EVREG) and a discount function based on the contribution of each dimension. The method “learns” the optimal weights for each feature using a gradient descent technique. The method also uses the nearest k-neighbor algorithm to accelerate calculations. It is possible to select the most relevant features for predicting a patient’s health care costs using this approach and the transparency of the Evidential Regression model. We can obtain a reason for a prediction with a k-NN approach. We used the Japanese health records at Tsuyama Chuo Hospital to test our method, which included medical examinations, test results, and billing information from 2013 to 2018. We compared our model to methods based on an Artificial Neural Network, Gradient Boosting, Regression Tree and Weighted k-Nearest Neighbors. Our results showed that our transparent model performed like the Artificial Neural Network and Gradient Boosting with an [Formula: see text] of [Formula: see text]. MDPI 2020-08-06 /pmc/articles/PMC7472302/ /pubmed/32781680 http://dx.doi.org/10.3390/s20164392 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Panay, Belisario
Baloian, Nelson
Pino, José A.
Peñafiel, Sergio
Sanson, Horacio
Bersano, Nicolas
Feature Selection for Health Care Costs Prediction Using Weighted Evidential Regression †
title Feature Selection for Health Care Costs Prediction Using Weighted Evidential Regression †
title_full Feature Selection for Health Care Costs Prediction Using Weighted Evidential Regression †
title_fullStr Feature Selection for Health Care Costs Prediction Using Weighted Evidential Regression †
title_full_unstemmed Feature Selection for Health Care Costs Prediction Using Weighted Evidential Regression †
title_short Feature Selection for Health Care Costs Prediction Using Weighted Evidential Regression †
title_sort feature selection for health care costs prediction using weighted evidential regression †
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7472302/
https://www.ncbi.nlm.nih.gov/pubmed/32781680
http://dx.doi.org/10.3390/s20164392
work_keys_str_mv AT panaybelisario featureselectionforhealthcarecostspredictionusingweightedevidentialregression
AT baloiannelson featureselectionforhealthcarecostspredictionusingweightedevidentialregression
AT pinojosea featureselectionforhealthcarecostspredictionusingweightedevidentialregression
AT penafielsergio featureselectionforhealthcarecostspredictionusingweightedevidentialregression
AT sansonhoracio featureselectionforhealthcarecostspredictionusingweightedevidentialregression
AT bersanonicolas featureselectionforhealthcarecostspredictionusingweightedevidentialregression