Cargando…
Feature Selection for Health Care Costs Prediction Using Weighted Evidential Regression †
Although many authors have highlighted the importance of predicting people’s health costs to improve healthcare budget management, most of them do not address the frequent need to know the reasons behind this prediction, i.e., knowing the factors that influence this prediction. This knowledge allows...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7472302/ https://www.ncbi.nlm.nih.gov/pubmed/32781680 http://dx.doi.org/10.3390/s20164392 |
_version_ | 1783578957570899968 |
---|---|
author | Panay, Belisario Baloian, Nelson Pino, José A. Peñafiel, Sergio Sanson, Horacio Bersano, Nicolas |
author_facet | Panay, Belisario Baloian, Nelson Pino, José A. Peñafiel, Sergio Sanson, Horacio Bersano, Nicolas |
author_sort | Panay, Belisario |
collection | PubMed |
description | Although many authors have highlighted the importance of predicting people’s health costs to improve healthcare budget management, most of them do not address the frequent need to know the reasons behind this prediction, i.e., knowing the factors that influence this prediction. This knowledge allows avoiding arbitrariness or people’s discrimination. However, many times the black box methods (that is, those that do not allow this analysis, e.g., methods based on deep learning techniques) are more accurate than those that allow an interpretation of the results. For this reason, in this work, we intend to develop a method that can achieve similar returns as those obtained with black box methods for the problem of predicting health costs, but at the same time it allows the interpretation of the results. This interpretable regression method is based on the Dempster-Shafer theory using Evidential Regression (EVREG) and a discount function based on the contribution of each dimension. The method “learns” the optimal weights for each feature using a gradient descent technique. The method also uses the nearest k-neighbor algorithm to accelerate calculations. It is possible to select the most relevant features for predicting a patient’s health care costs using this approach and the transparency of the Evidential Regression model. We can obtain a reason for a prediction with a k-NN approach. We used the Japanese health records at Tsuyama Chuo Hospital to test our method, which included medical examinations, test results, and billing information from 2013 to 2018. We compared our model to methods based on an Artificial Neural Network, Gradient Boosting, Regression Tree and Weighted k-Nearest Neighbors. Our results showed that our transparent model performed like the Artificial Neural Network and Gradient Boosting with an [Formula: see text] of [Formula: see text]. |
format | Online Article Text |
id | pubmed-7472302 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-74723022020-09-04 Feature Selection for Health Care Costs Prediction Using Weighted Evidential Regression † Panay, Belisario Baloian, Nelson Pino, José A. Peñafiel, Sergio Sanson, Horacio Bersano, Nicolas Sensors (Basel) Article Although many authors have highlighted the importance of predicting people’s health costs to improve healthcare budget management, most of them do not address the frequent need to know the reasons behind this prediction, i.e., knowing the factors that influence this prediction. This knowledge allows avoiding arbitrariness or people’s discrimination. However, many times the black box methods (that is, those that do not allow this analysis, e.g., methods based on deep learning techniques) are more accurate than those that allow an interpretation of the results. For this reason, in this work, we intend to develop a method that can achieve similar returns as those obtained with black box methods for the problem of predicting health costs, but at the same time it allows the interpretation of the results. This interpretable regression method is based on the Dempster-Shafer theory using Evidential Regression (EVREG) and a discount function based on the contribution of each dimension. The method “learns” the optimal weights for each feature using a gradient descent technique. The method also uses the nearest k-neighbor algorithm to accelerate calculations. It is possible to select the most relevant features for predicting a patient’s health care costs using this approach and the transparency of the Evidential Regression model. We can obtain a reason for a prediction with a k-NN approach. We used the Japanese health records at Tsuyama Chuo Hospital to test our method, which included medical examinations, test results, and billing information from 2013 to 2018. We compared our model to methods based on an Artificial Neural Network, Gradient Boosting, Regression Tree and Weighted k-Nearest Neighbors. Our results showed that our transparent model performed like the Artificial Neural Network and Gradient Boosting with an [Formula: see text] of [Formula: see text]. MDPI 2020-08-06 /pmc/articles/PMC7472302/ /pubmed/32781680 http://dx.doi.org/10.3390/s20164392 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Panay, Belisario Baloian, Nelson Pino, José A. Peñafiel, Sergio Sanson, Horacio Bersano, Nicolas Feature Selection for Health Care Costs Prediction Using Weighted Evidential Regression † |
title | Feature Selection for Health Care Costs Prediction Using Weighted Evidential Regression † |
title_full | Feature Selection for Health Care Costs Prediction Using Weighted Evidential Regression † |
title_fullStr | Feature Selection for Health Care Costs Prediction Using Weighted Evidential Regression † |
title_full_unstemmed | Feature Selection for Health Care Costs Prediction Using Weighted Evidential Regression † |
title_short | Feature Selection for Health Care Costs Prediction Using Weighted Evidential Regression † |
title_sort | feature selection for health care costs prediction using weighted evidential regression † |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7472302/ https://www.ncbi.nlm.nih.gov/pubmed/32781680 http://dx.doi.org/10.3390/s20164392 |
work_keys_str_mv | AT panaybelisario featureselectionforhealthcarecostspredictionusingweightedevidentialregression AT baloiannelson featureselectionforhealthcarecostspredictionusingweightedevidentialregression AT pinojosea featureselectionforhealthcarecostspredictionusingweightedevidentialregression AT penafielsergio featureselectionforhealthcarecostspredictionusingweightedevidentialregression AT sansonhoracio featureselectionforhealthcarecostspredictionusingweightedevidentialregression AT bersanonicolas featureselectionforhealthcarecostspredictionusingweightedevidentialregression |