Cargando…

Importance of medical data preprocessing in predictive modeling and risk factor discovery for the frailty syndrome

BACKGROUND: Increasing life expectancy results in more elderly people struggling with age related diseases and functional conditions. This poses huge challenges towards establishing new approaches for maintaining health at a higher age. An important aspect for age related deterioration of the genera...

Descripción completa

Detalles Bibliográficos
Autores principales: Hassler, Andreas Philipp, Menasalvas, Ernestina, García-García, Francisco José, Rodríguez-Mañas, Leocadio, Holzinger, Andreas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6483150/
https://www.ncbi.nlm.nih.gov/pubmed/30777059
http://dx.doi.org/10.1186/s12911-019-0747-6
_version_ 1783413983323095040
author Hassler, Andreas Philipp
Menasalvas, Ernestina
García-García, Francisco José
Rodríguez-Mañas, Leocadio
Holzinger, Andreas
author_facet Hassler, Andreas Philipp
Menasalvas, Ernestina
García-García, Francisco José
Rodríguez-Mañas, Leocadio
Holzinger, Andreas
author_sort Hassler, Andreas Philipp
collection PubMed
description BACKGROUND: Increasing life expectancy results in more elderly people struggling with age related diseases and functional conditions. This poses huge challenges towards establishing new approaches for maintaining health at a higher age. An important aspect for age related deterioration of the general patient condition is frailty. The frailty syndrome is associated with a high risk for falls, hospitalization, disability, and finally increased mortality. Using predictive data mining enables the discovery of potential risk factors and can be used as clinical decision support system, which provides the medical doctor with information on the probable clinical patient outcome. This enables the professional to react promptly and to avert likely adverse events in advance. METHODS: Medical data of 474 study participants containing 284 health related parameters, including questionnaire answers, blood parameters and vital parameters from the Toledo Study for Healthy Aging (TSHA) was used. Binary classification models were built in order to distinguish between frail and non-frail study subjects. RESULTS: Using the available TSHA data and the discovered potential predictors, it was possible to design, develop and evaluate a variety of different predictive models for the frailty syndrome. The best performing model was the support vector machine (SVM, 78.31%). Moreover, a methodology was developed, making it possible to explore and to use incomplete medical data and further identify potential predictors and enable interpretability. CONCLUSIONS: This work demonstrates that it is feasible to use incomplete, imbalanced medical data for the development of a predictive model for the frailty syndrome. Moreover, potential predictive factors have been discovered, which were clinically approved by the clinicians. Future work will improve prediction accuracy, especially with regard to separating the group of frail patients into frail and pre-frail ones and analyze the differences among them. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12911-019-0747-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6483150
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-64831502019-05-02 Importance of medical data preprocessing in predictive modeling and risk factor discovery for the frailty syndrome Hassler, Andreas Philipp Menasalvas, Ernestina García-García, Francisco José Rodríguez-Mañas, Leocadio Holzinger, Andreas BMC Med Inform Decis Mak Research Article BACKGROUND: Increasing life expectancy results in more elderly people struggling with age related diseases and functional conditions. This poses huge challenges towards establishing new approaches for maintaining health at a higher age. An important aspect for age related deterioration of the general patient condition is frailty. The frailty syndrome is associated with a high risk for falls, hospitalization, disability, and finally increased mortality. Using predictive data mining enables the discovery of potential risk factors and can be used as clinical decision support system, which provides the medical doctor with information on the probable clinical patient outcome. This enables the professional to react promptly and to avert likely adverse events in advance. METHODS: Medical data of 474 study participants containing 284 health related parameters, including questionnaire answers, blood parameters and vital parameters from the Toledo Study for Healthy Aging (TSHA) was used. Binary classification models were built in order to distinguish between frail and non-frail study subjects. RESULTS: Using the available TSHA data and the discovered potential predictors, it was possible to design, develop and evaluate a variety of different predictive models for the frailty syndrome. The best performing model was the support vector machine (SVM, 78.31%). Moreover, a methodology was developed, making it possible to explore and to use incomplete medical data and further identify potential predictors and enable interpretability. CONCLUSIONS: This work demonstrates that it is feasible to use incomplete, imbalanced medical data for the development of a predictive model for the frailty syndrome. Moreover, potential predictive factors have been discovered, which were clinically approved by the clinicians. Future work will improve prediction accuracy, especially with regard to separating the group of frail patients into frail and pre-frail ones and analyze the differences among them. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12911-019-0747-6) contains supplementary material, which is available to authorized users. BioMed Central 2019-02-18 /pmc/articles/PMC6483150/ /pubmed/30777059 http://dx.doi.org/10.1186/s12911-019-0747-6 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Hassler, Andreas Philipp
Menasalvas, Ernestina
García-García, Francisco José
Rodríguez-Mañas, Leocadio
Holzinger, Andreas
Importance of medical data preprocessing in predictive modeling and risk factor discovery for the frailty syndrome
title Importance of medical data preprocessing in predictive modeling and risk factor discovery for the frailty syndrome
title_full Importance of medical data preprocessing in predictive modeling and risk factor discovery for the frailty syndrome
title_fullStr Importance of medical data preprocessing in predictive modeling and risk factor discovery for the frailty syndrome
title_full_unstemmed Importance of medical data preprocessing in predictive modeling and risk factor discovery for the frailty syndrome
title_short Importance of medical data preprocessing in predictive modeling and risk factor discovery for the frailty syndrome
title_sort importance of medical data preprocessing in predictive modeling and risk factor discovery for the frailty syndrome
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6483150/
https://www.ncbi.nlm.nih.gov/pubmed/30777059
http://dx.doi.org/10.1186/s12911-019-0747-6
work_keys_str_mv AT hasslerandreasphilipp importanceofmedicaldatapreprocessinginpredictivemodelingandriskfactordiscoveryforthefrailtysyndrome
AT menasalvasernestina importanceofmedicaldatapreprocessinginpredictivemodelingandriskfactordiscoveryforthefrailtysyndrome
AT garciagarciafranciscojose importanceofmedicaldatapreprocessinginpredictivemodelingandriskfactordiscoveryforthefrailtysyndrome
AT rodriguezmanasleocadio importanceofmedicaldatapreprocessinginpredictivemodelingandriskfactordiscoveryforthefrailtysyndrome
AT holzingerandreas importanceofmedicaldatapreprocessinginpredictivemodelingandriskfactordiscoveryforthefrailtysyndrome