Cargando…

Learning temporal weights of clinical events using variable importance

BACKGROUND: Longitudinal data sources, such as electronic health records (EHRs), are very valuable for monitoring adverse drug events (ADEs). However, ADEs are heavily under-reported in EHRs. Using machine learning algorithms to automatically detect patients that should have had ADEs reported in the...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhao, Jing, Henriksson, Aron
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2016
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4965710/ https://www.ncbi.nlm.nih.gov/pubmed/27459993 http://dx.doi.org/10.1186/s12911-016-0311-6

_version_	1782445299488260096
author	Zhao, Jing Henriksson, Aron
author_facet	Zhao, Jing Henriksson, Aron
author_sort	Zhao, Jing
collection	PubMed
description	BACKGROUND: Longitudinal data sources, such as electronic health records (EHRs), are very valuable for monitoring adverse drug events (ADEs). However, ADEs are heavily under-reported in EHRs. Using machine learning algorithms to automatically detect patients that should have had ADEs reported in their health records is an efficient and effective solution. One of the challenges to that end is how to take into account the temporality of clinical events, which are time stamped in EHRs, and providing these as features for machine learning algorithms to exploit. Previous research on this topic suggests that representing EHR data as a bag of temporally weighted clinical events is promising; however, the weights were in that case pre-assigned according to their time stamps, which is limited and potentially less accurate. This study therefore focuses on how to learn weights that effectively take into account the temporality and importance of clinical events for ADE detection. METHODS: Variable importance obtained from the random forest learning algorithm is used for extracting temporal weights. Two strategies are proposed for applying the learned weights: weighted aggregation and weighted sampling. The first strategy aggregates the weighted clinical events from different time windows to form new features; the second strategy retains the original features but samples them by using their weights as probabilities when building each tree in the forest. The predictive performance of random forest models using the learned weights with the two strategies is compared to using pre-assigned weights. In addition, to assess the sensitivity of the weight-learning procedure, weights from different granularity levels are evaluated and compared. RESULTS: In the weighted sampling strategy, using learned weights significantly improves the predictive performance, in comparison to using pre-assigned weights; however, there is no significant difference between them in the weighted aggregation strategy. Moreover, the granularity of the weight learning procedure has a significant impact on the former, but not on the latter. CONCLUSIONS: Learning temporal weights is significantly beneficial in terms of predictive performance with the weighted sampling strategy. Moreover, weighted aggregation generally diminishes the impact of temporal weighting of the clinical events, irrespective of whether the weights are pre-assigned or learned.
format	Online Article Text
id	pubmed-4965710
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-49657102016-08-02 Learning temporal weights of clinical events using variable importance Zhao, Jing Henriksson, Aron BMC Med Inform Decis Mak Research BACKGROUND: Longitudinal data sources, such as electronic health records (EHRs), are very valuable for monitoring adverse drug events (ADEs). However, ADEs are heavily under-reported in EHRs. Using machine learning algorithms to automatically detect patients that should have had ADEs reported in their health records is an efficient and effective solution. One of the challenges to that end is how to take into account the temporality of clinical events, which are time stamped in EHRs, and providing these as features for machine learning algorithms to exploit. Previous research on this topic suggests that representing EHR data as a bag of temporally weighted clinical events is promising; however, the weights were in that case pre-assigned according to their time stamps, which is limited and potentially less accurate. This study therefore focuses on how to learn weights that effectively take into account the temporality and importance of clinical events for ADE detection. METHODS: Variable importance obtained from the random forest learning algorithm is used for extracting temporal weights. Two strategies are proposed for applying the learned weights: weighted aggregation and weighted sampling. The first strategy aggregates the weighted clinical events from different time windows to form new features; the second strategy retains the original features but samples them by using their weights as probabilities when building each tree in the forest. The predictive performance of random forest models using the learned weights with the two strategies is compared to using pre-assigned weights. In addition, to assess the sensitivity of the weight-learning procedure, weights from different granularity levels are evaluated and compared. RESULTS: In the weighted sampling strategy, using learned weights significantly improves the predictive performance, in comparison to using pre-assigned weights; however, there is no significant difference between them in the weighted aggregation strategy. Moreover, the granularity of the weight learning procedure has a significant impact on the former, but not on the latter. CONCLUSIONS: Learning temporal weights is significantly beneficial in terms of predictive performance with the weighted sampling strategy. Moreover, weighted aggregation generally diminishes the impact of temporal weighting of the clinical events, irrespective of whether the weights are pre-assigned or learned. BioMed Central 2016-07-21 /pmc/articles/PMC4965710/ /pubmed/27459993 http://dx.doi.org/10.1186/s12911-016-0311-6 Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Zhao, Jing Henriksson, Aron Learning temporal weights of clinical events using variable importance
title	Learning temporal weights of clinical events using variable importance
title_full	Learning temporal weights of clinical events using variable importance
title_fullStr	Learning temporal weights of clinical events using variable importance
title_full_unstemmed	Learning temporal weights of clinical events using variable importance
title_short	Learning temporal weights of clinical events using variable importance
title_sort	learning temporal weights of clinical events using variable importance
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4965710/ https://www.ncbi.nlm.nih.gov/pubmed/27459993 http://dx.doi.org/10.1186/s12911-016-0311-6
work_keys_str_mv	AT zhaojing learningtemporalweightsofclinicaleventsusingvariableimportance AT henrikssonaron learningtemporalweightsofclinicaleventsusingvariableimportance

Learning temporal weights of clinical events using variable importance

Ejemplares similares