Cargando…

Methods for a similarity measure for clinical attributes based on survival data analysis

BACKGROUND: Case-based reasoning is a proven method that relies on learned cases from the past for decision support of a new case. The accuracy of such a system depends on the applied similarity measure, which quantifies the similarity between two cases. This work proposes a collection of methods fo...

Descripción completa

Detalles Bibliográficos
Autores principales:	Karmen, Christian, Gietzelt, Matthias, Knaup-Gregori, Petra, Ganzinger, Matthias
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2019
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6805472/ https://www.ncbi.nlm.nih.gov/pubmed/31638963 http://dx.doi.org/10.1186/s12911-019-0917-6

_version_	1783461392838295552
author	Karmen, Christian Gietzelt, Matthias Knaup-Gregori, Petra Ganzinger, Matthias
author_facet	Karmen, Christian Gietzelt, Matthias Knaup-Gregori, Petra Ganzinger, Matthias
author_sort	Karmen, Christian
collection	PubMed
description	BACKGROUND: Case-based reasoning is a proven method that relies on learned cases from the past for decision support of a new case. The accuracy of such a system depends on the applied similarity measure, which quantifies the similarity between two cases. This work proposes a collection of methods for similarity measures especially for comparison of clinical cases based on survival data, as they are available for example from clinical trials. METHODS: Our approach is intended to be used in scenarios, where it is of interest to use longitudinal data, such as survival data, for a case-based reasoning approach. This might be especially important, where uncertainty about the ideal therapy decision exists. The collection of methods consists of definitions of the local similarity of nominal as well as numeric attributes, a calculation of attribute weights, a feature selection method and finally a global similarity measure. All of them use survival time (consisting of survival status and overall survival) as a reference of similarity. As a baseline, we calculate a survival function for each value of any given clinical attribute. RESULTS: We define the similarity between values of the same attribute by putting the estimated survival functions in relation to each other. Finally, we quantify the similarity by determining the area between corresponding curves of survival functions. The proposed global similarity measure is designed especially for cases from randomized clinical trials or other collections of clinical data with survival information. Overall survival can be considered as an eligible and alternative solution for similarity calculations. It is especially useful, when similarity measures that depend on the classic solution-describing attribute “applied therapy” are not applicable. This is often the case for data from clinical trials containing randomized arms. CONCLUSIONS: In silico evaluation scenarios showed that the mean accuracy of biomarker detection in k = 10 most similar cases is higher (0.909–0.998) than for competing similarity measures, such as Heterogeneous Euclidian-Overlap Metric (0.657–0.831) and Discretized Value Difference Metric (0.535–0.671). The weight calculation method showed a more than six times (6.59–6.95) higher weight for biomarker attributes over non-biomarker attributes. These results suggest that the similarity measure described here is suitable for applications based on survival data.
format	Online Article Text
id	pubmed-6805472
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-68054722019-10-24 Methods for a similarity measure for clinical attributes based on survival data analysis Karmen, Christian Gietzelt, Matthias Knaup-Gregori, Petra Ganzinger, Matthias BMC Med Inform Decis Mak Research Article BACKGROUND: Case-based reasoning is a proven method that relies on learned cases from the past for decision support of a new case. The accuracy of such a system depends on the applied similarity measure, which quantifies the similarity between two cases. This work proposes a collection of methods for similarity measures especially for comparison of clinical cases based on survival data, as they are available for example from clinical trials. METHODS: Our approach is intended to be used in scenarios, where it is of interest to use longitudinal data, such as survival data, for a case-based reasoning approach. This might be especially important, where uncertainty about the ideal therapy decision exists. The collection of methods consists of definitions of the local similarity of nominal as well as numeric attributes, a calculation of attribute weights, a feature selection method and finally a global similarity measure. All of them use survival time (consisting of survival status and overall survival) as a reference of similarity. As a baseline, we calculate a survival function for each value of any given clinical attribute. RESULTS: We define the similarity between values of the same attribute by putting the estimated survival functions in relation to each other. Finally, we quantify the similarity by determining the area between corresponding curves of survival functions. The proposed global similarity measure is designed especially for cases from randomized clinical trials or other collections of clinical data with survival information. Overall survival can be considered as an eligible and alternative solution for similarity calculations. It is especially useful, when similarity measures that depend on the classic solution-describing attribute “applied therapy” are not applicable. This is often the case for data from clinical trials containing randomized arms. CONCLUSIONS: In silico evaluation scenarios showed that the mean accuracy of biomarker detection in k = 10 most similar cases is higher (0.909–0.998) than for competing similarity measures, such as Heterogeneous Euclidian-Overlap Metric (0.657–0.831) and Discretized Value Difference Metric (0.535–0.671). The weight calculation method showed a more than six times (6.59–6.95) higher weight for biomarker attributes over non-biomarker attributes. These results suggest that the similarity measure described here is suitable for applications based on survival data. BioMed Central 2019-10-21 /pmc/articles/PMC6805472/ /pubmed/31638963 http://dx.doi.org/10.1186/s12911-019-0917-6 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Article Karmen, Christian Gietzelt, Matthias Knaup-Gregori, Petra Ganzinger, Matthias Methods for a similarity measure for clinical attributes based on survival data analysis
title	Methods for a similarity measure for clinical attributes based on survival data analysis
title_full	Methods for a similarity measure for clinical attributes based on survival data analysis
title_fullStr	Methods for a similarity measure for clinical attributes based on survival data analysis
title_full_unstemmed	Methods for a similarity measure for clinical attributes based on survival data analysis
title_short	Methods for a similarity measure for clinical attributes based on survival data analysis
title_sort	methods for a similarity measure for clinical attributes based on survival data analysis
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6805472/ https://www.ncbi.nlm.nih.gov/pubmed/31638963 http://dx.doi.org/10.1186/s12911-019-0917-6
work_keys_str_mv	AT karmenchristian methodsforasimilaritymeasureforclinicalattributesbasedonsurvivaldataanalysis AT gietzeltmatthias methodsforasimilaritymeasureforclinicalattributesbasedonsurvivaldataanalysis AT knaupgregoripetra methodsforasimilaritymeasureforclinicalattributesbasedonsurvivaldataanalysis AT ganzingermatthias methodsforasimilaritymeasureforclinicalattributesbasedonsurvivaldataanalysis

Methods for a similarity measure for clinical attributes based on survival data analysis

Ejemplares similares