Cargando…
Determine the therapeutic role of radiotherapy in administrative data: a data mining approach
BACKGROUND: Clinical data gathered for administrative purposes often lack sufficient information to separate the records of radiotherapy given for palliation from those given for cure. An absence, incompleteness, or inaccuracy of such information could hinder or bias the study of the utilization and...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4350984/ https://www.ncbi.nlm.nih.gov/pubmed/25649372 http://dx.doi.org/10.1186/1471-2288-15-11 |
_version_ | 1782360266609000448 |
---|---|
author | Zhang-Salomons, Jina Salomons, Greg |
author_facet | Zhang-Salomons, Jina Salomons, Greg |
author_sort | Zhang-Salomons, Jina |
collection | PubMed |
description | BACKGROUND: Clinical data gathered for administrative purposes often lack sufficient information to separate the records of radiotherapy given for palliation from those given for cure. An absence, incompleteness, or inaccuracy of such information could hinder or bias the study of the utilization and outcome of radiotherapy. This study has three specific purposes: 1) develop a method to determine the therapeutic role of radiotherapy (TRR); 2) assess the accuracy of the method; 3) report the quality of the information on treatment “intent” recorded in the clinical data in Ontario, Canada. A general purpose is to use this study as a prototype to demonstrate and test a method to assess the quality of administrative data. METHODS: This is a population based retrospective study. A random sample was drawn from the treatment records with “intent” assigned in treating hospitals. A decision tree is grown using treatment parameters as predictors and “intent” as outcome variable to classify the treatments into curative or palliative. The tree classifier was applied to the entire dataset, and the classification results were compared with those identified by “intent”. A manual audit was conducted to assess the accuracy of the classification. RESULTS: The following parameters predicted the TRR, from the strongest to the weakest: radiation dose per fraction, treated body-region, disease site, and time of treatment. When applied to the records of treatments given between 1990 and 2008 in Ontario, Canada, the classification rules correctly classified 96.1% of the records. The quality of the “intent” variable was as follows: 77.5% correctly classified, 3.7% misclassified, and 18.8% did not have an “intent” assigned. CONCLUSIONS: The classification rules derived in this study can be used to determine the TRR when such information is unavailable, incomplete, or inaccurate in administrative data. The study demonstrates that data mining approach can be used to effectively assess and improve the quality of large administrative datasets. |
format | Online Article Text |
id | pubmed-4350984 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-43509842015-03-06 Determine the therapeutic role of radiotherapy in administrative data: a data mining approach Zhang-Salomons, Jina Salomons, Greg BMC Med Res Methodol Research Article BACKGROUND: Clinical data gathered for administrative purposes often lack sufficient information to separate the records of radiotherapy given for palliation from those given for cure. An absence, incompleteness, or inaccuracy of such information could hinder or bias the study of the utilization and outcome of radiotherapy. This study has three specific purposes: 1) develop a method to determine the therapeutic role of radiotherapy (TRR); 2) assess the accuracy of the method; 3) report the quality of the information on treatment “intent” recorded in the clinical data in Ontario, Canada. A general purpose is to use this study as a prototype to demonstrate and test a method to assess the quality of administrative data. METHODS: This is a population based retrospective study. A random sample was drawn from the treatment records with “intent” assigned in treating hospitals. A decision tree is grown using treatment parameters as predictors and “intent” as outcome variable to classify the treatments into curative or palliative. The tree classifier was applied to the entire dataset, and the classification results were compared with those identified by “intent”. A manual audit was conducted to assess the accuracy of the classification. RESULTS: The following parameters predicted the TRR, from the strongest to the weakest: radiation dose per fraction, treated body-region, disease site, and time of treatment. When applied to the records of treatments given between 1990 and 2008 in Ontario, Canada, the classification rules correctly classified 96.1% of the records. The quality of the “intent” variable was as follows: 77.5% correctly classified, 3.7% misclassified, and 18.8% did not have an “intent” assigned. CONCLUSIONS: The classification rules derived in this study can be used to determine the TRR when such information is unavailable, incomplete, or inaccurate in administrative data. The study demonstrates that data mining approach can be used to effectively assess and improve the quality of large administrative datasets. BioMed Central 2015-02-03 /pmc/articles/PMC4350984/ /pubmed/25649372 http://dx.doi.org/10.1186/1471-2288-15-11 Text en © Zhang-Salomons and Salomons; licensee BioMed Central. 2015 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Zhang-Salomons, Jina Salomons, Greg Determine the therapeutic role of radiotherapy in administrative data: a data mining approach |
title | Determine the therapeutic role of radiotherapy in administrative data: a data mining approach |
title_full | Determine the therapeutic role of radiotherapy in administrative data: a data mining approach |
title_fullStr | Determine the therapeutic role of radiotherapy in administrative data: a data mining approach |
title_full_unstemmed | Determine the therapeutic role of radiotherapy in administrative data: a data mining approach |
title_short | Determine the therapeutic role of radiotherapy in administrative data: a data mining approach |
title_sort | determine the therapeutic role of radiotherapy in administrative data: a data mining approach |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4350984/ https://www.ncbi.nlm.nih.gov/pubmed/25649372 http://dx.doi.org/10.1186/1471-2288-15-11 |
work_keys_str_mv | AT zhangsalomonsjina determinethetherapeuticroleofradiotherapyinadministrativedataadataminingapproach AT salomonsgreg determinethetherapeuticroleofradiotherapyinadministrativedataadataminingapproach |