Cargando…

Application of a population-based severity scoring system to individual patients results in frequent misclassification

INTRODUCTION: APACHE II (AP2) was developed to allow a systematic examination of intensive care unit outcomes in a risk adjusted manner. AP2 has been widely adopted in clinical trials to assure broad consistency amongst different groups. Although errors in calculating the true AP2 score may not be r...

Descripción completa

Detalles Bibliográficos
Autores principales: Booth, Frank V, Short, Mary, Shorr, Andrew F, Arkins, Nancy, Bates, Becky, Qualy, Rebecca L, Levy, Howard
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1297617/
https://www.ncbi.nlm.nih.gov/pubmed/16277714
http://dx.doi.org/10.1186/cc3790
_version_ 1782126232477892608
author Booth, Frank V
Short, Mary
Shorr, Andrew F
Arkins, Nancy
Bates, Becky
Qualy, Rebecca L
Levy, Howard
author_facet Booth, Frank V
Short, Mary
Shorr, Andrew F
Arkins, Nancy
Bates, Becky
Qualy, Rebecca L
Levy, Howard
author_sort Booth, Frank V
collection PubMed
description INTRODUCTION: APACHE II (AP2) was developed to allow a systematic examination of intensive care unit outcomes in a risk adjusted manner. AP2 has been widely adopted in clinical trials to assure broad consistency amongst different groups. Although errors in calculating the true AP2 score may not be reducible below 15%, the self-canceling effect of random errors reduces the importance of such errors when applied to large populations. It has been suggested that a threshold AP2 score be used in clinical decision making for individual patients. This study reports the AP2 scoring errors of researchers involved in a large sepsis trial and models the consequences of such an error rate for individual severe sepsis patients. METHODS: Fifty-six researchers with explicit training in data abstraction and completion of the AP2 score received scenarios consisting of composites of real patient histories. Descriptive statistics were calculated for each scenario. The standard deviations were calculated compared with an adjudicated score. Intraclass correlations for inter-observer reliability were performed using Shrout-Fleiss methodology. Theoretical distribution curves were calculated for a broad range of AP2 scores using standard deviations of 6, 9 and 12. For each curve, the misclassification rate was determined using an AP2 score cut-off of ≥25. The percentage of misclassifications for each true AP2 score was then applied to the corresponding AP2 score obtained from the PROGRESS severe sepsis registry. RESULTS: The error rate for the total AP2 score was 86% (individual variables were in the range 10% to 87%). Intraclass correlation for the inter-observer reliability was 0.51. Of the patients from the PROGRESS registry. 50% had AP2 scores in the range 17 to 28. Within this interquartile range, 70% to 85% of all misclassified patients would reside. CONCLUSION: It is more likely that an individual patient will be scored incorrectly than correctly. The data obtained from the scenarios indicated that as the true AP2 score approached an arbitrary cut-off point of 25, the observed misclassification rate increased. Integrating our study of AP2 score errors with the published literature leads us to conclude that the AP2 is an inappropriate sole tool for resource allocation decisions for individual patients.
format Text
id pubmed-1297617
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-12976172005-12-01 Application of a population-based severity scoring system to individual patients results in frequent misclassification Booth, Frank V Short, Mary Shorr, Andrew F Arkins, Nancy Bates, Becky Qualy, Rebecca L Levy, Howard Crit Care Research INTRODUCTION: APACHE II (AP2) was developed to allow a systematic examination of intensive care unit outcomes in a risk adjusted manner. AP2 has been widely adopted in clinical trials to assure broad consistency amongst different groups. Although errors in calculating the true AP2 score may not be reducible below 15%, the self-canceling effect of random errors reduces the importance of such errors when applied to large populations. It has been suggested that a threshold AP2 score be used in clinical decision making for individual patients. This study reports the AP2 scoring errors of researchers involved in a large sepsis trial and models the consequences of such an error rate for individual severe sepsis patients. METHODS: Fifty-six researchers with explicit training in data abstraction and completion of the AP2 score received scenarios consisting of composites of real patient histories. Descriptive statistics were calculated for each scenario. The standard deviations were calculated compared with an adjudicated score. Intraclass correlations for inter-observer reliability were performed using Shrout-Fleiss methodology. Theoretical distribution curves were calculated for a broad range of AP2 scores using standard deviations of 6, 9 and 12. For each curve, the misclassification rate was determined using an AP2 score cut-off of ≥25. The percentage of misclassifications for each true AP2 score was then applied to the corresponding AP2 score obtained from the PROGRESS severe sepsis registry. RESULTS: The error rate for the total AP2 score was 86% (individual variables were in the range 10% to 87%). Intraclass correlation for the inter-observer reliability was 0.51. Of the patients from the PROGRESS registry. 50% had AP2 scores in the range 17 to 28. Within this interquartile range, 70% to 85% of all misclassified patients would reside. CONCLUSION: It is more likely that an individual patient will be scored incorrectly than correctly. The data obtained from the scenarios indicated that as the true AP2 score approached an arbitrary cut-off point of 25, the observed misclassification rate increased. Integrating our study of AP2 score errors with the published literature leads us to conclude that the AP2 is an inappropriate sole tool for resource allocation decisions for individual patients. BioMed Central 2005 2005-08-09 /pmc/articles/PMC1297617/ /pubmed/16277714 http://dx.doi.org/10.1186/cc3790 Text en Copyright © 2005 Booth et al.; licensee BioMed Central Ltd.
spellingShingle Research
Booth, Frank V
Short, Mary
Shorr, Andrew F
Arkins, Nancy
Bates, Becky
Qualy, Rebecca L
Levy, Howard
Application of a population-based severity scoring system to individual patients results in frequent misclassification
title Application of a population-based severity scoring system to individual patients results in frequent misclassification
title_full Application of a population-based severity scoring system to individual patients results in frequent misclassification
title_fullStr Application of a population-based severity scoring system to individual patients results in frequent misclassification
title_full_unstemmed Application of a population-based severity scoring system to individual patients results in frequent misclassification
title_short Application of a population-based severity scoring system to individual patients results in frequent misclassification
title_sort application of a population-based severity scoring system to individual patients results in frequent misclassification
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1297617/
https://www.ncbi.nlm.nih.gov/pubmed/16277714
http://dx.doi.org/10.1186/cc3790
work_keys_str_mv AT boothfrankv applicationofapopulationbasedseverityscoringsystemtoindividualpatientsresultsinfrequentmisclassification
AT shortmary applicationofapopulationbasedseverityscoringsystemtoindividualpatientsresultsinfrequentmisclassification
AT shorrandrewf applicationofapopulationbasedseverityscoringsystemtoindividualpatientsresultsinfrequentmisclassification
AT arkinsnancy applicationofapopulationbasedseverityscoringsystemtoindividualpatientsresultsinfrequentmisclassification
AT batesbecky applicationofapopulationbasedseverityscoringsystemtoindividualpatientsresultsinfrequentmisclassification
AT qualyrebeccal applicationofapopulationbasedseverityscoringsystemtoindividualpatientsresultsinfrequentmisclassification
AT levyhoward applicationofapopulationbasedseverityscoringsystemtoindividualpatientsresultsinfrequentmisclassification