Cargando…
An item response theory evaluation of three depression assessment instruments in a clinical sample
BACKGROUND: This study investigates whether an analysis, based on Item Response Theory (IRT), can be used for initial evaluations of depression assessment instruments in a limited patient sample from an affective disorder outpatient clinic, with the aim to finding major advantages and deficiencies o...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3599629/ https://www.ncbi.nlm.nih.gov/pubmed/22721257 http://dx.doi.org/10.1186/1471-2288-12-84 |
_version_ | 1782263006890033152 |
---|---|
author | Adler, Mats Hetta, Jerker Isacsson, Göran Brodin, Ulf |
author_facet | Adler, Mats Hetta, Jerker Isacsson, Göran Brodin, Ulf |
author_sort | Adler, Mats |
collection | PubMed |
description | BACKGROUND: This study investigates whether an analysis, based on Item Response Theory (IRT), can be used for initial evaluations of depression assessment instruments in a limited patient sample from an affective disorder outpatient clinic, with the aim to finding major advantages and deficiencies of the instruments. METHODS: Three depression assessment instruments, the depression module from the Patient Health Questionnaire (PHQ9), the depression subscale of Affective Self Rating Scale (AS-18-D) and the Montgomery-Åsberg Depression Rating Scale (MADRS) were evaluated in a sample of 61 patients with affective disorder diagnoses, mainly bipolar disorder. A ‘3- step IRT strategy’ was used. RESULTS: In a first step, the Mokken non-parametric analysis showed that PHQ9 and AS-18-D had strong overall scalabilities of 0.510 [C.I. 0.42, 0.61] and 0,513 [C.I. 0.41, 0.63] respectively, while MADRS had a weak scalability of 0.339 [C.I. 0.25, 0.43]. In a second step, a Rasch model analysis indicated large differences concerning the item discriminating capacity and was therefore considered not suitable for the data. In third step, applying a more flexible two parameter model, all three instruments showed large differences in item information and items had a low capacity to reliably measure respondents at low levels of depression severity. CONCLUSIONS: We conclude that a stepwise IRT-approach, as performed in this study, is a suitable tool for studying assessment instruments at early stages of development. Such an analysis can give useful information, even in small samples, in order to construct more precise measurements or to evaluate existing assessment instruments. The study suggests that the PHQ9 and AS-18-D can be useful for measurement of depression severity in an outpatient clinic for affective disorder, while the MADRS shows weak measurement properties for this type of patients. |
format | Online Article Text |
id | pubmed-3599629 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-35996292013-03-17 An item response theory evaluation of three depression assessment instruments in a clinical sample Adler, Mats Hetta, Jerker Isacsson, Göran Brodin, Ulf BMC Med Res Methodol Research Article BACKGROUND: This study investigates whether an analysis, based on Item Response Theory (IRT), can be used for initial evaluations of depression assessment instruments in a limited patient sample from an affective disorder outpatient clinic, with the aim to finding major advantages and deficiencies of the instruments. METHODS: Three depression assessment instruments, the depression module from the Patient Health Questionnaire (PHQ9), the depression subscale of Affective Self Rating Scale (AS-18-D) and the Montgomery-Åsberg Depression Rating Scale (MADRS) were evaluated in a sample of 61 patients with affective disorder diagnoses, mainly bipolar disorder. A ‘3- step IRT strategy’ was used. RESULTS: In a first step, the Mokken non-parametric analysis showed that PHQ9 and AS-18-D had strong overall scalabilities of 0.510 [C.I. 0.42, 0.61] and 0,513 [C.I. 0.41, 0.63] respectively, while MADRS had a weak scalability of 0.339 [C.I. 0.25, 0.43]. In a second step, a Rasch model analysis indicated large differences concerning the item discriminating capacity and was therefore considered not suitable for the data. In third step, applying a more flexible two parameter model, all three instruments showed large differences in item information and items had a low capacity to reliably measure respondents at low levels of depression severity. CONCLUSIONS: We conclude that a stepwise IRT-approach, as performed in this study, is a suitable tool for studying assessment instruments at early stages of development. Such an analysis can give useful information, even in small samples, in order to construct more precise measurements or to evaluate existing assessment instruments. The study suggests that the PHQ9 and AS-18-D can be useful for measurement of depression severity in an outpatient clinic for affective disorder, while the MADRS shows weak measurement properties for this type of patients. BioMed Central 2012-06-21 /pmc/articles/PMC3599629/ /pubmed/22721257 http://dx.doi.org/10.1186/1471-2288-12-84 Text en Copyright ©2012 Adler et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Adler, Mats Hetta, Jerker Isacsson, Göran Brodin, Ulf An item response theory evaluation of three depression assessment instruments in a clinical sample |
title | An item response theory evaluation of three depression assessment instruments in a clinical sample |
title_full | An item response theory evaluation of three depression assessment instruments in a clinical sample |
title_fullStr | An item response theory evaluation of three depression assessment instruments in a clinical sample |
title_full_unstemmed | An item response theory evaluation of three depression assessment instruments in a clinical sample |
title_short | An item response theory evaluation of three depression assessment instruments in a clinical sample |
title_sort | item response theory evaluation of three depression assessment instruments in a clinical sample |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3599629/ https://www.ncbi.nlm.nih.gov/pubmed/22721257 http://dx.doi.org/10.1186/1471-2288-12-84 |
work_keys_str_mv | AT adlermats anitemresponsetheoryevaluationofthreedepressionassessmentinstrumentsinaclinicalsample AT hettajerker anitemresponsetheoryevaluationofthreedepressionassessmentinstrumentsinaclinicalsample AT isacssongoran anitemresponsetheoryevaluationofthreedepressionassessmentinstrumentsinaclinicalsample AT brodinulf anitemresponsetheoryevaluationofthreedepressionassessmentinstrumentsinaclinicalsample AT adlermats itemresponsetheoryevaluationofthreedepressionassessmentinstrumentsinaclinicalsample AT hettajerker itemresponsetheoryevaluationofthreedepressionassessmentinstrumentsinaclinicalsample AT isacssongoran itemresponsetheoryevaluationofthreedepressionassessmentinstrumentsinaclinicalsample AT brodinulf itemresponsetheoryevaluationofthreedepressionassessmentinstrumentsinaclinicalsample |