Cargando…

An item response theory evaluation of three depression assessment instruments in a clinical sample

BACKGROUND: This study investigates whether an analysis, based on Item Response Theory (IRT), can be used for initial evaluations of depression assessment instruments in a limited patient sample from an affective disorder outpatient clinic, with the aim to finding major advantages and deficiencies o...

Descripción completa

Detalles Bibliográficos
Autores principales: Adler, Mats, Hetta, Jerker, Isacsson, Göran, Brodin, Ulf
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3599629/
https://www.ncbi.nlm.nih.gov/pubmed/22721257
http://dx.doi.org/10.1186/1471-2288-12-84
_version_ 1782263006890033152
author Adler, Mats
Hetta, Jerker
Isacsson, Göran
Brodin, Ulf
author_facet Adler, Mats
Hetta, Jerker
Isacsson, Göran
Brodin, Ulf
author_sort Adler, Mats
collection PubMed
description BACKGROUND: This study investigates whether an analysis, based on Item Response Theory (IRT), can be used for initial evaluations of depression assessment instruments in a limited patient sample from an affective disorder outpatient clinic, with the aim to finding major advantages and deficiencies of the instruments. METHODS: Three depression assessment instruments, the depression module from the Patient Health Questionnaire (PHQ9), the depression subscale of Affective Self Rating Scale (AS-18-D) and the Montgomery-Åsberg Depression Rating Scale (MADRS) were evaluated in a sample of 61 patients with affective disorder diagnoses, mainly bipolar disorder. A ‘3- step IRT strategy’ was used. RESULTS: In a first step, the Mokken non-parametric analysis showed that PHQ9 and AS-18-D had strong overall scalabilities of 0.510 [C.I. 0.42, 0.61] and 0,513 [C.I. 0.41, 0.63] respectively, while MADRS had a weak scalability of 0.339 [C.I. 0.25, 0.43]. In a second step, a Rasch model analysis indicated large differences concerning the item discriminating capacity and was therefore considered not suitable for the data. In third step, applying a more flexible two parameter model, all three instruments showed large differences in item information and items had a low capacity to reliably measure respondents at low levels of depression severity. CONCLUSIONS: We conclude that a stepwise IRT-approach, as performed in this study, is a suitable tool for studying assessment instruments at early stages of development. Such an analysis can give useful information, even in small samples, in order to construct more precise measurements or to evaluate existing assessment instruments. The study suggests that the PHQ9 and AS-18-D can be useful for measurement of depression severity in an outpatient clinic for affective disorder, while the MADRS shows weak measurement properties for this type of patients.
format Online
Article
Text
id pubmed-3599629
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35996292013-03-17 An item response theory evaluation of three depression assessment instruments in a clinical sample Adler, Mats Hetta, Jerker Isacsson, Göran Brodin, Ulf BMC Med Res Methodol Research Article BACKGROUND: This study investigates whether an analysis, based on Item Response Theory (IRT), can be used for initial evaluations of depression assessment instruments in a limited patient sample from an affective disorder outpatient clinic, with the aim to finding major advantages and deficiencies of the instruments. METHODS: Three depression assessment instruments, the depression module from the Patient Health Questionnaire (PHQ9), the depression subscale of Affective Self Rating Scale (AS-18-D) and the Montgomery-Åsberg Depression Rating Scale (MADRS) were evaluated in a sample of 61 patients with affective disorder diagnoses, mainly bipolar disorder. A ‘3- step IRT strategy’ was used. RESULTS: In a first step, the Mokken non-parametric analysis showed that PHQ9 and AS-18-D had strong overall scalabilities of 0.510 [C.I. 0.42, 0.61] and 0,513 [C.I. 0.41, 0.63] respectively, while MADRS had a weak scalability of 0.339 [C.I. 0.25, 0.43]. In a second step, a Rasch model analysis indicated large differences concerning the item discriminating capacity and was therefore considered not suitable for the data. In third step, applying a more flexible two parameter model, all three instruments showed large differences in item information and items had a low capacity to reliably measure respondents at low levels of depression severity. CONCLUSIONS: We conclude that a stepwise IRT-approach, as performed in this study, is a suitable tool for studying assessment instruments at early stages of development. Such an analysis can give useful information, even in small samples, in order to construct more precise measurements or to evaluate existing assessment instruments. The study suggests that the PHQ9 and AS-18-D can be useful for measurement of depression severity in an outpatient clinic for affective disorder, while the MADRS shows weak measurement properties for this type of patients. BioMed Central 2012-06-21 /pmc/articles/PMC3599629/ /pubmed/22721257 http://dx.doi.org/10.1186/1471-2288-12-84 Text en Copyright ©2012 Adler et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Adler, Mats
Hetta, Jerker
Isacsson, Göran
Brodin, Ulf
An item response theory evaluation of three depression assessment instruments in a clinical sample
title An item response theory evaluation of three depression assessment instruments in a clinical sample
title_full An item response theory evaluation of three depression assessment instruments in a clinical sample
title_fullStr An item response theory evaluation of three depression assessment instruments in a clinical sample
title_full_unstemmed An item response theory evaluation of three depression assessment instruments in a clinical sample
title_short An item response theory evaluation of three depression assessment instruments in a clinical sample
title_sort item response theory evaluation of three depression assessment instruments in a clinical sample
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3599629/
https://www.ncbi.nlm.nih.gov/pubmed/22721257
http://dx.doi.org/10.1186/1471-2288-12-84
work_keys_str_mv AT adlermats anitemresponsetheoryevaluationofthreedepressionassessmentinstrumentsinaclinicalsample
AT hettajerker anitemresponsetheoryevaluationofthreedepressionassessmentinstrumentsinaclinicalsample
AT isacssongoran anitemresponsetheoryevaluationofthreedepressionassessmentinstrumentsinaclinicalsample
AT brodinulf anitemresponsetheoryevaluationofthreedepressionassessmentinstrumentsinaclinicalsample
AT adlermats itemresponsetheoryevaluationofthreedepressionassessmentinstrumentsinaclinicalsample
AT hettajerker itemresponsetheoryevaluationofthreedepressionassessmentinstrumentsinaclinicalsample
AT isacssongoran itemresponsetheoryevaluationofthreedepressionassessmentinstrumentsinaclinicalsample
AT brodinulf itemresponsetheoryevaluationofthreedepressionassessmentinstrumentsinaclinicalsample