Cargando…

Patient Health Questionnaire-9 Item Pairing Predictiveness for Prescreening Depressive Symptomatology: Machine Learning Analysis

BACKGROUND: Anhedonia and depressed mood are considered the cardinal symptoms of major depressive disorder. These are the first 2 items of the Patient Health Questionnaire (PHQ)–9 and comprise the ultrabrief PHQ-2 used for prescreening depressive symptomatology. The prescreening performance of alter...

Descripción completa

Detalles Bibliográficos
Autores principales: Glavin, Darragh, Grua, Eoin Martino, Nakamura, Carina Akemi, Scazufca, Marcia, Ribeiro dos Santos, Edinilza, Wong, Gloria H Y, Hollingworth, William, Peters, Tim J, Araya, Ricardo, Van de Ven, Pepijn
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10623235/
https://www.ncbi.nlm.nih.gov/pubmed/37856186
http://dx.doi.org/10.2196/48444
_version_ 1785130703965388800
author Glavin, Darragh
Grua, Eoin Martino
Nakamura, Carina Akemi
Scazufca, Marcia
Ribeiro dos Santos, Edinilza
Wong, Gloria H Y
Hollingworth, William
Peters, Tim J
Araya, Ricardo
Van de Ven, Pepijn
author_facet Glavin, Darragh
Grua, Eoin Martino
Nakamura, Carina Akemi
Scazufca, Marcia
Ribeiro dos Santos, Edinilza
Wong, Gloria H Y
Hollingworth, William
Peters, Tim J
Araya, Ricardo
Van de Ven, Pepijn
author_sort Glavin, Darragh
collection PubMed
description BACKGROUND: Anhedonia and depressed mood are considered the cardinal symptoms of major depressive disorder. These are the first 2 items of the Patient Health Questionnaire (PHQ)–9 and comprise the ultrabrief PHQ-2 used for prescreening depressive symptomatology. The prescreening performance of alternative PHQ-9 item pairings is rarely compared with that of the PHQ-2. OBJECTIVE: This study aims to use machine learning (ML) with the PHQ-9 items to identify and validate the most predictive 2-item depressive symptomatology ultrabrief questionnaire and to test the generalizability of the best pairings found on the primary data set, with 6 external data sets from different populations to validate their use as prescreening instruments. METHODS: All 36 possible PHQ-9 item pairings (each yielding scores of 0-6) were investigated using ML-based methods with logistic regression models. Their performances were evaluated based on the classification of depressive symptomatology, defined as PHQ-9 scores ≥10. This gave each pairing an equal opportunity and avoided any bias in item pairing selection. RESULTS: The ML-based PHQ-9 items 2 and 4 (phq2&4), the depressed mood and low-energy item pairing, and PHQ-9 items 2 and 8 (phq2&8), the depressed mood and psychomotor retardation or agitation item pairing, were found to be the best on the primary data set training split. They generalized well on the primary data set test split with area under the curves (AUCs) of 0.954 and 0.946, respectively, compared with an AUC of 0.942 for the PHQ-2. The phq2&4 had a higher AUC than the PHQ-2 on all 6 external data sets, and the phq2&8 had a higher AUC than the PHQ-2 on 3 data sets. The phq2&4 had the highest Youden index (an unweighted average of sensitivity and specificity) on 2 external data sets, and the phq2&8 had the highest Youden index on another 2. The PHQ-2≥2 cutoff also had the highest Youden index on 2 external data sets, joint highest with the phq2&4 on 1, but its performance fluctuated the most. The PHQ-2≥3 cutoff had the highest Youden index on 1 external data set. The sensitivity and specificity achieved by the phq2&4 and phq2&8 were more evenly balanced than the PHQ-2≥2 and ≥3 cutoffs. CONCLUSIONS: The PHQ-2 did not prove to be a more effective prescreening instrument when compared with other PHQ-9 item pairings. Evaluating all item pairings showed that, compared with alternative partner items, the anhedonia item underperformed alongside the depressed mood item. This suggests that the inclusion of anhedonia as a core symptom of depression and its presence in ultrabrief questionnaires may be incompatible with the empirical evidence. The use of the PHQ-2 to prescreen for depressive symptomatology could result in a greater number of misclassifications than alternative item pairings.
format Online
Article
Text
id pubmed-10623235
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-106232352023-11-04 Patient Health Questionnaire-9 Item Pairing Predictiveness for Prescreening Depressive Symptomatology: Machine Learning Analysis Glavin, Darragh Grua, Eoin Martino Nakamura, Carina Akemi Scazufca, Marcia Ribeiro dos Santos, Edinilza Wong, Gloria H Y Hollingworth, William Peters, Tim J Araya, Ricardo Van de Ven, Pepijn JMIR Ment Health Original Paper BACKGROUND: Anhedonia and depressed mood are considered the cardinal symptoms of major depressive disorder. These are the first 2 items of the Patient Health Questionnaire (PHQ)–9 and comprise the ultrabrief PHQ-2 used for prescreening depressive symptomatology. The prescreening performance of alternative PHQ-9 item pairings is rarely compared with that of the PHQ-2. OBJECTIVE: This study aims to use machine learning (ML) with the PHQ-9 items to identify and validate the most predictive 2-item depressive symptomatology ultrabrief questionnaire and to test the generalizability of the best pairings found on the primary data set, with 6 external data sets from different populations to validate their use as prescreening instruments. METHODS: All 36 possible PHQ-9 item pairings (each yielding scores of 0-6) were investigated using ML-based methods with logistic regression models. Their performances were evaluated based on the classification of depressive symptomatology, defined as PHQ-9 scores ≥10. This gave each pairing an equal opportunity and avoided any bias in item pairing selection. RESULTS: The ML-based PHQ-9 items 2 and 4 (phq2&4), the depressed mood and low-energy item pairing, and PHQ-9 items 2 and 8 (phq2&8), the depressed mood and psychomotor retardation or agitation item pairing, were found to be the best on the primary data set training split. They generalized well on the primary data set test split with area under the curves (AUCs) of 0.954 and 0.946, respectively, compared with an AUC of 0.942 for the PHQ-2. The phq2&4 had a higher AUC than the PHQ-2 on all 6 external data sets, and the phq2&8 had a higher AUC than the PHQ-2 on 3 data sets. The phq2&4 had the highest Youden index (an unweighted average of sensitivity and specificity) on 2 external data sets, and the phq2&8 had the highest Youden index on another 2. The PHQ-2≥2 cutoff also had the highest Youden index on 2 external data sets, joint highest with the phq2&4 on 1, but its performance fluctuated the most. The PHQ-2≥3 cutoff had the highest Youden index on 1 external data set. The sensitivity and specificity achieved by the phq2&4 and phq2&8 were more evenly balanced than the PHQ-2≥2 and ≥3 cutoffs. CONCLUSIONS: The PHQ-2 did not prove to be a more effective prescreening instrument when compared with other PHQ-9 item pairings. Evaluating all item pairings showed that, compared with alternative partner items, the anhedonia item underperformed alongside the depressed mood item. This suggests that the inclusion of anhedonia as a core symptom of depression and its presence in ultrabrief questionnaires may be incompatible with the empirical evidence. The use of the PHQ-2 to prescreen for depressive symptomatology could result in a greater number of misclassifications than alternative item pairings. JMIR Publications 2023-10-19 /pmc/articles/PMC10623235/ /pubmed/37856186 http://dx.doi.org/10.2196/48444 Text en ©Darragh Glavin, Eoin Martino Grua, Carina Akemi Nakamura, Marcia Scazufca, Edinilza Ribeiro dos Santos, Gloria H Y Wong, William Hollingworth, Tim J Peters, Ricardo Araya, Pepijn Van de Ven. Originally published in JMIR Mental Health (https://mental.jmir.org), 19.10.2023. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Mental Health, is properly cited. The complete bibliographic information, a link to the original publication on https://mental.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Glavin, Darragh
Grua, Eoin Martino
Nakamura, Carina Akemi
Scazufca, Marcia
Ribeiro dos Santos, Edinilza
Wong, Gloria H Y
Hollingworth, William
Peters, Tim J
Araya, Ricardo
Van de Ven, Pepijn
Patient Health Questionnaire-9 Item Pairing Predictiveness for Prescreening Depressive Symptomatology: Machine Learning Analysis
title Patient Health Questionnaire-9 Item Pairing Predictiveness for Prescreening Depressive Symptomatology: Machine Learning Analysis
title_full Patient Health Questionnaire-9 Item Pairing Predictiveness for Prescreening Depressive Symptomatology: Machine Learning Analysis
title_fullStr Patient Health Questionnaire-9 Item Pairing Predictiveness for Prescreening Depressive Symptomatology: Machine Learning Analysis
title_full_unstemmed Patient Health Questionnaire-9 Item Pairing Predictiveness for Prescreening Depressive Symptomatology: Machine Learning Analysis
title_short Patient Health Questionnaire-9 Item Pairing Predictiveness for Prescreening Depressive Symptomatology: Machine Learning Analysis
title_sort patient health questionnaire-9 item pairing predictiveness for prescreening depressive symptomatology: machine learning analysis
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10623235/
https://www.ncbi.nlm.nih.gov/pubmed/37856186
http://dx.doi.org/10.2196/48444
work_keys_str_mv AT glavindarragh patienthealthquestionnaire9itempairingpredictivenessforprescreeningdepressivesymptomatologymachinelearninganalysis
AT gruaeoinmartino patienthealthquestionnaire9itempairingpredictivenessforprescreeningdepressivesymptomatologymachinelearninganalysis
AT nakamuracarinaakemi patienthealthquestionnaire9itempairingpredictivenessforprescreeningdepressivesymptomatologymachinelearninganalysis
AT scazufcamarcia patienthealthquestionnaire9itempairingpredictivenessforprescreeningdepressivesymptomatologymachinelearninganalysis
AT ribeirodossantosedinilza patienthealthquestionnaire9itempairingpredictivenessforprescreeningdepressivesymptomatologymachinelearninganalysis
AT wonggloriahy patienthealthquestionnaire9itempairingpredictivenessforprescreeningdepressivesymptomatologymachinelearninganalysis
AT hollingworthwilliam patienthealthquestionnaire9itempairingpredictivenessforprescreeningdepressivesymptomatologymachinelearninganalysis
AT peterstimj patienthealthquestionnaire9itempairingpredictivenessforprescreeningdepressivesymptomatologymachinelearninganalysis
AT arayaricardo patienthealthquestionnaire9itempairingpredictivenessforprescreeningdepressivesymptomatologymachinelearninganalysis
AT vandevenpepijn patienthealthquestionnaire9itempairingpredictivenessforprescreeningdepressivesymptomatologymachinelearninganalysis