Cargando…
Using Vocal Characteristics To Classify Psychological Distress in Adult Helpline Callers: Retrospective Observational Study
BACKGROUND: Elevated psychological distress has demonstrated impacts on individuals’ health. Reliable and efficient ways to detect distress are key to early intervention. Artificial intelligence has the potential to detect states of emotional distress in an accurate, efficient, and timely manner. OB...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9811648/ https://www.ncbi.nlm.nih.gov/pubmed/36534456 http://dx.doi.org/10.2196/42249 |
_version_ | 1784863570108874752 |
---|---|
author | Iyer, Ravi Nedeljkovic, Maja Meyer, Denny |
author_facet | Iyer, Ravi Nedeljkovic, Maja Meyer, Denny |
author_sort | Iyer, Ravi |
collection | PubMed |
description | BACKGROUND: Elevated psychological distress has demonstrated impacts on individuals’ health. Reliable and efficient ways to detect distress are key to early intervention. Artificial intelligence has the potential to detect states of emotional distress in an accurate, efficient, and timely manner. OBJECTIVE: The aim of this study was to automatically classify short segments of speech obtained from callers to national suicide prevention helpline services according to high versus low psychological distress and using a range of vocal characteristics in combination with machine learning approaches. METHODS: A total of 120 telephone call recordings were initially converted to 16-bit pulse code modulation format. Short variable-length segments of each call were rated on psychological distress using the distress thermometer by the responding counselor and a second team of psychologists (n=6) blinded to the initial ratings. Following this, 24 vocal characteristics were initially extracted from 40-ms speech frames nested within segments within calls. After highly correlated variables were eliminated, 19 remained. Of 19 vocal characteristics, 7 were identified and validated as predictors of psychological distress using a penalized generalized additive mixed effects regression model, accounting for nonlinearity, autocorrelation, and moderation by sex. Speech frames were then grouped using k-means clustering based on the selected vocal characteristics. Finally, component-wise gradient boosting incorporating these clusters was used to classify each speech frame according to high versus low psychological distress. Classification accuracy was confirmed via leave-one-caller-out cross-validation, ensuring that speech segments from individual callers were not used in both the training and test data. RESULTS: The sample comprised 87 female and 33 male callers. From an initial pool of 19 characteristics, 7 vocal characteristics were identified. After grouping speech frames into 2 separate clusters (correlation with sex of caller, Cramer’s V =0.02), the component-wise gradient boosting algorithm successfully classified psychological distress to a high level of accuracy, with an area under the receiver operating characteristic curve of 97.39% (95% CI 96.20-98.45) and an area under the precision-recall curve of 97.52 (95% CI 95.71-99.12). Thus, 39,282 of 41,883 (93.39%) speech frames nested within 728 of 754 segments (96.6%) were classified as exhibiting low psychological distress, and 71455 of 75503 (94.64%) speech frames nested within 382 of 423 (90.3%) segments were classified as exhibiting high psychological distress. As the probability of high psychological distress increases, male callers spoke louder, with greater vowel articulation but with greater roughness (subharmonic depth). In contrast, female callers exhibited decreased vocal clarity (entropy), greater proportion of signal noise, higher frequencies, increased breathiness (spectral slope), and increased roughness of speech with increasing psychological distress. Individual caller random effects contributed 68% to risk reduction in the classification algorithm, followed by cluster configuration (23.4%), spectral slope (4.4%), and the 50th percentile frequency (4.2%). CONCLUSIONS: The high level of accuracy achieved suggests possibilities for real-time detection of psychological distress in helpline settings and has potential uses in pre-emptive triage and evaluations of counseling outcomes. TRIAL REGISTRATION: ANZCTR ACTRN12622000486729; https://www.anzctr.org.au/ACTRN12622000486729.aspx |
format | Online Article Text |
id | pubmed-9811648 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-98116482023-01-05 Using Vocal Characteristics To Classify Psychological Distress in Adult Helpline Callers: Retrospective Observational Study Iyer, Ravi Nedeljkovic, Maja Meyer, Denny JMIR Form Res Original Paper BACKGROUND: Elevated psychological distress has demonstrated impacts on individuals’ health. Reliable and efficient ways to detect distress are key to early intervention. Artificial intelligence has the potential to detect states of emotional distress in an accurate, efficient, and timely manner. OBJECTIVE: The aim of this study was to automatically classify short segments of speech obtained from callers to national suicide prevention helpline services according to high versus low psychological distress and using a range of vocal characteristics in combination with machine learning approaches. METHODS: A total of 120 telephone call recordings were initially converted to 16-bit pulse code modulation format. Short variable-length segments of each call were rated on psychological distress using the distress thermometer by the responding counselor and a second team of psychologists (n=6) blinded to the initial ratings. Following this, 24 vocal characteristics were initially extracted from 40-ms speech frames nested within segments within calls. After highly correlated variables were eliminated, 19 remained. Of 19 vocal characteristics, 7 were identified and validated as predictors of psychological distress using a penalized generalized additive mixed effects regression model, accounting for nonlinearity, autocorrelation, and moderation by sex. Speech frames were then grouped using k-means clustering based on the selected vocal characteristics. Finally, component-wise gradient boosting incorporating these clusters was used to classify each speech frame according to high versus low psychological distress. Classification accuracy was confirmed via leave-one-caller-out cross-validation, ensuring that speech segments from individual callers were not used in both the training and test data. RESULTS: The sample comprised 87 female and 33 male callers. From an initial pool of 19 characteristics, 7 vocal characteristics were identified. After grouping speech frames into 2 separate clusters (correlation with sex of caller, Cramer’s V =0.02), the component-wise gradient boosting algorithm successfully classified psychological distress to a high level of accuracy, with an area under the receiver operating characteristic curve of 97.39% (95% CI 96.20-98.45) and an area under the precision-recall curve of 97.52 (95% CI 95.71-99.12). Thus, 39,282 of 41,883 (93.39%) speech frames nested within 728 of 754 segments (96.6%) were classified as exhibiting low psychological distress, and 71455 of 75503 (94.64%) speech frames nested within 382 of 423 (90.3%) segments were classified as exhibiting high psychological distress. As the probability of high psychological distress increases, male callers spoke louder, with greater vowel articulation but with greater roughness (subharmonic depth). In contrast, female callers exhibited decreased vocal clarity (entropy), greater proportion of signal noise, higher frequencies, increased breathiness (spectral slope), and increased roughness of speech with increasing psychological distress. Individual caller random effects contributed 68% to risk reduction in the classification algorithm, followed by cluster configuration (23.4%), spectral slope (4.4%), and the 50th percentile frequency (4.2%). CONCLUSIONS: The high level of accuracy achieved suggests possibilities for real-time detection of psychological distress in helpline settings and has potential uses in pre-emptive triage and evaluations of counseling outcomes. TRIAL REGISTRATION: ANZCTR ACTRN12622000486729; https://www.anzctr.org.au/ACTRN12622000486729.aspx JMIR Publications 2022-12-19 /pmc/articles/PMC9811648/ /pubmed/36534456 http://dx.doi.org/10.2196/42249 Text en ©Ravi Iyer, Maja Nedeljkovic, Denny Meyer. Originally published in JMIR Formative Research (https://formative.jmir.org), 19.12.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Formative Research, is properly cited. The complete bibliographic information, a link to the original publication on https://formative.jmir.org, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Iyer, Ravi Nedeljkovic, Maja Meyer, Denny Using Vocal Characteristics To Classify Psychological Distress in Adult Helpline Callers: Retrospective Observational Study |
title | Using Vocal Characteristics To Classify Psychological Distress in Adult Helpline Callers: Retrospective Observational Study |
title_full | Using Vocal Characteristics To Classify Psychological Distress in Adult Helpline Callers: Retrospective Observational Study |
title_fullStr | Using Vocal Characteristics To Classify Psychological Distress in Adult Helpline Callers: Retrospective Observational Study |
title_full_unstemmed | Using Vocal Characteristics To Classify Psychological Distress in Adult Helpline Callers: Retrospective Observational Study |
title_short | Using Vocal Characteristics To Classify Psychological Distress in Adult Helpline Callers: Retrospective Observational Study |
title_sort | using vocal characteristics to classify psychological distress in adult helpline callers: retrospective observational study |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9811648/ https://www.ncbi.nlm.nih.gov/pubmed/36534456 http://dx.doi.org/10.2196/42249 |
work_keys_str_mv | AT iyerravi usingvocalcharacteristicstoclassifypsychologicaldistressinadulthelplinecallersretrospectiveobservationalstudy AT nedeljkovicmaja usingvocalcharacteristicstoclassifypsychologicaldistressinadulthelplinecallersretrospectiveobservationalstudy AT meyerdenny usingvocalcharacteristicstoclassifypsychologicaldistressinadulthelplinecallersretrospectiveobservationalstudy |