Cargando…

Natural Language Processing markers in first episode psychosis and people at clinical high-risk

Recent work has suggested that disorganised speech might be a powerful predictor of later psychotic illness in clinical high risk subjects. To that end, several automated measures to quantify disorganisation of transcribed speech have been proposed. However, it remains unclear which measures are mos...

Descripción completa

Detalles Bibliográficos
Autores principales: Morgan, Sarah E., Diederen, Kelly, Vértes, Petra E., Ip, Samantha H. Y., Wang, Bo, Thompson, Bethany, Demjaha, Arsime, De Micheli, Andrea, Oliver, Dominic, Liakata, Maria, Fusar-Poli, Paolo, Spencer, Tom J., McGuire, Philip
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8669009/
https://www.ncbi.nlm.nih.gov/pubmed/34903724
http://dx.doi.org/10.1038/s41398-021-01722-y
_version_ 1784614702306820096
author Morgan, Sarah E.
Diederen, Kelly
Vértes, Petra E.
Ip, Samantha H. Y.
Wang, Bo
Thompson, Bethany
Demjaha, Arsime
De Micheli, Andrea
Oliver, Dominic
Liakata, Maria
Fusar-Poli, Paolo
Spencer, Tom J.
McGuire, Philip
author_facet Morgan, Sarah E.
Diederen, Kelly
Vértes, Petra E.
Ip, Samantha H. Y.
Wang, Bo
Thompson, Bethany
Demjaha, Arsime
De Micheli, Andrea
Oliver, Dominic
Liakata, Maria
Fusar-Poli, Paolo
Spencer, Tom J.
McGuire, Philip
author_sort Morgan, Sarah E.
collection PubMed
description Recent work has suggested that disorganised speech might be a powerful predictor of later psychotic illness in clinical high risk subjects. To that end, several automated measures to quantify disorganisation of transcribed speech have been proposed. However, it remains unclear which measures are most strongly associated with psychosis, how different measures are related to each other and what the best strategies are to collect speech data from participants. Here, we assessed whether twelve automated Natural Language Processing markers could differentiate transcribed speech excerpts from subjects at clinical high risk for psychosis, first episode psychosis patients and healthy control subjects (total N = 54). In-line with previous work, several measures showed significant differences between groups, including semantic coherence, speech graph connectivity and a measure of whether speech was on-topic, the latter of which outperformed the related measure of tangentiality. Most NLP measures examined were only weakly related to each other, suggesting they provide complementary information. Finally, we compared the ability of transcribed speech generated using different tasks to differentiate the groups. Speech generated from picture descriptions of the Thematic Apperception Test and a story re-telling task outperformed free speech, suggesting that choice of speech generation method may be an important consideration. Overall, quantitative speech markers represent a promising direction for future clinical applications.
format Online
Article
Text
id pubmed-8669009
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-86690092021-12-28 Natural Language Processing markers in first episode psychosis and people at clinical high-risk Morgan, Sarah E. Diederen, Kelly Vértes, Petra E. Ip, Samantha H. Y. Wang, Bo Thompson, Bethany Demjaha, Arsime De Micheli, Andrea Oliver, Dominic Liakata, Maria Fusar-Poli, Paolo Spencer, Tom J. McGuire, Philip Transl Psychiatry Article Recent work has suggested that disorganised speech might be a powerful predictor of later psychotic illness in clinical high risk subjects. To that end, several automated measures to quantify disorganisation of transcribed speech have been proposed. However, it remains unclear which measures are most strongly associated with psychosis, how different measures are related to each other and what the best strategies are to collect speech data from participants. Here, we assessed whether twelve automated Natural Language Processing markers could differentiate transcribed speech excerpts from subjects at clinical high risk for psychosis, first episode psychosis patients and healthy control subjects (total N = 54). In-line with previous work, several measures showed significant differences between groups, including semantic coherence, speech graph connectivity and a measure of whether speech was on-topic, the latter of which outperformed the related measure of tangentiality. Most NLP measures examined were only weakly related to each other, suggesting they provide complementary information. Finally, we compared the ability of transcribed speech generated using different tasks to differentiate the groups. Speech generated from picture descriptions of the Thematic Apperception Test and a story re-telling task outperformed free speech, suggesting that choice of speech generation method may be an important consideration. Overall, quantitative speech markers represent a promising direction for future clinical applications. Nature Publishing Group UK 2021-12-13 /pmc/articles/PMC8669009/ /pubmed/34903724 http://dx.doi.org/10.1038/s41398-021-01722-y Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Morgan, Sarah E.
Diederen, Kelly
Vértes, Petra E.
Ip, Samantha H. Y.
Wang, Bo
Thompson, Bethany
Demjaha, Arsime
De Micheli, Andrea
Oliver, Dominic
Liakata, Maria
Fusar-Poli, Paolo
Spencer, Tom J.
McGuire, Philip
Natural Language Processing markers in first episode psychosis and people at clinical high-risk
title Natural Language Processing markers in first episode psychosis and people at clinical high-risk
title_full Natural Language Processing markers in first episode psychosis and people at clinical high-risk
title_fullStr Natural Language Processing markers in first episode psychosis and people at clinical high-risk
title_full_unstemmed Natural Language Processing markers in first episode psychosis and people at clinical high-risk
title_short Natural Language Processing markers in first episode psychosis and people at clinical high-risk
title_sort natural language processing markers in first episode psychosis and people at clinical high-risk
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8669009/
https://www.ncbi.nlm.nih.gov/pubmed/34903724
http://dx.doi.org/10.1038/s41398-021-01722-y
work_keys_str_mv AT morgansarahe naturallanguageprocessingmarkersinfirstepisodepsychosisandpeopleatclinicalhighrisk
AT diederenkelly naturallanguageprocessingmarkersinfirstepisodepsychosisandpeopleatclinicalhighrisk
AT vertespetrae naturallanguageprocessingmarkersinfirstepisodepsychosisandpeopleatclinicalhighrisk
AT ipsamanthahy naturallanguageprocessingmarkersinfirstepisodepsychosisandpeopleatclinicalhighrisk
AT wangbo naturallanguageprocessingmarkersinfirstepisodepsychosisandpeopleatclinicalhighrisk
AT thompsonbethany naturallanguageprocessingmarkersinfirstepisodepsychosisandpeopleatclinicalhighrisk
AT demjahaarsime naturallanguageprocessingmarkersinfirstepisodepsychosisandpeopleatclinicalhighrisk
AT demicheliandrea naturallanguageprocessingmarkersinfirstepisodepsychosisandpeopleatclinicalhighrisk
AT oliverdominic naturallanguageprocessingmarkersinfirstepisodepsychosisandpeopleatclinicalhighrisk
AT liakatamaria naturallanguageprocessingmarkersinfirstepisodepsychosisandpeopleatclinicalhighrisk
AT fusarpolipaolo naturallanguageprocessingmarkersinfirstepisodepsychosisandpeopleatclinicalhighrisk
AT spencertomj naturallanguageprocessingmarkersinfirstepisodepsychosisandpeopleatclinicalhighrisk
AT mcguirephilip naturallanguageprocessingmarkersinfirstepisodepsychosisandpeopleatclinicalhighrisk