Cargando…

Integrating questionnaire measures for transdiagnostic psychiatric phenotyping using word2vec

BACKGROUND: Recent initiatives in psychiatry emphasize the utility of characterizing psychiatric symptoms in a multidimensional manner. However, strategies for applying standard self-report scales for multiaxial assessment have not been well-studied, particularly where the aim is to support both cat...

Descripción completa

Detalles Bibliográficos
Autores principales: Sonabend W., Aaron, Pellegrini, Amelia M., Chan, Stephanie, Brown, Hannah E., Rosenquist, James N., Vuijk, Pieter J., Doyle, Alysa E., Perlis, Roy H., Cai, Tianxi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7122719/
https://www.ncbi.nlm.nih.gov/pubmed/32243452
http://dx.doi.org/10.1371/journal.pone.0230663
_version_ 1783515479643521024
author Sonabend W., Aaron
Pellegrini, Amelia M.
Chan, Stephanie
Brown, Hannah E.
Rosenquist, James N.
Vuijk, Pieter J.
Doyle, Alysa E.
Perlis, Roy H.
Cai, Tianxi
author_facet Sonabend W., Aaron
Pellegrini, Amelia M.
Chan, Stephanie
Brown, Hannah E.
Rosenquist, James N.
Vuijk, Pieter J.
Doyle, Alysa E.
Perlis, Roy H.
Cai, Tianxi
author_sort Sonabend W., Aaron
collection PubMed
description BACKGROUND: Recent initiatives in psychiatry emphasize the utility of characterizing psychiatric symptoms in a multidimensional manner. However, strategies for applying standard self-report scales for multiaxial assessment have not been well-studied, particularly where the aim is to support both categorical and dimensional phenotypes. METHODS: We propose a method for applying natural language processing to derive dimensional measures of psychiatric symptoms from questionnaire data. We utilized nine self-report symptom measures drawn from a large cellular biobanking study that enrolled individuals with mood and psychotic disorders, as well as healthy controls. To summarize questionnaire results we used word embeddings, a technique to represent words as numeric vectors preserving semantic and syntactic meaning. A low-dimensional approximation to the embedding space was used to derive the proposed succinct summary of symptom profiles. To validate our embedding-based disease profiles, these were compared to presence or absence of axis I diagnoses derived from structured clinical interview, and to objective neurocognitive testing. RESULTS: Unsupervised and supervised classification to distinguish presence/absence of axis I disorders using survey-level embeddings remained discriminative, with area under the receiver operating characteristic curve up to 0.85, 95% confidence interval (CI) (0.74,0.91) using Gaussian mixture modeling, and cross-validated area under the receiver operating characteristic curve 0.91, 95% CI (0.88,0.94) using logistic regression. Derived symptom measures and estimated Research Domain Criteria scores also associated significantly with performance on neurocognitive tests. CONCLUSIONS: Our results support the potential utility of deriving dimensional phenotypic measures in psychiatric illness through the use of word embeddings, while illustrating the challenges in identifying truly orthogonal dimensions.
format Online
Article
Text
id pubmed-7122719
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-71227192020-04-09 Integrating questionnaire measures for transdiagnostic psychiatric phenotyping using word2vec Sonabend W., Aaron Pellegrini, Amelia M. Chan, Stephanie Brown, Hannah E. Rosenquist, James N. Vuijk, Pieter J. Doyle, Alysa E. Perlis, Roy H. Cai, Tianxi PLoS One Research Article BACKGROUND: Recent initiatives in psychiatry emphasize the utility of characterizing psychiatric symptoms in a multidimensional manner. However, strategies for applying standard self-report scales for multiaxial assessment have not been well-studied, particularly where the aim is to support both categorical and dimensional phenotypes. METHODS: We propose a method for applying natural language processing to derive dimensional measures of psychiatric symptoms from questionnaire data. We utilized nine self-report symptom measures drawn from a large cellular biobanking study that enrolled individuals with mood and psychotic disorders, as well as healthy controls. To summarize questionnaire results we used word embeddings, a technique to represent words as numeric vectors preserving semantic and syntactic meaning. A low-dimensional approximation to the embedding space was used to derive the proposed succinct summary of symptom profiles. To validate our embedding-based disease profiles, these were compared to presence or absence of axis I diagnoses derived from structured clinical interview, and to objective neurocognitive testing. RESULTS: Unsupervised and supervised classification to distinguish presence/absence of axis I disorders using survey-level embeddings remained discriminative, with area under the receiver operating characteristic curve up to 0.85, 95% confidence interval (CI) (0.74,0.91) using Gaussian mixture modeling, and cross-validated area under the receiver operating characteristic curve 0.91, 95% CI (0.88,0.94) using logistic regression. Derived symptom measures and estimated Research Domain Criteria scores also associated significantly with performance on neurocognitive tests. CONCLUSIONS: Our results support the potential utility of deriving dimensional phenotypic measures in psychiatric illness through the use of word embeddings, while illustrating the challenges in identifying truly orthogonal dimensions. Public Library of Science 2020-04-03 /pmc/articles/PMC7122719/ /pubmed/32243452 http://dx.doi.org/10.1371/journal.pone.0230663 Text en © 2020 Sonabend W. et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Sonabend W., Aaron
Pellegrini, Amelia M.
Chan, Stephanie
Brown, Hannah E.
Rosenquist, James N.
Vuijk, Pieter J.
Doyle, Alysa E.
Perlis, Roy H.
Cai, Tianxi
Integrating questionnaire measures for transdiagnostic psychiatric phenotyping using word2vec
title Integrating questionnaire measures for transdiagnostic psychiatric phenotyping using word2vec
title_full Integrating questionnaire measures for transdiagnostic psychiatric phenotyping using word2vec
title_fullStr Integrating questionnaire measures for transdiagnostic psychiatric phenotyping using word2vec
title_full_unstemmed Integrating questionnaire measures for transdiagnostic psychiatric phenotyping using word2vec
title_short Integrating questionnaire measures for transdiagnostic psychiatric phenotyping using word2vec
title_sort integrating questionnaire measures for transdiagnostic psychiatric phenotyping using word2vec
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7122719/
https://www.ncbi.nlm.nih.gov/pubmed/32243452
http://dx.doi.org/10.1371/journal.pone.0230663
work_keys_str_mv AT sonabendwaaron integratingquestionnairemeasuresfortransdiagnosticpsychiatricphenotypingusingword2vec
AT pellegriniameliam integratingquestionnairemeasuresfortransdiagnosticpsychiatricphenotypingusingword2vec
AT chanstephanie integratingquestionnairemeasuresfortransdiagnosticpsychiatricphenotypingusingword2vec
AT brownhannahe integratingquestionnairemeasuresfortransdiagnosticpsychiatricphenotypingusingword2vec
AT rosenquistjamesn integratingquestionnairemeasuresfortransdiagnosticpsychiatricphenotypingusingword2vec
AT vuijkpieterj integratingquestionnairemeasuresfortransdiagnosticpsychiatricphenotypingusingword2vec
AT doylealysae integratingquestionnairemeasuresfortransdiagnosticpsychiatricphenotypingusingword2vec
AT perlisroyh integratingquestionnairemeasuresfortransdiagnosticpsychiatricphenotypingusingword2vec
AT caitianxi integratingquestionnairemeasuresfortransdiagnosticpsychiatricphenotypingusingword2vec