Cargando…

Semantic Linkages of Obsessions From an International Obsessive-Compulsive Disorder Mobile App Data Set: Big Data Analytics Study

BACKGROUND: Obsessive-compulsive disorder (OCD) is characterized by recurrent intrusive thoughts, urges, or images (obsessions) and repetitive physical or mental behaviors (compulsions). Previous factor analytic and clustering studies suggest the presence of three or four subtypes of OCD symptoms. H...

Descripción completa

Detalles Bibliográficos
Autores principales: Feusner, Jamie D, Mohideen, Reza, Smith, Stephen, Patanam, Ilyas, Vaitla, Anil, Lam, Christopher, Massi, Michelle, Leow, Alex
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8277342/
https://www.ncbi.nlm.nih.gov/pubmed/33892466
http://dx.doi.org/10.2196/25482
_version_ 1783722053092769792
author Feusner, Jamie D
Mohideen, Reza
Smith, Stephen
Patanam, Ilyas
Vaitla, Anil
Lam, Christopher
Massi, Michelle
Leow, Alex
author_facet Feusner, Jamie D
Mohideen, Reza
Smith, Stephen
Patanam, Ilyas
Vaitla, Anil
Lam, Christopher
Massi, Michelle
Leow, Alex
author_sort Feusner, Jamie D
collection PubMed
description BACKGROUND: Obsessive-compulsive disorder (OCD) is characterized by recurrent intrusive thoughts, urges, or images (obsessions) and repetitive physical or mental behaviors (compulsions). Previous factor analytic and clustering studies suggest the presence of three or four subtypes of OCD symptoms. However, these studies have relied on predefined symptom checklists, which are limited in breadth and may be biased toward researchers’ previous conceptualizations of OCD. OBJECTIVE: In this study, we examine a large data set of freely reported obsession symptoms obtained from an OCD mobile app as an alternative to uncovering potential OCD subtypes. From this, we examine data-driven clusters of obsessions based on their latent semantic relationships in the English language using word embeddings. METHODS: We extracted free-text entry words describing obsessions in a large sample of users of a mobile app, NOCD. Semantic vector space modeling was applied using the Global Vectors for Word Representation algorithm. A domain-specific extension, Mittens, was also applied to enhance the corpus with OCD-specific words. The resulting representations provided linear substructures of the word vector in a 100-dimensional space. We applied principal component analysis to the 100-dimensional vector representation of the most frequent words, followed by k-means clustering to obtain clusters of related words. RESULTS: We obtained 7001 unique words representing obsessions from 25,369 individuals. Heuristics for determining the optimal number of clusters pointed to a three-cluster solution for grouping subtypes of OCD. The first had themes relating to relationship and just-right; the second had themes relating to doubt and checking; and the third had themes relating to contamination, somatic, physical harm, and sexual harm. All three clusters showed close semantic relationships with each other in the central area of convergence, with themes relating to harm. An equal-sized split-sample analysis across individuals and a split-sample analysis over time both showed overall stable cluster solutions. Words in the third cluster were the most frequently occurring words, followed by words in the first cluster. CONCLUSIONS: The clustering of naturally acquired obsessional words resulted in three major groupings of semantic themes, which partially overlapped with predefined checklists from previous studies. Furthermore, the closeness of the overall embedded relationships across clusters and their central convergence on harm suggests that, at least at the level of self-reported obsessional thoughts, most obsessions have close semantic relationships. Harm to self or others may be an underlying organizing theme across many obsessions. Notably, relationship-themed words, not previously included in factor-analytic studies, clustered with just-right words. These novel insights have potential implications for understanding how an apparent multitude of obsessional symptoms are connected by underlying themes. This observation could aid exposure-based treatment approaches and could be used as a conceptual framework for future research.
format Online
Article
Text
id pubmed-8277342
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-82773422021-07-26 Semantic Linkages of Obsessions From an International Obsessive-Compulsive Disorder Mobile App Data Set: Big Data Analytics Study Feusner, Jamie D Mohideen, Reza Smith, Stephen Patanam, Ilyas Vaitla, Anil Lam, Christopher Massi, Michelle Leow, Alex J Med Internet Res Original Paper BACKGROUND: Obsessive-compulsive disorder (OCD) is characterized by recurrent intrusive thoughts, urges, or images (obsessions) and repetitive physical or mental behaviors (compulsions). Previous factor analytic and clustering studies suggest the presence of three or four subtypes of OCD symptoms. However, these studies have relied on predefined symptom checklists, which are limited in breadth and may be biased toward researchers’ previous conceptualizations of OCD. OBJECTIVE: In this study, we examine a large data set of freely reported obsession symptoms obtained from an OCD mobile app as an alternative to uncovering potential OCD subtypes. From this, we examine data-driven clusters of obsessions based on their latent semantic relationships in the English language using word embeddings. METHODS: We extracted free-text entry words describing obsessions in a large sample of users of a mobile app, NOCD. Semantic vector space modeling was applied using the Global Vectors for Word Representation algorithm. A domain-specific extension, Mittens, was also applied to enhance the corpus with OCD-specific words. The resulting representations provided linear substructures of the word vector in a 100-dimensional space. We applied principal component analysis to the 100-dimensional vector representation of the most frequent words, followed by k-means clustering to obtain clusters of related words. RESULTS: We obtained 7001 unique words representing obsessions from 25,369 individuals. Heuristics for determining the optimal number of clusters pointed to a three-cluster solution for grouping subtypes of OCD. The first had themes relating to relationship and just-right; the second had themes relating to doubt and checking; and the third had themes relating to contamination, somatic, physical harm, and sexual harm. All three clusters showed close semantic relationships with each other in the central area of convergence, with themes relating to harm. An equal-sized split-sample analysis across individuals and a split-sample analysis over time both showed overall stable cluster solutions. Words in the third cluster were the most frequently occurring words, followed by words in the first cluster. CONCLUSIONS: The clustering of naturally acquired obsessional words resulted in three major groupings of semantic themes, which partially overlapped with predefined checklists from previous studies. Furthermore, the closeness of the overall embedded relationships across clusters and their central convergence on harm suggests that, at least at the level of self-reported obsessional thoughts, most obsessions have close semantic relationships. Harm to self or others may be an underlying organizing theme across many obsessions. Notably, relationship-themed words, not previously included in factor-analytic studies, clustered with just-right words. These novel insights have potential implications for understanding how an apparent multitude of obsessional symptoms are connected by underlying themes. This observation could aid exposure-based treatment approaches and could be used as a conceptual framework for future research. JMIR Publications 2021-06-21 /pmc/articles/PMC8277342/ /pubmed/33892466 http://dx.doi.org/10.2196/25482 Text en ©Jamie D Feusner, Reza Mohideen, Stephen Smith, Ilyas Patanam, Anil Vaitla, Christopher Lam, Michelle Massi, Alex Leow. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 21.06.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Feusner, Jamie D
Mohideen, Reza
Smith, Stephen
Patanam, Ilyas
Vaitla, Anil
Lam, Christopher
Massi, Michelle
Leow, Alex
Semantic Linkages of Obsessions From an International Obsessive-Compulsive Disorder Mobile App Data Set: Big Data Analytics Study
title Semantic Linkages of Obsessions From an International Obsessive-Compulsive Disorder Mobile App Data Set: Big Data Analytics Study
title_full Semantic Linkages of Obsessions From an International Obsessive-Compulsive Disorder Mobile App Data Set: Big Data Analytics Study
title_fullStr Semantic Linkages of Obsessions From an International Obsessive-Compulsive Disorder Mobile App Data Set: Big Data Analytics Study
title_full_unstemmed Semantic Linkages of Obsessions From an International Obsessive-Compulsive Disorder Mobile App Data Set: Big Data Analytics Study
title_short Semantic Linkages of Obsessions From an International Obsessive-Compulsive Disorder Mobile App Data Set: Big Data Analytics Study
title_sort semantic linkages of obsessions from an international obsessive-compulsive disorder mobile app data set: big data analytics study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8277342/
https://www.ncbi.nlm.nih.gov/pubmed/33892466
http://dx.doi.org/10.2196/25482
work_keys_str_mv AT feusnerjamied semanticlinkagesofobsessionsfromaninternationalobsessivecompulsivedisordermobileappdatasetbigdataanalyticsstudy
AT mohideenreza semanticlinkagesofobsessionsfromaninternationalobsessivecompulsivedisordermobileappdatasetbigdataanalyticsstudy
AT smithstephen semanticlinkagesofobsessionsfromaninternationalobsessivecompulsivedisordermobileappdatasetbigdataanalyticsstudy
AT patanamilyas semanticlinkagesofobsessionsfromaninternationalobsessivecompulsivedisordermobileappdatasetbigdataanalyticsstudy
AT vaitlaanil semanticlinkagesofobsessionsfromaninternationalobsessivecompulsivedisordermobileappdatasetbigdataanalyticsstudy
AT lamchristopher semanticlinkagesofobsessionsfromaninternationalobsessivecompulsivedisordermobileappdatasetbigdataanalyticsstudy
AT massimichelle semanticlinkagesofobsessionsfromaninternationalobsessivecompulsivedisordermobileappdatasetbigdataanalyticsstudy
AT leowalex semanticlinkagesofobsessionsfromaninternationalobsessivecompulsivedisordermobileappdatasetbigdataanalyticsstudy