
T101. ENRICHING PSYCHOTIC DISORDER CLASSIFICATION USING NATURAL LANGUAGE PROCESSING

BACKGROUND: Advances in molecular biology, genetics and neuroimaging have the potential to improve our understanding of psychotic disorders. However, the clinical classification of psychotic disorders has remained largely unchanged and is based on criterion-based diagnostic systems (such as ICD-10 a...


Bibliographic Details
Main authors: Patel, Rashmi, Jackson, Richard, Stewart, Robert, McGuire, Philip
Format: Online Article Text
Language: English
Published: Oxford University Press 2018
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5888675/
http://dx.doi.org/10.1093/schbul/sby016.377
_version_ 1783312578307424256
author Patel, Rashmi
Jackson, Richard
Stewart, Robert
McGuire, Philip
author_facet Patel, Rashmi
Jackson, Richard
Stewart, Robert
McGuire, Philip
author_sort Patel, Rashmi
collection PubMed
description BACKGROUND: Advances in molecular biology, genetics and neuroimaging have the potential to improve our understanding of psychotic disorders. However, the clinical classification of psychotic disorders has remained largely unchanged and is based on criterion-based diagnostic systems (such as ICD-10 and DSM-5) which do not necessarily reflect their underlying aetiology and pathophysiology. A more refined characterisation of clinical phenotype could help to improve our understanding of these disorders. Clinical data are increasingly recorded in the form of electronic health records (EHRs). Automated information extraction methods such as natural language processing (NLP) offer the opportunity to quickly extract and analyse large volumes of clinical data from EHRs. We sought to characterise the range of presenting symptoms in a large sample of patients with psychotic disorders using NLP. METHODS: Dataset: South London and Maudsley NHS Trust (SLaM) Biomedical Research Centre (BRC) Case Register comprising pseudonymised EHRs of over 270,000 people. Clinical sample: 18,761 patients with an ICD-10 diagnosis of a psychotic disorder (F20, F25 or F31) and a control group of 57,999 patients with a non-psychotic disorder diagnosis (mood/affective/personality disorders without psychotic symptoms). Data collection: The NLP software package TextHunter was used. All sentences containing keywords relevant to the following symptom categories were analysed using a support vector machine (SVM) learning approach: positive symptoms, negative symptoms, disorganisation, mania and catatonia. Data on 46 symptoms were obtained, with 37,211 instances annotated to contribute training and gold standard data for machine learning. 2,950 instances were independently annotated to determine inter-annotator agreement. Outcomes: Prevalence of psychotic symptoms and their association with ICD-10 diagnosis. RESULTS: A good degree of inter-annotator agreement was achieved (Cohen’s κ: 0.83). Machine learning NLP achieved a mean precision (positive predictive value) of 83% and recall (sensitivity) of 78%. Among patients with psychotic disorders, the most frequently documented symptoms were paranoia, disturbed sleep and hallucinations. Psychotic symptoms were not limited to patients with an ICD-10 diagnosis of a psychotic disorder and were also present in the control group. DISCUSSION: We found that psychotic symptoms were not limited to patients with a specific ICD-10 diagnosis and were present in a wide range of ICD-10 disorders. These findings highlight the utility of detailed NLP-derived symptom data to better characterise psychotic disorders.
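The abstract reports three evaluation figures: Cohen's κ for inter-annotator agreement, and precision and recall for the machine learning output. The sketch below shows how these quantities are conventionally computed for a single binary symptom label. It is not the authors' code or part of TextHunter. The function names and the ten-item toy label lists are invented for illustration. The abstract's precision and recall are means across symptoms, which would correspond to averaging the per-symptom values produced here.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators who labelled the same instances."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two equal-length, non-empty label lists")
    n = len(labels_a)
    # Observed agreement: share of instances given the same label by both.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: product of each annotator's marginal label rates.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1.0:  # both annotators used one identical label throughout
        return 1.0
    return (observed - expected) / (1 - expected)


def precision_recall(gold, predicted, positive=1):
    """Precision (PPV) and recall (sensitivity) against gold-standard labels."""
    pairs = list(zip(gold, predicted))
    tp = sum(g == positive and p == positive for g, p in pairs)
    fp = sum(g != positive and p == positive for g, p in pairs)
    fn = sum(g == positive and p != positive for g, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Hypothetical toy data: 1 = symptom documented in the sentence, 0 = not.
annotator_1 = [1, 1, 1, 0, 0, 0, 1, 0, 1, 0]
annotator_2 = [1, 1, 0, 0, 0, 0, 1, 0, 1, 1]
model_output = [1, 1, 0, 0, 0, 1, 1, 0, 1, 0]

kappa = cohens_kappa(annotator_1, annotator_2)
precision, recall = precision_recall(annotator_1, model_output)
print(f"kappa={kappa:.2f} precision={precision:.2f} recall={recall:.2f}")
```

κ corrects raw agreement for the agreement expected by chance, which is why it is preferred over simple percent agreement when one label dominates the annotated sentences.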
format Online
Article
Text
id pubmed-5888675
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-5888675 2018-04-11 T101. ENRICHING PSYCHOTIC DISORDER CLASSIFICATION USING NATURAL LANGUAGE PROCESSING Patel, Rashmi Jackson, Richard Stewart, Robert McGuire, Philip Schizophr Bull Abstracts Oxford University Press 2018-04 2018-04-01 /pmc/articles/PMC5888675/ http://dx.doi.org/10.1093/schbul/sby016.377 Text en © Maryland Psychiatric Research Center 2018. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Abstracts
Patel, Rashmi
Jackson, Richard
Stewart, Robert
McGuire, Philip
T101. ENRICHING PSYCHOTIC DISORDER CLASSIFICATION USING NATURAL LANGUAGE PROCESSING
title T101. ENRICHING PSYCHOTIC DISORDER CLASSIFICATION USING NATURAL LANGUAGE PROCESSING
title_full T101. ENRICHING PSYCHOTIC DISORDER CLASSIFICATION USING NATURAL LANGUAGE PROCESSING
title_fullStr T101. ENRICHING PSYCHOTIC DISORDER CLASSIFICATION USING NATURAL LANGUAGE PROCESSING
title_full_unstemmed T101. ENRICHING PSYCHOTIC DISORDER CLASSIFICATION USING NATURAL LANGUAGE PROCESSING
title_short T101. ENRICHING PSYCHOTIC DISORDER CLASSIFICATION USING NATURAL LANGUAGE PROCESSING
title_sort t101. enriching psychotic disorder classification using natural language processing
topic Abstracts
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5888675/
http://dx.doi.org/10.1093/schbul/sby016.377
work_keys_str_mv AT patelrashmi t101enrichingpsychoticdisorderclassificationusingnaturallanguageprocessing
AT jacksonrichard t101enrichingpsychoticdisorderclassificationusingnaturallanguageprocessing
AT stewartrobert t101enrichingpsychoticdisorderclassificationusingnaturallanguageprocessing
AT mcguirephilip t101enrichingpsychoticdisorderclassificationusingnaturallanguageprocessing