Cargando…
T101. ENRICHING PSYCHOTIC DISORDER CLASSIFICATION USING NATURAL LANGUAGE PROCESSING
BACKGROUND: Advances in molecular biology, genetics and neuroimaging have the potential to improve our understanding of psychotic disorders. However, the clinical classification of psychotic disorders has remained largely unchanged and is based on criterion-based diagnostic systems (such as ICD-10 a...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5888675/ http://dx.doi.org/10.1093/schbul/sby016.377 |
_version_ | 1783312578307424256 |
---|---|
author | Patel, Rashmi Jackson, Richard Stewart, Robert McGuire, Philip |
author_facet | Patel, Rashmi Jackson, Richard Stewart, Robert McGuire, Philip |
author_sort | Patel, Rashmi |
collection | PubMed |
description | BACKGROUND: Advances in molecular biology, genetics and neuroimaging have the potential to improve our understanding of psychotic disorders. However, the clinical classification of psychotic disorders has remained largely unchanged and is based on criterion-based diagnostic systems (such as ICD-10 and DSM-5) which do not necessarily reflect their underlying aetiology and pathophysiology. A more refined characterisation of clinical phenotype could help to improve our understanding of these disorders. Clinical data are increasingly recorded in the form of electronic health records (EHRs). Automated information extraction methods such as natural language processing (NLP) offer the opportunity to quickly extract and analyse large volumes of clinical data from EHRs. We sought to characterise the range of presenting symptoms in a large sample of patients with psychotic disorders using NLP. METHODS: Dataset: South London and Maudsley NHS Trust (SLaM) Biomedical Research Centre (BRC) Case Register comprising pseudonymised EHRs of over 270,000 people. Clinical sample: 18,761 patients with an ICD-10 diagnosis of a psychotic disorders (F20, F25 or F31) and a control group of 57,999 patients with a non-psychotic disorder diagnosis (mood/affective/personality disorders without psychotic symptoms). Data collection: The NLP software package TextHunter was used. All sentences containing keywords relevant to the following symptom categories were analysed using a support vector machine learning (SVM) approach: positive symptoms, negative symptoms, disorganisation, mania and catatonia. Data on 46 symptoms were obtained with 37,211 instances annotated to contribute training and gold standard data for machine learning. 2,950 instances were independently annotated to determine inter-annotator agreement. OUTCOMES: prevalence of psychotic symptoms and their association with ICD-10 diagnosis. RESULTS: A good degree of inter-annotator agreement was achieved (Cohen’s κ: 0.83). Machine learning NLP achieved a mean precision (positive predictive value) of 83% and recall (sensitivity) of 78%. Among patients with psychotic disorders, the most frequently documented symptoms were paranoia, disturbed sleep and hallucinations. Psychotic symptoms were not limited to patients with an ICD-10 diagnosis of a psychotic disorder and were also present in the control group. DISCUSSION: We found that psychotic symptoms were not limited to patients with a specific ICD-10 diagnosis and were present in a wide range of ICD-10 disorders. These findings highlight the utility of detailed NLP-derived symptom data to better characterise psychotic disorders. |
format | Online Article Text |
id | pubmed-5888675 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-58886752018-04-11 T101. ENRICHING PSYCHOTIC DISORDER CLASSIFICATION USING NATURAL LANGUAGE PROCESSING Patel, Rashmi Jackson, Richard Stewart, Robert McGuire, Philip Schizophr Bull Abstracts BACKGROUND: Advances in molecular biology, genetics and neuroimaging have the potential to improve our understanding of psychotic disorders. However, the clinical classification of psychotic disorders has remained largely unchanged and is based on criterion-based diagnostic systems (such as ICD-10 and DSM-5) which do not necessarily reflect their underlying aetiology and pathophysiology. A more refined characterisation of clinical phenotype could help to improve our understanding of these disorders. Clinical data are increasingly recorded in the form of electronic health records (EHRs). Automated information extraction methods such as natural language processing (NLP) offer the opportunity to quickly extract and analyse large volumes of clinical data from EHRs. We sought to characterise the range of presenting symptoms in a large sample of patients with psychotic disorders using NLP. METHODS: Dataset: South London and Maudsley NHS Trust (SLaM) Biomedical Research Centre (BRC) Case Register comprising pseudonymised EHRs of over 270,000 people. Clinical sample: 18,761 patients with an ICD-10 diagnosis of a psychotic disorders (F20, F25 or F31) and a control group of 57,999 patients with a non-psychotic disorder diagnosis (mood/affective/personality disorders without psychotic symptoms). Data collection: The NLP software package TextHunter was used. All sentences containing keywords relevant to the following symptom categories were analysed using a support vector machine learning (SVM) approach: positive symptoms, negative symptoms, disorganisation, mania and catatonia. Data on 46 symptoms were obtained with 37,211 instances annotated to contribute training and gold standard data for machine learning. 2,950 instances were independently annotated to determine inter-annotator agreement. OUTCOMES: prevalence of psychotic symptoms and their association with ICD-10 diagnosis. RESULTS: A good degree of inter-annotator agreement was achieved (Cohen’s κ: 0.83). Machine learning NLP achieved a mean precision (positive predictive value) of 83% and recall (sensitivity) of 78%. Among patients with psychotic disorders, the most frequently documented symptoms were paranoia, disturbed sleep and hallucinations. Psychotic symptoms were not limited to patients with an ICD-10 diagnosis of a psychotic disorder and were also present in the control group. DISCUSSION: We found that psychotic symptoms were not limited to patients with a specific ICD-10 diagnosis and were present in a wide range of ICD-10 disorders. These findings highlight the utility of detailed NLP-derived symptom data to better characterise psychotic disorders. Oxford University Press 2018-04 2018-04-01 /pmc/articles/PMC5888675/ http://dx.doi.org/10.1093/schbul/sby016.377 Text en © Maryland Psychiatric Research Center 2018. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Abstracts Patel, Rashmi Jackson, Richard Stewart, Robert McGuire, Philip T101. ENRICHING PSYCHOTIC DISORDER CLASSIFICATION USING NATURAL LANGUAGE PROCESSING |
title | T101. ENRICHING PSYCHOTIC DISORDER CLASSIFICATION USING NATURAL LANGUAGE PROCESSING |
title_full | T101. ENRICHING PSYCHOTIC DISORDER CLASSIFICATION USING NATURAL LANGUAGE PROCESSING |
title_fullStr | T101. ENRICHING PSYCHOTIC DISORDER CLASSIFICATION USING NATURAL LANGUAGE PROCESSING |
title_full_unstemmed | T101. ENRICHING PSYCHOTIC DISORDER CLASSIFICATION USING NATURAL LANGUAGE PROCESSING |
title_short | T101. ENRICHING PSYCHOTIC DISORDER CLASSIFICATION USING NATURAL LANGUAGE PROCESSING |
title_sort | t101. enriching psychotic disorder classification using natural language processing |
topic | Abstracts |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5888675/ http://dx.doi.org/10.1093/schbul/sby016.377 |
work_keys_str_mv | AT patelrashmi t101enrichingpsychoticdisorderclassificationusingnaturallanguageprocessing AT jacksonrichard t101enrichingpsychoticdisorderclassificationusingnaturallanguageprocessing AT stewartrobert t101enrichingpsychoticdisorderclassificationusingnaturallanguageprocessing AT mcguirephilip t101enrichingpsychoticdisorderclassificationusingnaturallanguageprocessing |