Cargando…

Natural language processing to extract symptoms of severe mental illness from clinical text: the Clinical Record Interactive Search Comprehensive Data Extraction (CRIS-CODE) project

OBJECTIVES: We sought to use natural language processing to develop a suite of language models to capture key symptoms of severe mental illness (SMI) from clinical text, to facilitate the secondary use of mental healthcare data in research. DESIGN: Development and validation of information extractio...

Descripción completa

Detalles Bibliográficos
Autores principales: Jackson, Richard G, Patel, Rashmi, Jayatilleke, Nishamali, Kolliakou, Anna, Ball, Michael, Gorrell, Genevieve, Roberts, Angus, Dobson, Richard J, Stewart, Robert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5253558/
https://www.ncbi.nlm.nih.gov/pubmed/28096249
http://dx.doi.org/10.1136/bmjopen-2016-012012
_version_ 1782498181455544320
author Jackson, Richard G
Patel, Rashmi
Jayatilleke, Nishamali
Kolliakou, Anna
Ball, Michael
Gorrell, Genevieve
Roberts, Angus
Dobson, Richard J
Stewart, Robert
author_facet Jackson, Richard G
Patel, Rashmi
Jayatilleke, Nishamali
Kolliakou, Anna
Ball, Michael
Gorrell, Genevieve
Roberts, Angus
Dobson, Richard J
Stewart, Robert
author_sort Jackson, Richard G
collection PubMed
description OBJECTIVES: We sought to use natural language processing to develop a suite of language models to capture key symptoms of severe mental illness (SMI) from clinical text, to facilitate the secondary use of mental healthcare data in research. DESIGN: Development and validation of information extraction applications for ascertaining symptoms of SMI in routine mental health records using the Clinical Record Interactive Search (CRIS) data resource; description of their distribution in a corpus of discharge summaries. SETTING: Electronic records from a large mental healthcare provider serving a geographic catchment of 1.2 million residents in four boroughs of south London, UK. PARTICIPANTS: The distribution of derived symptoms was described in 23 128 discharge summaries from 7962 patients who had received an SMI diagnosis, and 13 496 discharge summaries from 7575 patients who had received a non-SMI diagnosis. OUTCOME MEASURES: Fifty SMI symptoms were identified by a team of psychiatrists for extraction based on salience and linguistic consistency in records, broadly categorised under positive, negative, disorganisation, manic and catatonic subgroups. Text models for each symptom were generated using the TextHunter tool and the CRIS database. RESULTS: We extracted data for 46 symptoms with a median F1 score of 0.88. Four symptom models performed poorly and were excluded. From the corpus of discharge summaries, it was possible to extract symptomatology in 87% of patients with SMI and 60% of patients with non-SMI diagnosis. CONCLUSIONS: This work demonstrates the possibility of automatically extracting a broad range of SMI symptoms from English text discharge summaries for patients with an SMI diagnosis. Descriptive data also indicated that most symptoms cut across diagnoses, rather than being restricted to particular groups.
format Online
Article
Text
id pubmed-5253558
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-52535582017-01-25 Natural language processing to extract symptoms of severe mental illness from clinical text: the Clinical Record Interactive Search Comprehensive Data Extraction (CRIS-CODE) project Jackson, Richard G Patel, Rashmi Jayatilleke, Nishamali Kolliakou, Anna Ball, Michael Gorrell, Genevieve Roberts, Angus Dobson, Richard J Stewart, Robert BMJ Open Mental Health OBJECTIVES: We sought to use natural language processing to develop a suite of language models to capture key symptoms of severe mental illness (SMI) from clinical text, to facilitate the secondary use of mental healthcare data in research. DESIGN: Development and validation of information extraction applications for ascertaining symptoms of SMI in routine mental health records using the Clinical Record Interactive Search (CRIS) data resource; description of their distribution in a corpus of discharge summaries. SETTING: Electronic records from a large mental healthcare provider serving a geographic catchment of 1.2 million residents in four boroughs of south London, UK. PARTICIPANTS: The distribution of derived symptoms was described in 23 128 discharge summaries from 7962 patients who had received an SMI diagnosis, and 13 496 discharge summaries from 7575 patients who had received a non-SMI diagnosis. OUTCOME MEASURES: Fifty SMI symptoms were identified by a team of psychiatrists for extraction based on salience and linguistic consistency in records, broadly categorised under positive, negative, disorganisation, manic and catatonic subgroups. Text models for each symptom were generated using the TextHunter tool and the CRIS database. RESULTS: We extracted data for 46 symptoms with a median F1 score of 0.88. Four symptom models performed poorly and were excluded. From the corpus of discharge summaries, it was possible to extract symptomatology in 87% of patients with SMI and 60% of patients with non-SMI diagnosis. CONCLUSIONS: This work demonstrates the possibility of automatically extracting a broad range of SMI symptoms from English text discharge summaries for patients with an SMI diagnosis. Descriptive data also indicated that most symptoms cut across diagnoses, rather than being restricted to particular groups. BMJ Publishing Group 2017-01-17 /pmc/articles/PMC5253558/ /pubmed/28096249 http://dx.doi.org/10.1136/bmjopen-2016-012012 Text en Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/ This is an Open Access article distributed in accordance with the terms of the Creative Commons Attribution (CC BY 4.0) license, which permits others to distribute, remix, adapt and build upon this work, for commercial use, provided the original work is properly cited. See: http://creativecommons.org/licenses/by/4.0/
spellingShingle Mental Health
Jackson, Richard G
Patel, Rashmi
Jayatilleke, Nishamali
Kolliakou, Anna
Ball, Michael
Gorrell, Genevieve
Roberts, Angus
Dobson, Richard J
Stewart, Robert
Natural language processing to extract symptoms of severe mental illness from clinical text: the Clinical Record Interactive Search Comprehensive Data Extraction (CRIS-CODE) project
title Natural language processing to extract symptoms of severe mental illness from clinical text: the Clinical Record Interactive Search Comprehensive Data Extraction (CRIS-CODE) project
title_full Natural language processing to extract symptoms of severe mental illness from clinical text: the Clinical Record Interactive Search Comprehensive Data Extraction (CRIS-CODE) project
title_fullStr Natural language processing to extract symptoms of severe mental illness from clinical text: the Clinical Record Interactive Search Comprehensive Data Extraction (CRIS-CODE) project
title_full_unstemmed Natural language processing to extract symptoms of severe mental illness from clinical text: the Clinical Record Interactive Search Comprehensive Data Extraction (CRIS-CODE) project
title_short Natural language processing to extract symptoms of severe mental illness from clinical text: the Clinical Record Interactive Search Comprehensive Data Extraction (CRIS-CODE) project
title_sort natural language processing to extract symptoms of severe mental illness from clinical text: the clinical record interactive search comprehensive data extraction (cris-code) project
topic Mental Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5253558/
https://www.ncbi.nlm.nih.gov/pubmed/28096249
http://dx.doi.org/10.1136/bmjopen-2016-012012
work_keys_str_mv AT jacksonrichardg naturallanguageprocessingtoextractsymptomsofseverementalillnessfromclinicaltexttheclinicalrecordinteractivesearchcomprehensivedataextractioncriscodeproject
AT patelrashmi naturallanguageprocessingtoextractsymptomsofseverementalillnessfromclinicaltexttheclinicalrecordinteractivesearchcomprehensivedataextractioncriscodeproject
AT jayatillekenishamali naturallanguageprocessingtoextractsymptomsofseverementalillnessfromclinicaltexttheclinicalrecordinteractivesearchcomprehensivedataextractioncriscodeproject
AT kolliakouanna naturallanguageprocessingtoextractsymptomsofseverementalillnessfromclinicaltexttheclinicalrecordinteractivesearchcomprehensivedataextractioncriscodeproject
AT ballmichael naturallanguageprocessingtoextractsymptomsofseverementalillnessfromclinicaltexttheclinicalrecordinteractivesearchcomprehensivedataextractioncriscodeproject
AT gorrellgenevieve naturallanguageprocessingtoextractsymptomsofseverementalillnessfromclinicaltexttheclinicalrecordinteractivesearchcomprehensivedataextractioncriscodeproject
AT robertsangus naturallanguageprocessingtoextractsymptomsofseverementalillnessfromclinicaltexttheclinicalrecordinteractivesearchcomprehensivedataextractioncriscodeproject
AT dobsonrichardj naturallanguageprocessingtoextractsymptomsofseverementalillnessfromclinicaltexttheclinicalrecordinteractivesearchcomprehensivedataextractioncriscodeproject
AT stewartrobert naturallanguageprocessingtoextractsymptomsofseverementalillnessfromclinicaltexttheclinicalrecordinteractivesearchcomprehensivedataextractioncriscodeproject