Cargando…

Quoted text in the mental healthcare electronic record: an analysis of the distribution and content of single-word quotations

OBJECTIVE: To investigate the distribution and content of quoted text within the electronic health records (EHRs) using a previously developed natural language processing tool to generate a database of quotations. DESIGN: χ(2) and logistic regression were used to assess the profile of patients recei...

Descripción completa

Detalles Bibliográficos
Autores principales: Jayasinghe, Lasantha, Velupillai, Sumithra, Stewart, Robert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8719193/
http://dx.doi.org/10.1136/bmjopen-2021-049249
_version_ 1784624884802912256
author Jayasinghe, Lasantha
Velupillai, Sumithra
Stewart, Robert
author_facet Jayasinghe, Lasantha
Velupillai, Sumithra
Stewart, Robert
author_sort Jayasinghe, Lasantha
collection PubMed
description OBJECTIVE: To investigate the distribution and content of quoted text within the electronic health records (EHRs) using a previously developed natural language processing tool to generate a database of quotations. DESIGN: χ(2) and logistic regression were used to assess the profile of patients receiving mental healthcare for whom quotations exist. K-means clustering using pre-trained word embeddings developed on general discharge summaries and psychosis specific mental health records were used to group one-word quotations into semantically similar groups and labelled by human subjective judgement. SETTING: EHRs from a large mental healthcare provider serving a geographic catchment area of 1.3 million residents in South London. PARTICIPANTS: For analysis of distribution, 33 499 individuals receiving mental healthcare on 30 June 2019 in South London and Maudsley. For analysis of content, 1587 unique lemmatised words, appearing a minimum of 20 times on the database of quotations created on 16 January 2020. RESULTS: The strongest individual indicator of quoted text is inpatient care in the preceding 12 months (OR 9.79, 95% CI 7.84 to 12.23). Next highest indicator is ethnicity with those with a black background more likely to have quoted text in comparison to white background (OR 2.20, 95% CI 2.08 to 2.33). Both are attenuated slightly in the adjusted model. Early psychosis intervention word embeddings subjectively produced categories pertaining to: mental illness, verbs, negative sentiment, people/relationships, mixed sentiment, aggression/violence and negative connotation. CONCLUSIONS: The findings that inpatients and those from a black ethnic background more commonly have quoted text raise important questions around where clinical attention is focused and whether this may point to any systematic bias. Our study also shows that word embeddings trained on early psychosis intervention records are useful in categorising even small subsets of the clinical records represented by one-word quotations.
format Online
Article
Text
id pubmed-8719193
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-87191932022-01-12 Quoted text in the mental healthcare electronic record: an analysis of the distribution and content of single-word quotations Jayasinghe, Lasantha Velupillai, Sumithra Stewart, Robert BMJ Open Health Informatics OBJECTIVE: To investigate the distribution and content of quoted text within the electronic health records (EHRs) using a previously developed natural language processing tool to generate a database of quotations. DESIGN: χ(2) and logistic regression were used to assess the profile of patients receiving mental healthcare for whom quotations exist. K-means clustering using pre-trained word embeddings developed on general discharge summaries and psychosis specific mental health records were used to group one-word quotations into semantically similar groups and labelled by human subjective judgement. SETTING: EHRs from a large mental healthcare provider serving a geographic catchment area of 1.3 million residents in South London. PARTICIPANTS: For analysis of distribution, 33 499 individuals receiving mental healthcare on 30 June 2019 in South London and Maudsley. For analysis of content, 1587 unique lemmatised words, appearing a minimum of 20 times on the database of quotations created on 16 January 2020. RESULTS: The strongest individual indicator of quoted text is inpatient care in the preceding 12 months (OR 9.79, 95% CI 7.84 to 12.23). Next highest indicator is ethnicity with those with a black background more likely to have quoted text in comparison to white background (OR 2.20, 95% CI 2.08 to 2.33). Both are attenuated slightly in the adjusted model. Early psychosis intervention word embeddings subjectively produced categories pertaining to: mental illness, verbs, negative sentiment, people/relationships, mixed sentiment, aggression/violence and negative connotation. CONCLUSIONS: The findings that inpatients and those from a black ethnic background more commonly have quoted text raise important questions around where clinical attention is focused and whether this may point to any systematic bias. Our study also shows that word embeddings trained on early psychosis intervention records are useful in categorising even small subsets of the clinical records represented by one-word quotations. BMJ Publishing Group 2021-12-25 /pmc/articles/PMC8719193/ http://dx.doi.org/10.1136/bmjopen-2021-049249 Text en © Author(s) (or their employer(s)) 2021. Re-use permitted under CC BY. Published by BMJ. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution 4.0 Unported (CC BY 4.0) license, which permits others to copy, redistribute, remix, transform and build upon this work for any purpose, provided the original work is properly cited, a link to the licence is given, and indication of whether changes were made. See: https://creativecommons.org/licenses/by/4.0/.
spellingShingle Health Informatics
Jayasinghe, Lasantha
Velupillai, Sumithra
Stewart, Robert
Quoted text in the mental healthcare electronic record: an analysis of the distribution and content of single-word quotations
title Quoted text in the mental healthcare electronic record: an analysis of the distribution and content of single-word quotations
title_full Quoted text in the mental healthcare electronic record: an analysis of the distribution and content of single-word quotations
title_fullStr Quoted text in the mental healthcare electronic record: an analysis of the distribution and content of single-word quotations
title_full_unstemmed Quoted text in the mental healthcare electronic record: an analysis of the distribution and content of single-word quotations
title_short Quoted text in the mental healthcare electronic record: an analysis of the distribution and content of single-word quotations
title_sort quoted text in the mental healthcare electronic record: an analysis of the distribution and content of single-word quotations
topic Health Informatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8719193/
http://dx.doi.org/10.1136/bmjopen-2021-049249
work_keys_str_mv AT jayasinghelasantha quotedtextinthementalhealthcareelectronicrecordananalysisofthedistributionandcontentofsinglewordquotations
AT velupillaisumithra quotedtextinthementalhealthcareelectronicrecordananalysisofthedistributionandcontentofsinglewordquotations
AT stewartrobert quotedtextinthementalhealthcareelectronicrecordananalysisofthedistributionandcontentofsinglewordquotations