Cargando…
Quoted text in the mental healthcare electronic record: an analysis of the distribution and content of single-word quotations
OBJECTIVE: To investigate the distribution and content of quoted text within the electronic health records (EHRs) using a previously developed natural language processing tool to generate a database of quotations. DESIGN: χ(2) and logistic regression were used to assess the profile of patients recei...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BMJ Publishing Group
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8719193/ http://dx.doi.org/10.1136/bmjopen-2021-049249 |
_version_ | 1784624884802912256 |
---|---|
author | Jayasinghe, Lasantha Velupillai, Sumithra Stewart, Robert |
author_facet | Jayasinghe, Lasantha Velupillai, Sumithra Stewart, Robert |
author_sort | Jayasinghe, Lasantha |
collection | PubMed |
description | OBJECTIVE: To investigate the distribution and content of quoted text within the electronic health records (EHRs) using a previously developed natural language processing tool to generate a database of quotations. DESIGN: χ(2) and logistic regression were used to assess the profile of patients receiving mental healthcare for whom quotations exist. K-means clustering using pre-trained word embeddings developed on general discharge summaries and psychosis specific mental health records were used to group one-word quotations into semantically similar groups and labelled by human subjective judgement. SETTING: EHRs from a large mental healthcare provider serving a geographic catchment area of 1.3 million residents in South London. PARTICIPANTS: For analysis of distribution, 33 499 individuals receiving mental healthcare on 30 June 2019 in South London and Maudsley. For analysis of content, 1587 unique lemmatised words, appearing a minimum of 20 times on the database of quotations created on 16 January 2020. RESULTS: The strongest individual indicator of quoted text is inpatient care in the preceding 12 months (OR 9.79, 95% CI 7.84 to 12.23). Next highest indicator is ethnicity with those with a black background more likely to have quoted text in comparison to white background (OR 2.20, 95% CI 2.08 to 2.33). Both are attenuated slightly in the adjusted model. Early psychosis intervention word embeddings subjectively produced categories pertaining to: mental illness, verbs, negative sentiment, people/relationships, mixed sentiment, aggression/violence and negative connotation. CONCLUSIONS: The findings that inpatients and those from a black ethnic background more commonly have quoted text raise important questions around where clinical attention is focused and whether this may point to any systematic bias. Our study also shows that word embeddings trained on early psychosis intervention records are useful in categorising even small subsets of the clinical records represented by one-word quotations. |
format | Online Article Text |
id | pubmed-8719193 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BMJ Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-87191932022-01-12 Quoted text in the mental healthcare electronic record: an analysis of the distribution and content of single-word quotations Jayasinghe, Lasantha Velupillai, Sumithra Stewart, Robert BMJ Open Health Informatics OBJECTIVE: To investigate the distribution and content of quoted text within the electronic health records (EHRs) using a previously developed natural language processing tool to generate a database of quotations. DESIGN: χ(2) and logistic regression were used to assess the profile of patients receiving mental healthcare for whom quotations exist. K-means clustering using pre-trained word embeddings developed on general discharge summaries and psychosis specific mental health records were used to group one-word quotations into semantically similar groups and labelled by human subjective judgement. SETTING: EHRs from a large mental healthcare provider serving a geographic catchment area of 1.3 million residents in South London. PARTICIPANTS: For analysis of distribution, 33 499 individuals receiving mental healthcare on 30 June 2019 in South London and Maudsley. For analysis of content, 1587 unique lemmatised words, appearing a minimum of 20 times on the database of quotations created on 16 January 2020. RESULTS: The strongest individual indicator of quoted text is inpatient care in the preceding 12 months (OR 9.79, 95% CI 7.84 to 12.23). Next highest indicator is ethnicity with those with a black background more likely to have quoted text in comparison to white background (OR 2.20, 95% CI 2.08 to 2.33). Both are attenuated slightly in the adjusted model. Early psychosis intervention word embeddings subjectively produced categories pertaining to: mental illness, verbs, negative sentiment, people/relationships, mixed sentiment, aggression/violence and negative connotation. CONCLUSIONS: The findings that inpatients and those from a black ethnic background more commonly have quoted text raise important questions around where clinical attention is focused and whether this may point to any systematic bias. Our study also shows that word embeddings trained on early psychosis intervention records are useful in categorising even small subsets of the clinical records represented by one-word quotations. BMJ Publishing Group 2021-12-25 /pmc/articles/PMC8719193/ http://dx.doi.org/10.1136/bmjopen-2021-049249 Text en © Author(s) (or their employer(s)) 2021. Re-use permitted under CC BY. Published by BMJ. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution 4.0 Unported (CC BY 4.0) license, which permits others to copy, redistribute, remix, transform and build upon this work for any purpose, provided the original work is properly cited, a link to the licence is given, and indication of whether changes were made. See: https://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Health Informatics Jayasinghe, Lasantha Velupillai, Sumithra Stewart, Robert Quoted text in the mental healthcare electronic record: an analysis of the distribution and content of single-word quotations |
title | Quoted text in the mental healthcare electronic record: an analysis of the distribution and content of single-word quotations |
title_full | Quoted text in the mental healthcare electronic record: an analysis of the distribution and content of single-word quotations |
title_fullStr | Quoted text in the mental healthcare electronic record: an analysis of the distribution and content of single-word quotations |
title_full_unstemmed | Quoted text in the mental healthcare electronic record: an analysis of the distribution and content of single-word quotations |
title_short | Quoted text in the mental healthcare electronic record: an analysis of the distribution and content of single-word quotations |
title_sort | quoted text in the mental healthcare electronic record: an analysis of the distribution and content of single-word quotations |
topic | Health Informatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8719193/ http://dx.doi.org/10.1136/bmjopen-2021-049249 |
work_keys_str_mv | AT jayasinghelasantha quotedtextinthementalhealthcareelectronicrecordananalysisofthedistributionandcontentofsinglewordquotations AT velupillaisumithra quotedtextinthementalhealthcareelectronicrecordananalysisofthedistributionandcontentofsinglewordquotations AT stewartrobert quotedtextinthementalhealthcareelectronicrecordananalysisofthedistributionandcontentofsinglewordquotations |