Cargando…
Discovering themes in medical records of patients with psychogenic non-epileptic seizures
INTRODUCTION: Epileptic and psychogenic non-epileptic seizures (PNES) are common diagnostic problems encountered in hospital practice. This study explores the use of unsupervised machine learning in discovering themes in medical records of patients presenting with PNES. We hypothesised that themes g...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BMJ Publishing Group
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7903185/ https://www.ncbi.nlm.nih.gov/pubmed/33681804 http://dx.doi.org/10.1136/bmjno-2020-000087 |
_version_ | 1783654686502420480 |
---|---|
author | Lay, Joshua Seneviratne, Udaya Fok, Anthony Roberts, Helene Phan, Thanh |
author_facet | Lay, Joshua Seneviratne, Udaya Fok, Anthony Roberts, Helene Phan, Thanh |
author_sort | Lay, Joshua |
collection | PubMed |
description | INTRODUCTION: Epileptic and psychogenic non-epileptic seizures (PNES) are common diagnostic problems encountered in hospital practice. This study explores the use of unsupervised machine learning in discovering themes in medical records of patients presenting with PNES. We hypothesised that themes generated by machine learning are comparable with the classification by human experts. METHODS: This is a retrospective analysis of the medical records in the emergency department of patients (age >18 years) with PNES who underwent inpatient video-electroencephalography monitoring from May 2009 to June 2014 and received a final diagnosis of PNES. Prior to machine learning of written text, we applied a standardised approach in natural language processing to create a document-term matrix (removal of numbers, stop-words and punctuations, transforming fonts to lower case). The words were separated into tokens and treated as if existing within a bag-of-words. A probability of each word existing within a topic (theme) was modelled on multivariate Dirichlet distribution (R Foundation, V.3.5.0). Next, we asked four experts to independently provide a clinical interpretation of the generated topics. When the majority of (≥3) experts agreed, it was regarded as highly congruent. Interactive data are available on the web at (https://gntem2.github.io/PNES/%23topic=1&lambda=0.6&term=). RESULTS: There were 39 patients (74.4% women, median age 35 years with range 20–82). A total of 121 documents were converted to text files for text mining. There were 15 generated topics with 12/15 topics rated as highly congruent. The main themes were about descriptors of seizures and medication use. CONCLUSIONS: The findings from machine learning on PNES-related documentation provides evidence for the feasibility of applying machine-learning methodology to analyse large volumes of medical records. The topics generated by machine learning were congruent with interpretations by clinicians indicating this method can be used for screening of medical conditions among large volumes of medical records. |
format | Online Article Text |
id | pubmed-7903185 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BMJ Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-79031852021-03-04 Discovering themes in medical records of patients with psychogenic non-epileptic seizures Lay, Joshua Seneviratne, Udaya Fok, Anthony Roberts, Helene Phan, Thanh BMJ Neurol Open Original Research INTRODUCTION: Epileptic and psychogenic non-epileptic seizures (PNES) are common diagnostic problems encountered in hospital practice. This study explores the use of unsupervised machine learning in discovering themes in medical records of patients presenting with PNES. We hypothesised that themes generated by machine learning are comparable with the classification by human experts. METHODS: This is a retrospective analysis of the medical records in the emergency department of patients (age >18 years) with PNES who underwent inpatient video-electroencephalography monitoring from May 2009 to June 2014 and received a final diagnosis of PNES. Prior to machine learning of written text, we applied a standardised approach in natural language processing to create a document-term matrix (removal of numbers, stop-words and punctuations, transforming fonts to lower case). The words were separated into tokens and treated as if existing within a bag-of-words. A probability of each word existing within a topic (theme) was modelled on multivariate Dirichlet distribution (R Foundation, V.3.5.0). Next, we asked four experts to independently provide a clinical interpretation of the generated topics. When the majority of (≥3) experts agreed, it was regarded as highly congruent. Interactive data are available on the web at (https://gntem2.github.io/PNES/%23topic=1&lambda=0.6&term=). RESULTS: There were 39 patients (74.4% women, median age 35 years with range 20–82). A total of 121 documents were converted to text files for text mining. There were 15 generated topics with 12/15 topics rated as highly congruent. The main themes were about descriptors of seizures and medication use. CONCLUSIONS: The findings from machine learning on PNES-related documentation provides evidence for the feasibility of applying machine-learning methodology to analyse large volumes of medical records. The topics generated by machine learning were congruent with interpretations by clinicians indicating this method can be used for screening of medical conditions among large volumes of medical records. BMJ Publishing Group 2020-10-23 /pmc/articles/PMC7903185/ /pubmed/33681804 http://dx.doi.org/10.1136/bmjno-2020-000087 Text en © Author(s) (or their employer(s)) 2020. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. http://creativecommons.org/licenses/by-nc/4.0/ http://creativecommons.org/licenses/by-nc/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/. |
spellingShingle | Original Research Lay, Joshua Seneviratne, Udaya Fok, Anthony Roberts, Helene Phan, Thanh Discovering themes in medical records of patients with psychogenic non-epileptic seizures |
title | Discovering themes in medical records of patients with psychogenic non-epileptic seizures |
title_full | Discovering themes in medical records of patients with psychogenic non-epileptic seizures |
title_fullStr | Discovering themes in medical records of patients with psychogenic non-epileptic seizures |
title_full_unstemmed | Discovering themes in medical records of patients with psychogenic non-epileptic seizures |
title_short | Discovering themes in medical records of patients with psychogenic non-epileptic seizures |
title_sort | discovering themes in medical records of patients with psychogenic non-epileptic seizures |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7903185/ https://www.ncbi.nlm.nih.gov/pubmed/33681804 http://dx.doi.org/10.1136/bmjno-2020-000087 |
work_keys_str_mv | AT layjoshua discoveringthemesinmedicalrecordsofpatientswithpsychogenicnonepilepticseizures AT seneviratneudaya discoveringthemesinmedicalrecordsofpatientswithpsychogenicnonepilepticseizures AT fokanthony discoveringthemesinmedicalrecordsofpatientswithpsychogenicnonepilepticseizures AT robertshelene discoveringthemesinmedicalrecordsofpatientswithpsychogenicnonepilepticseizures AT phanthanh discoveringthemesinmedicalrecordsofpatientswithpsychogenicnonepilepticseizures |