Cargando…

Extracting seizure frequency from epilepsy clinic notes: a machine reading approach to natural language processing

OBJECTIVE: Seizure frequency and seizure freedom are among the most important outcome measures for patients with epilepsy. In this study, we aimed to automatically extract this clinical information from unstructured text in clinical notes. If successful, this could improve clinical decision-making i...

Descripción completa

Detalles Bibliográficos
Autores principales: Xie, Kevin, Gallagher, Ryan S, Conrad, Erin C, Garrick, Chadric O, Baldassano, Steven N, Bernabei, John M, Galer, Peter D, Ghosn, Nina J, Greenblatt, Adam S, Jennings, Tara, Kornspun, Alana, Kulick-Soper, Catherine V, Panchal, Jal M, Pattnaik, Akash R, Scheid, Brittany H, Wei, Danmeng, Weitzman, Micah, Muthukrishnan, Ramya, Kim, Joongwon, Litt, Brian, Ellis, Colin A, Roth, Dan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9006692/
https://www.ncbi.nlm.nih.gov/pubmed/35190834
http://dx.doi.org/10.1093/jamia/ocac018
_version_ 1784686716796272640
author Xie, Kevin
Gallagher, Ryan S
Conrad, Erin C
Garrick, Chadric O
Baldassano, Steven N
Bernabei, John M
Galer, Peter D
Ghosn, Nina J
Greenblatt, Adam S
Jennings, Tara
Kornspun, Alana
Kulick-Soper, Catherine V
Panchal, Jal M
Pattnaik, Akash R
Scheid, Brittany H
Wei, Danmeng
Weitzman, Micah
Muthukrishnan, Ramya
Kim, Joongwon
Litt, Brian
Ellis, Colin A
Roth, Dan
author_facet Xie, Kevin
Gallagher, Ryan S
Conrad, Erin C
Garrick, Chadric O
Baldassano, Steven N
Bernabei, John M
Galer, Peter D
Ghosn, Nina J
Greenblatt, Adam S
Jennings, Tara
Kornspun, Alana
Kulick-Soper, Catherine V
Panchal, Jal M
Pattnaik, Akash R
Scheid, Brittany H
Wei, Danmeng
Weitzman, Micah
Muthukrishnan, Ramya
Kim, Joongwon
Litt, Brian
Ellis, Colin A
Roth, Dan
author_sort Xie, Kevin
collection PubMed
description OBJECTIVE: Seizure frequency and seizure freedom are among the most important outcome measures for patients with epilepsy. In this study, we aimed to automatically extract this clinical information from unstructured text in clinical notes. If successful, this could improve clinical decision-making in epilepsy patients and allow for rapid, large-scale retrospective research. MATERIALS AND METHODS: We developed a finetuning pipeline for pretrained neural models to classify patients as being seizure-free and to extract text containing their seizure frequency and date of last seizure from clinical notes. We annotated 1000 notes for use as training and testing data and determined how well 3 pretrained neural models, BERT, RoBERTa, and Bio_ClinicalBERT, could identify and extract the desired information after finetuning. RESULTS: The finetuned models (BERT(FT), Bio_ClinicalBERT(FT), and RoBERTa(FT)) achieved near-human performance when classifying patients as seizure free, with BERT(FT) and Bio_ClinicalBERT(FT) achieving accuracy scores over 80%. All 3 models also achieved human performance when extracting seizure frequency and date of last seizure, with overall F(1) scores over 0.80. The best combination of models was Bio_ClinicalBERT(FT) for classification, and RoBERTa(FT) for text extraction. Most of the gains in performance due to finetuning required roughly 70 annotated notes. DISCUSSION AND CONCLUSION: Our novel machine reading approach to extracting important clinical outcomes performed at or near human performance on several tasks. This approach opens new possibilities to support clinical practice and conduct large-scale retrospective clinical research. Future studies can use our finetuning pipeline with minimal training annotations to answer new clinical questions.
format Online
Article
Text
id pubmed-9006692
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-90066922022-04-13 Extracting seizure frequency from epilepsy clinic notes: a machine reading approach to natural language processing Xie, Kevin Gallagher, Ryan S Conrad, Erin C Garrick, Chadric O Baldassano, Steven N Bernabei, John M Galer, Peter D Ghosn, Nina J Greenblatt, Adam S Jennings, Tara Kornspun, Alana Kulick-Soper, Catherine V Panchal, Jal M Pattnaik, Akash R Scheid, Brittany H Wei, Danmeng Weitzman, Micah Muthukrishnan, Ramya Kim, Joongwon Litt, Brian Ellis, Colin A Roth, Dan J Am Med Inform Assoc Research and Applications OBJECTIVE: Seizure frequency and seizure freedom are among the most important outcome measures for patients with epilepsy. In this study, we aimed to automatically extract this clinical information from unstructured text in clinical notes. If successful, this could improve clinical decision-making in epilepsy patients and allow for rapid, large-scale retrospective research. MATERIALS AND METHODS: We developed a finetuning pipeline for pretrained neural models to classify patients as being seizure-free and to extract text containing their seizure frequency and date of last seizure from clinical notes. We annotated 1000 notes for use as training and testing data and determined how well 3 pretrained neural models, BERT, RoBERTa, and Bio_ClinicalBERT, could identify and extract the desired information after finetuning. RESULTS: The finetuned models (BERT(FT), Bio_ClinicalBERT(FT), and RoBERTa(FT)) achieved near-human performance when classifying patients as seizure free, with BERT(FT) and Bio_ClinicalBERT(FT) achieving accuracy scores over 80%. All 3 models also achieved human performance when extracting seizure frequency and date of last seizure, with overall F(1) scores over 0.80. The best combination of models was Bio_ClinicalBERT(FT) for classification, and RoBERTa(FT) for text extraction. Most of the gains in performance due to finetuning required roughly 70 annotated notes. DISCUSSION AND CONCLUSION: Our novel machine reading approach to extracting important clinical outcomes performed at or near human performance on several tasks. This approach opens new possibilities to support clinical practice and conduct large-scale retrospective clinical research. Future studies can use our finetuning pipeline with minimal training annotations to answer new clinical questions. Oxford University Press 2022-02-22 /pmc/articles/PMC9006692/ /pubmed/35190834 http://dx.doi.org/10.1093/jamia/ocac018 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research and Applications
Xie, Kevin
Gallagher, Ryan S
Conrad, Erin C
Garrick, Chadric O
Baldassano, Steven N
Bernabei, John M
Galer, Peter D
Ghosn, Nina J
Greenblatt, Adam S
Jennings, Tara
Kornspun, Alana
Kulick-Soper, Catherine V
Panchal, Jal M
Pattnaik, Akash R
Scheid, Brittany H
Wei, Danmeng
Weitzman, Micah
Muthukrishnan, Ramya
Kim, Joongwon
Litt, Brian
Ellis, Colin A
Roth, Dan
Extracting seizure frequency from epilepsy clinic notes: a machine reading approach to natural language processing
title Extracting seizure frequency from epilepsy clinic notes: a machine reading approach to natural language processing
title_full Extracting seizure frequency from epilepsy clinic notes: a machine reading approach to natural language processing
title_fullStr Extracting seizure frequency from epilepsy clinic notes: a machine reading approach to natural language processing
title_full_unstemmed Extracting seizure frequency from epilepsy clinic notes: a machine reading approach to natural language processing
title_short Extracting seizure frequency from epilepsy clinic notes: a machine reading approach to natural language processing
title_sort extracting seizure frequency from epilepsy clinic notes: a machine reading approach to natural language processing
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9006692/
https://www.ncbi.nlm.nih.gov/pubmed/35190834
http://dx.doi.org/10.1093/jamia/ocac018
work_keys_str_mv AT xiekevin extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing
AT gallagherryans extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing
AT conraderinc extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing
AT garrickchadrico extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing
AT baldassanostevenn extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing
AT bernabeijohnm extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing
AT galerpeterd extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing
AT ghosnninaj extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing
AT greenblattadams extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing
AT jenningstara extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing
AT kornspunalana extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing
AT kulicksopercatherinev extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing
AT panchaljalm extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing
AT pattnaikakashr extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing
AT scheidbrittanyh extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing
AT weidanmeng extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing
AT weitzmanmicah extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing
AT muthukrishnanramya extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing
AT kimjoongwon extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing
AT littbrian extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing
AT elliscolina extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing
AT rothdan extractingseizurefrequencyfromepilepsyclinicnotesamachinereadingapproachtonaturallanguageprocessing