Cargando…

Extracting Clinical Features From Dictated Ambulatory Consult Notes Using a Commercially Available Natural Language Processing Tool: Pilot, Retrospective, Cross-Sectional Validation Study

BACKGROUND: The increasing adoption of electronic health records (EHRs) in clinical practice holds the promise of improving care and advancing research by serving as a rich source of data, but most EHRs allow clinicians to enter data in a text format without much structure. Natural language processi...

Descripción completa

Detalles Bibliográficos
Autores principales: Petch, Jeremy, Batt, Jane, Murray, Joshua, Mamdani, Muhammad
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6913750/
https://www.ncbi.nlm.nih.gov/pubmed/31682579
http://dx.doi.org/10.2196/12575
_version_ 1783479695056044032
author Petch, Jeremy
Batt, Jane
Murray, Joshua
Mamdani, Muhammad
author_facet Petch, Jeremy
Batt, Jane
Murray, Joshua
Mamdani, Muhammad
author_sort Petch, Jeremy
collection PubMed
description BACKGROUND: The increasing adoption of electronic health records (EHRs) in clinical practice holds the promise of improving care and advancing research by serving as a rich source of data, but most EHRs allow clinicians to enter data in a text format without much structure. Natural language processing (NLP) may reduce reliance on manual abstraction of these text data by extracting clinical features directly from unstructured clinical digital text data and converting them into structured data. OBJECTIVE: This study aimed to assess the performance of a commercially available NLP tool for extracting clinical features from free-text consult notes. METHODS: We conducted a pilot, retrospective, cross-sectional study of the accuracy of NLP from dictated consult notes from our tuberculosis clinic with manual chart abstraction as the reference standard. Consult notes for 130 patients were extracted and processed using NLP. We extracted 15 clinical features from these consult notes and grouped them a priori into categories of simple, moderate, and complex for analysis. RESULTS: For the primary outcome of overall accuracy, NLP performed best for features classified as simple, achieving an overall accuracy of 96% (95% CI 94.3-97.6). Performance was slightly lower for features of moderate clinical and linguistic complexity at 93% (95% CI 91.1-94.4), and lowest for complex features at 91% (95% CI 87.3-93.1). CONCLUSIONS: The findings of this study support the use of NLP for extracting clinical features from dictated consult notes in the setting of a tuberculosis clinic. Further research is needed to fully establish the validity of NLP for this and other purposes.
format Online
Article
Text
id pubmed-6913750
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-69137502020-01-06 Extracting Clinical Features From Dictated Ambulatory Consult Notes Using a Commercially Available Natural Language Processing Tool: Pilot, Retrospective, Cross-Sectional Validation Study Petch, Jeremy Batt, Jane Murray, Joshua Mamdani, Muhammad JMIR Med Inform Original Paper BACKGROUND: The increasing adoption of electronic health records (EHRs) in clinical practice holds the promise of improving care and advancing research by serving as a rich source of data, but most EHRs allow clinicians to enter data in a text format without much structure. Natural language processing (NLP) may reduce reliance on manual abstraction of these text data by extracting clinical features directly from unstructured clinical digital text data and converting them into structured data. OBJECTIVE: This study aimed to assess the performance of a commercially available NLP tool for extracting clinical features from free-text consult notes. METHODS: We conducted a pilot, retrospective, cross-sectional study of the accuracy of NLP from dictated consult notes from our tuberculosis clinic with manual chart abstraction as the reference standard. Consult notes for 130 patients were extracted and processed using NLP. We extracted 15 clinical features from these consult notes and grouped them a priori into categories of simple, moderate, and complex for analysis. RESULTS: For the primary outcome of overall accuracy, NLP performed best for features classified as simple, achieving an overall accuracy of 96% (95% CI 94.3-97.6). Performance was slightly lower for features of moderate clinical and linguistic complexity at 93% (95% CI 91.1-94.4), and lowest for complex features at 91% (95% CI 87.3-93.1). CONCLUSIONS: The findings of this study support the use of NLP for extracting clinical features from dictated consult notes in the setting of a tuberculosis clinic. Further research is needed to fully establish the validity of NLP for this and other purposes. JMIR Publications 2019-11-01 /pmc/articles/PMC6913750/ /pubmed/31682579 http://dx.doi.org/10.2196/12575 Text en ©Jeremy Petch, Jane Batt, Joshua Murray, Muhammad Mamdani. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 01.11.2019. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Petch, Jeremy
Batt, Jane
Murray, Joshua
Mamdani, Muhammad
Extracting Clinical Features From Dictated Ambulatory Consult Notes Using a Commercially Available Natural Language Processing Tool: Pilot, Retrospective, Cross-Sectional Validation Study
title Extracting Clinical Features From Dictated Ambulatory Consult Notes Using a Commercially Available Natural Language Processing Tool: Pilot, Retrospective, Cross-Sectional Validation Study
title_full Extracting Clinical Features From Dictated Ambulatory Consult Notes Using a Commercially Available Natural Language Processing Tool: Pilot, Retrospective, Cross-Sectional Validation Study
title_fullStr Extracting Clinical Features From Dictated Ambulatory Consult Notes Using a Commercially Available Natural Language Processing Tool: Pilot, Retrospective, Cross-Sectional Validation Study
title_full_unstemmed Extracting Clinical Features From Dictated Ambulatory Consult Notes Using a Commercially Available Natural Language Processing Tool: Pilot, Retrospective, Cross-Sectional Validation Study
title_short Extracting Clinical Features From Dictated Ambulatory Consult Notes Using a Commercially Available Natural Language Processing Tool: Pilot, Retrospective, Cross-Sectional Validation Study
title_sort extracting clinical features from dictated ambulatory consult notes using a commercially available natural language processing tool: pilot, retrospective, cross-sectional validation study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6913750/
https://www.ncbi.nlm.nih.gov/pubmed/31682579
http://dx.doi.org/10.2196/12575
work_keys_str_mv AT petchjeremy extractingclinicalfeaturesfromdictatedambulatoryconsultnotesusingacommerciallyavailablenaturallanguageprocessingtoolpilotretrospectivecrosssectionalvalidationstudy
AT battjane extractingclinicalfeaturesfromdictatedambulatoryconsultnotesusingacommerciallyavailablenaturallanguageprocessingtoolpilotretrospectivecrosssectionalvalidationstudy
AT murrayjoshua extractingclinicalfeaturesfromdictatedambulatoryconsultnotesusingacommerciallyavailablenaturallanguageprocessingtoolpilotretrospectivecrosssectionalvalidationstudy
AT mamdanimuhammad extractingclinicalfeaturesfromdictatedambulatoryconsultnotesusingacommerciallyavailablenaturallanguageprocessingtoolpilotretrospectivecrosssectionalvalidationstudy