Cargando…

Evaluation of natural language processing from emergency department computerized medical records for intra-hospital syndromic surveillance

BACKGROUND: The identification of patients who pose an epidemic hazard when they are admitted to a health facility plays a role in preventing the risk of hospital acquired infection. An automated clinical decision support system to detect suspected cases, based on the principle of syndromic surveill...

Descripción completa

Detalles Bibliográficos
Autores principales: Gerbier, Solweig, Yarovaya, Olga, Gicquel, Quentin, Millet, Anne-Laure, Smaldore, Véronique, Pagliaroli, Véronique, Darmoni, Stefan, Metzger, Marie-Hélène
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3158541/
https://www.ncbi.nlm.nih.gov/pubmed/21798029
http://dx.doi.org/10.1186/1472-6947-11-50
_version_ 1782210379604033536
author Gerbier, Solweig
Yarovaya, Olga
Gicquel, Quentin
Millet, Anne-Laure
Smaldore, Véronique
Pagliaroli, Véronique
Darmoni, Stefan
Metzger, Marie-Hélène
author_facet Gerbier, Solweig
Yarovaya, Olga
Gicquel, Quentin
Millet, Anne-Laure
Smaldore, Véronique
Pagliaroli, Véronique
Darmoni, Stefan
Metzger, Marie-Hélène
author_sort Gerbier, Solweig
collection PubMed
description BACKGROUND: The identification of patients who pose an epidemic hazard when they are admitted to a health facility plays a role in preventing the risk of hospital acquired infection. An automated clinical decision support system to detect suspected cases, based on the principle of syndromic surveillance, is being developed at the University of Lyon's Hôpital de la Croix-Rousse. This tool will analyse structured data and narrative reports from computerized emergency department (ED) medical records. The first step consists of developing an application (UrgIndex) which automatically extracts and encodes information found in narrative reports. The purpose of the present article is to describe and evaluate this natural language processing system. METHODS: Narrative reports have to be pre-processed before utilizing the French-language medical multi-terminology indexer (ECMT) for standardized encoding. UrgIndex identifies and excludes syntagmas containing a negation and replaces non-standard terms (abbreviations, acronyms, spelling errors...). Then, the phrases are sent to the ECMT through an Internet connection. The indexer's reply, based on Extensible Markup Language, returns codes and literals corresponding to the concepts found in phrases. UrgIndex filters codes corresponding to suspected infections. Recall is defined as the number of relevant processed medical concepts divided by the number of concepts evaluated (coded manually by the medical epidemiologist). Precision is defined as the number of relevant processed concepts divided by the number of concepts proposed by UrgIndex. Recall and precision were assessed for respiratory and cutaneous syndromes. RESULTS: Evaluation of 1,674 processed medical concepts contained in 100 ED medical records (50 for respiratory syndromes and 50 for cutaneous syndromes) showed an overall recall of 85.8% (95% CI: 84.1-87.3). Recall varied from 84.5% for respiratory syndromes to 87.0% for cutaneous syndromes. The most frequent cause of lack of processing was non-recognition of the term by UrgIndex (9.7%). Overall precision was 79.1% (95% CI: 77.3-80.8). It varied from 81.4% for respiratory syndromes to 77.0% for cutaneous syndromes. CONCLUSIONS: This study demonstrates the feasibility of and interest in developing an automated method for extracting and encoding medical concepts from ED narrative reports, the first step required for the detection of potentially infectious patients at epidemic risk.
format Online
Article
Text
id pubmed-3158541
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31585412011-08-20 Evaluation of natural language processing from emergency department computerized medical records for intra-hospital syndromic surveillance Gerbier, Solweig Yarovaya, Olga Gicquel, Quentin Millet, Anne-Laure Smaldore, Véronique Pagliaroli, Véronique Darmoni, Stefan Metzger, Marie-Hélène BMC Med Inform Decis Mak Research Article BACKGROUND: The identification of patients who pose an epidemic hazard when they are admitted to a health facility plays a role in preventing the risk of hospital acquired infection. An automated clinical decision support system to detect suspected cases, based on the principle of syndromic surveillance, is being developed at the University of Lyon's Hôpital de la Croix-Rousse. This tool will analyse structured data and narrative reports from computerized emergency department (ED) medical records. The first step consists of developing an application (UrgIndex) which automatically extracts and encodes information found in narrative reports. The purpose of the present article is to describe and evaluate this natural language processing system. METHODS: Narrative reports have to be pre-processed before utilizing the French-language medical multi-terminology indexer (ECMT) for standardized encoding. UrgIndex identifies and excludes syntagmas containing a negation and replaces non-standard terms (abbreviations, acronyms, spelling errors...). Then, the phrases are sent to the ECMT through an Internet connection. The indexer's reply, based on Extensible Markup Language, returns codes and literals corresponding to the concepts found in phrases. UrgIndex filters codes corresponding to suspected infections. Recall is defined as the number of relevant processed medical concepts divided by the number of concepts evaluated (coded manually by the medical epidemiologist). Precision is defined as the number of relevant processed concepts divided by the number of concepts proposed by UrgIndex. Recall and precision were assessed for respiratory and cutaneous syndromes. RESULTS: Evaluation of 1,674 processed medical concepts contained in 100 ED medical records (50 for respiratory syndromes and 50 for cutaneous syndromes) showed an overall recall of 85.8% (95% CI: 84.1-87.3). Recall varied from 84.5% for respiratory syndromes to 87.0% for cutaneous syndromes. The most frequent cause of lack of processing was non-recognition of the term by UrgIndex (9.7%). Overall precision was 79.1% (95% CI: 77.3-80.8). It varied from 81.4% for respiratory syndromes to 77.0% for cutaneous syndromes. CONCLUSIONS: This study demonstrates the feasibility of and interest in developing an automated method for extracting and encoding medical concepts from ED narrative reports, the first step required for the detection of potentially infectious patients at epidemic risk. BioMed Central 2011-07-28 /pmc/articles/PMC3158541/ /pubmed/21798029 http://dx.doi.org/10.1186/1472-6947-11-50 Text en Copyright ©2011 Gerbier et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Gerbier, Solweig
Yarovaya, Olga
Gicquel, Quentin
Millet, Anne-Laure
Smaldore, Véronique
Pagliaroli, Véronique
Darmoni, Stefan
Metzger, Marie-Hélène
Evaluation of natural language processing from emergency department computerized medical records for intra-hospital syndromic surveillance
title Evaluation of natural language processing from emergency department computerized medical records for intra-hospital syndromic surveillance
title_full Evaluation of natural language processing from emergency department computerized medical records for intra-hospital syndromic surveillance
title_fullStr Evaluation of natural language processing from emergency department computerized medical records for intra-hospital syndromic surveillance
title_full_unstemmed Evaluation of natural language processing from emergency department computerized medical records for intra-hospital syndromic surveillance
title_short Evaluation of natural language processing from emergency department computerized medical records for intra-hospital syndromic surveillance
title_sort evaluation of natural language processing from emergency department computerized medical records for intra-hospital syndromic surveillance
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3158541/
https://www.ncbi.nlm.nih.gov/pubmed/21798029
http://dx.doi.org/10.1186/1472-6947-11-50
work_keys_str_mv AT gerbiersolweig evaluationofnaturallanguageprocessingfromemergencydepartmentcomputerizedmedicalrecordsforintrahospitalsyndromicsurveillance
AT yarovayaolga evaluationofnaturallanguageprocessingfromemergencydepartmentcomputerizedmedicalrecordsforintrahospitalsyndromicsurveillance
AT gicquelquentin evaluationofnaturallanguageprocessingfromemergencydepartmentcomputerizedmedicalrecordsforintrahospitalsyndromicsurveillance
AT milletannelaure evaluationofnaturallanguageprocessingfromemergencydepartmentcomputerizedmedicalrecordsforintrahospitalsyndromicsurveillance
AT smaldoreveronique evaluationofnaturallanguageprocessingfromemergencydepartmentcomputerizedmedicalrecordsforintrahospitalsyndromicsurveillance
AT pagliaroliveronique evaluationofnaturallanguageprocessingfromemergencydepartmentcomputerizedmedicalrecordsforintrahospitalsyndromicsurveillance
AT darmonistefan evaluationofnaturallanguageprocessingfromemergencydepartmentcomputerizedmedicalrecordsforintrahospitalsyndromicsurveillance
AT metzgermariehelene evaluationofnaturallanguageprocessingfromemergencydepartmentcomputerizedmedicalrecordsforintrahospitalsyndromicsurveillance