Cargando…
Benchmarking Clinical Speech Recognition and Information Extraction: New Data, Methods, and Evaluations
BACKGROUND: Over a tenth of preventable adverse events in health care are caused by failures in information flow. These failures are tangible in clinical handover; regardless of good verbal handover, from two-thirds to all of this information is lost after 3-5 shifts if notes are taken by hand, or n...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Gunther Eysenbach
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4427705/ https://www.ncbi.nlm.nih.gov/pubmed/25917752 http://dx.doi.org/10.2196/medinform.4321 |
_version_ | 1782370764102565888 |
---|---|
author | Suominen, Hanna Zhou, Liyuan Hanlen, Leif Ferraro, Gabriela |
author_facet | Suominen, Hanna Zhou, Liyuan Hanlen, Leif Ferraro, Gabriela |
author_sort | Suominen, Hanna |
collection | PubMed |
description | BACKGROUND: Over a tenth of preventable adverse events in health care are caused by failures in information flow. These failures are tangible in clinical handover; regardless of good verbal handover, from two-thirds to all of this information is lost after 3-5 shifts if notes are taken by hand, or not at all. Speech recognition and information extraction provide a way to fill out a handover form for clinical proofing and sign-off. OBJECTIVE: The objective of the study was to provide a recorded spoken handover, annotated verbatim transcriptions, and evaluations to support research in spoken and written natural language processing for filling out a clinical handover form. This dataset is based on synthetic patient profiles, thereby avoiding ethical and legal restrictions, while maintaining efficacy for research in speech-to-text conversion and information extraction, based on realistic clinical scenarios. We also introduce a Web app to demonstrate the system design and workflow. METHODS: We experiment with Dragon Medical 11.0 for speech recognition and CRF++ for information extraction. To compute features for information extraction, we also apply CoreNLP, MetaMap, and Ontoserver. Our evaluation uses cross-validation techniques to measure processing correctness. RESULTS: The data provided were a simulation of nursing handover, as recorded using a mobile device, built from simulated patient records and handover scripts, spoken by an Australian registered nurse. Speech recognition recognized 5276 of 7277 words in our 100 test documents correctly. We considered 50 mutually exclusive categories in information extraction and achieved the F1 (ie, the harmonic mean of Precision and Recall) of 0.86 in the category for irrelevant text and the macro-averaged F1 of 0.70 over the remaining 35 nonempty categories of the form in our 101 test documents. CONCLUSIONS: The significance of this study hinges on opening our data, together with the related performance benchmarks and some processing software, to the research and development community for studying clinical documentation and language-processing. The data are used in the CLEFeHealth 2015 evaluation laboratory for a shared task on speech recognition. |
format | Online Article Text |
id | pubmed-4427705 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Gunther Eysenbach |
record_format | MEDLINE/PubMed |
spelling | pubmed-44277052015-05-26 Benchmarking Clinical Speech Recognition and Information Extraction: New Data, Methods, and Evaluations Suominen, Hanna Zhou, Liyuan Hanlen, Leif Ferraro, Gabriela JMIR Med Inform Original Paper BACKGROUND: Over a tenth of preventable adverse events in health care are caused by failures in information flow. These failures are tangible in clinical handover; regardless of good verbal handover, from two-thirds to all of this information is lost after 3-5 shifts if notes are taken by hand, or not at all. Speech recognition and information extraction provide a way to fill out a handover form for clinical proofing and sign-off. OBJECTIVE: The objective of the study was to provide a recorded spoken handover, annotated verbatim transcriptions, and evaluations to support research in spoken and written natural language processing for filling out a clinical handover form. This dataset is based on synthetic patient profiles, thereby avoiding ethical and legal restrictions, while maintaining efficacy for research in speech-to-text conversion and information extraction, based on realistic clinical scenarios. We also introduce a Web app to demonstrate the system design and workflow. METHODS: We experiment with Dragon Medical 11.0 for speech recognition and CRF++ for information extraction. To compute features for information extraction, we also apply CoreNLP, MetaMap, and Ontoserver. Our evaluation uses cross-validation techniques to measure processing correctness. RESULTS: The data provided were a simulation of nursing handover, as recorded using a mobile device, built from simulated patient records and handover scripts, spoken by an Australian registered nurse. Speech recognition recognized 5276 of 7277 words in our 100 test documents correctly. We considered 50 mutually exclusive categories in information extraction and achieved the F1 (ie, the harmonic mean of Precision and Recall) of 0.86 in the category for irrelevant text and the macro-averaged F1 of 0.70 over the remaining 35 nonempty categories of the form in our 101 test documents. CONCLUSIONS: The significance of this study hinges on opening our data, together with the related performance benchmarks and some processing software, to the research and development community for studying clinical documentation and language-processing. The data are used in the CLEFeHealth 2015 evaluation laboratory for a shared task on speech recognition. Gunther Eysenbach 2015-04-27 /pmc/articles/PMC4427705/ /pubmed/25917752 http://dx.doi.org/10.2196/medinform.4321 Text en ©Hanna Suominen, Liyuan Zhou, Leif Hanlen, Gabriela Ferraro. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 27.04.2015. http://creativecommons.org/licenses/by/2.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Suominen, Hanna Zhou, Liyuan Hanlen, Leif Ferraro, Gabriela Benchmarking Clinical Speech Recognition and Information Extraction: New Data, Methods, and Evaluations |
title | Benchmarking Clinical Speech Recognition and Information Extraction: New Data, Methods, and Evaluations |
title_full | Benchmarking Clinical Speech Recognition and Information Extraction: New Data, Methods, and Evaluations |
title_fullStr | Benchmarking Clinical Speech Recognition and Information Extraction: New Data, Methods, and Evaluations |
title_full_unstemmed | Benchmarking Clinical Speech Recognition and Information Extraction: New Data, Methods, and Evaluations |
title_short | Benchmarking Clinical Speech Recognition and Information Extraction: New Data, Methods, and Evaluations |
title_sort | benchmarking clinical speech recognition and information extraction: new data, methods, and evaluations |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4427705/ https://www.ncbi.nlm.nih.gov/pubmed/25917752 http://dx.doi.org/10.2196/medinform.4321 |
work_keys_str_mv | AT suominenhanna benchmarkingclinicalspeechrecognitionandinformationextractionnewdatamethodsandevaluations AT zhouliyuan benchmarkingclinicalspeechrecognitionandinformationextractionnewdatamethodsandevaluations AT hanlenleif benchmarkingclinicalspeechrecognitionandinformationextractionnewdatamethodsandevaluations AT ferrarogabriela benchmarkingclinicalspeechrecognitionandinformationextractionnewdatamethodsandevaluations |