Cargando…

Multicenter Validation of Natural Language Processing Algorithms for the Detection of Common Data Elements in Operative Notes for Total Hip Arthroplasty: Algorithm Development and Validation

BACKGROUND: Natural language processing (NLP) methods are powerful tools for extracting and analyzing critical information from free-text data. MedTaggerIE, an open-source NLP pipeline for information extraction based on text patterns, has been widely used in the annotation of clinical notes. A rule...

Descripción completa

Detalles Bibliográficos
Autores principales:	Han, Peijin, Fu, Sunyang, Kolis, Julie, Hughes, Richard, Hallstrom, Brian R, Carvour, Martha, Maradit-Kremers, Hilal, Sohn, Sunghwan, Vydiswaran, VG Vinod
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	JMIR Publications 2022
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9475406/ https://www.ncbi.nlm.nih.gov/pubmed/36044253 http://dx.doi.org/10.2196/38155

_version_	1784789905180721152
author	Han, Peijin Fu, Sunyang Kolis, Julie Hughes, Richard Hallstrom, Brian R Carvour, Martha Maradit-Kremers, Hilal Sohn, Sunghwan Vydiswaran, VG Vinod
author_facet	Han, Peijin Fu, Sunyang Kolis, Julie Hughes, Richard Hallstrom, Brian R Carvour, Martha Maradit-Kremers, Hilal Sohn, Sunghwan Vydiswaran, VG Vinod
author_sort	Han, Peijin
collection	PubMed
description	BACKGROUND: Natural language processing (NLP) methods are powerful tools for extracting and analyzing critical information from free-text data. MedTaggerIE, an open-source NLP pipeline for information extraction based on text patterns, has been widely used in the annotation of clinical notes. A rule-based system, MedTagger-total hip arthroplasty (THA), developed based on MedTaggerIE, was previously shown to correctly identify the surgical approach, fixation, and bearing surface from the THA operative notes at Mayo Clinic. OBJECTIVE: This study aimed to assess the implementability, usability, and portability of MedTagger-THA at two external institutions, Michigan Medicine and the University of Iowa, and provide lessons learned for best practices. METHODS: We conducted iterative test-apply-refinement processes with three involved sites—the development site (Mayo Clinic) and two deployment sites (Michigan Medicine and the University of Iowa). Mayo Clinic was the primary NLP development site, with the THA registry as the gold standard. The activities at the two deployment sites included the extraction of the operative notes, gold standard development (Michigan: registry data; Iowa: manual chart review), the refinement of NLP algorithms on training data, and the evaluation of test data. Error analyses were conducted to understand language variations across sites. To further assess the model specificity for approach and fixation, we applied the refined MedTagger-THA to arthroscopic hip procedures and periacetabular osteotomy cases, as neither of these operative notes should contain any approach or fixation keywords. RESULTS: MedTagger-THA algorithms were implemented and refined independently for both sites. At Michigan, the study comprised THA-related notes for 2569 patient-date pairs. Before model refinement, MedTagger-THA algorithms demonstrated excellent accuracy for approach (96.6%, 95% CI 94.6%-97.9%) and fixation (95.7%, 95% CI 92.4%-97.6%). These results were comparable with internal accuracy at the development site (99.2% for approach and 90.7% for fixation). Model refinement improved accuracies slightly for both approach (99%, 95% CI 97.6%-99.6%) and fixation (98%, 95% CI 95.3%-99.3%). The specificity of approach identification was 88.9% for arthroscopy cases, and the specificity of fixation identification was 100% for both periacetabular osteotomy and arthroscopy cases. At the Iowa site, the study comprised an overall data set of 100 operative notes (50 training notes and 50 test notes). MedTagger-THA algorithms achieved moderate-high performance on the training data. After model refinement, the model achieved high performance for approach (100%, 95% CI 91.3%-100%), fixation (98%, 95% CI 88.3%-100%), and bearing surface (92%, 95% CI 80.5%-97.3%). CONCLUSIONS: High performance across centers was achieved for the MedTagger-THA algorithms, demonstrating that they were sufficiently implementable, usable, and portable to different deployment sites. This study provided important lessons learned during the model deployment and validation processes, and it can serve as a reference for transferring rule-based electronic health record models.
format	Online Article Text
id	pubmed-9475406
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-94754062022-09-16 Multicenter Validation of Natural Language Processing Algorithms for the Detection of Common Data Elements in Operative Notes for Total Hip Arthroplasty: Algorithm Development and Validation Han, Peijin Fu, Sunyang Kolis, Julie Hughes, Richard Hallstrom, Brian R Carvour, Martha Maradit-Kremers, Hilal Sohn, Sunghwan Vydiswaran, VG Vinod JMIR Med Inform Original Paper BACKGROUND: Natural language processing (NLP) methods are powerful tools for extracting and analyzing critical information from free-text data. MedTaggerIE, an open-source NLP pipeline for information extraction based on text patterns, has been widely used in the annotation of clinical notes. A rule-based system, MedTagger-total hip arthroplasty (THA), developed based on MedTaggerIE, was previously shown to correctly identify the surgical approach, fixation, and bearing surface from the THA operative notes at Mayo Clinic. OBJECTIVE: This study aimed to assess the implementability, usability, and portability of MedTagger-THA at two external institutions, Michigan Medicine and the University of Iowa, and provide lessons learned for best practices. METHODS: We conducted iterative test-apply-refinement processes with three involved sites—the development site (Mayo Clinic) and two deployment sites (Michigan Medicine and the University of Iowa). Mayo Clinic was the primary NLP development site, with the THA registry as the gold standard. The activities at the two deployment sites included the extraction of the operative notes, gold standard development (Michigan: registry data; Iowa: manual chart review), the refinement of NLP algorithms on training data, and the evaluation of test data. Error analyses were conducted to understand language variations across sites. To further assess the model specificity for approach and fixation, we applied the refined MedTagger-THA to arthroscopic hip procedures and periacetabular osteotomy cases, as neither of these operative notes should contain any approach or fixation keywords. RESULTS: MedTagger-THA algorithms were implemented and refined independently for both sites. At Michigan, the study comprised THA-related notes for 2569 patient-date pairs. Before model refinement, MedTagger-THA algorithms demonstrated excellent accuracy for approach (96.6%, 95% CI 94.6%-97.9%) and fixation (95.7%, 95% CI 92.4%-97.6%). These results were comparable with internal accuracy at the development site (99.2% for approach and 90.7% for fixation). Model refinement improved accuracies slightly for both approach (99%, 95% CI 97.6%-99.6%) and fixation (98%, 95% CI 95.3%-99.3%). The specificity of approach identification was 88.9% for arthroscopy cases, and the specificity of fixation identification was 100% for both periacetabular osteotomy and arthroscopy cases. At the Iowa site, the study comprised an overall data set of 100 operative notes (50 training notes and 50 test notes). MedTagger-THA algorithms achieved moderate-high performance on the training data. After model refinement, the model achieved high performance for approach (100%, 95% CI 91.3%-100%), fixation (98%, 95% CI 88.3%-100%), and bearing surface (92%, 95% CI 80.5%-97.3%). CONCLUSIONS: High performance across centers was achieved for the MedTagger-THA algorithms, demonstrating that they were sufficiently implementable, usable, and portable to different deployment sites. This study provided important lessons learned during the model deployment and validation processes, and it can serve as a reference for transferring rule-based electronic health record models. JMIR Publications 2022-08-31 /pmc/articles/PMC9475406/ /pubmed/36044253 http://dx.doi.org/10.2196/38155 Text en ©Peijin Han, Sunyang Fu, Julie Kolis, Richard Hughes, Brian R Hallstrom, Martha Carvour, Hilal Maradit-Kremers, Sunghwan Sohn, VG Vinod Vydiswaran. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 31.08.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle	Original Paper Han, Peijin Fu, Sunyang Kolis, Julie Hughes, Richard Hallstrom, Brian R Carvour, Martha Maradit-Kremers, Hilal Sohn, Sunghwan Vydiswaran, VG Vinod Multicenter Validation of Natural Language Processing Algorithms for the Detection of Common Data Elements in Operative Notes for Total Hip Arthroplasty: Algorithm Development and Validation
title	Multicenter Validation of Natural Language Processing Algorithms for the Detection of Common Data Elements in Operative Notes for Total Hip Arthroplasty: Algorithm Development and Validation
title_full	Multicenter Validation of Natural Language Processing Algorithms for the Detection of Common Data Elements in Operative Notes for Total Hip Arthroplasty: Algorithm Development and Validation
title_fullStr	Multicenter Validation of Natural Language Processing Algorithms for the Detection of Common Data Elements in Operative Notes for Total Hip Arthroplasty: Algorithm Development and Validation
title_full_unstemmed	Multicenter Validation of Natural Language Processing Algorithms for the Detection of Common Data Elements in Operative Notes for Total Hip Arthroplasty: Algorithm Development and Validation
title_short	Multicenter Validation of Natural Language Processing Algorithms for the Detection of Common Data Elements in Operative Notes for Total Hip Arthroplasty: Algorithm Development and Validation
title_sort	multicenter validation of natural language processing algorithms for the detection of common data elements in operative notes for total hip arthroplasty: algorithm development and validation
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9475406/ https://www.ncbi.nlm.nih.gov/pubmed/36044253 http://dx.doi.org/10.2196/38155
work_keys_str_mv	AT hanpeijin multicentervalidationofnaturallanguageprocessingalgorithmsforthedetectionofcommondataelementsinoperativenotesfortotalhiparthroplastyalgorithmdevelopmentandvalidation AT fusunyang multicentervalidationofnaturallanguageprocessingalgorithmsforthedetectionofcommondataelementsinoperativenotesfortotalhiparthroplastyalgorithmdevelopmentandvalidation AT kolisjulie multicentervalidationofnaturallanguageprocessingalgorithmsforthedetectionofcommondataelementsinoperativenotesfortotalhiparthroplastyalgorithmdevelopmentandvalidation AT hughesrichard multicentervalidationofnaturallanguageprocessingalgorithmsforthedetectionofcommondataelementsinoperativenotesfortotalhiparthroplastyalgorithmdevelopmentandvalidation AT hallstrombrianr multicentervalidationofnaturallanguageprocessingalgorithmsforthedetectionofcommondataelementsinoperativenotesfortotalhiparthroplastyalgorithmdevelopmentandvalidation AT carvourmartha multicentervalidationofnaturallanguageprocessingalgorithmsforthedetectionofcommondataelementsinoperativenotesfortotalhiparthroplastyalgorithmdevelopmentandvalidation AT maraditkremershilal multicentervalidationofnaturallanguageprocessingalgorithmsforthedetectionofcommondataelementsinoperativenotesfortotalhiparthroplastyalgorithmdevelopmentandvalidation AT sohnsunghwan multicentervalidationofnaturallanguageprocessingalgorithmsforthedetectionofcommondataelementsinoperativenotesfortotalhiparthroplastyalgorithmdevelopmentandvalidation AT vydiswaranvgvinod multicentervalidationofnaturallanguageprocessingalgorithmsforthedetectionofcommondataelementsinoperativenotesfortotalhiparthroplastyalgorithmdevelopmentandvalidation

Multicenter Validation of Natural Language Processing Algorithms for the Detection of Common Data Elements in Operative Notes for Total Hip Arthroplasty: Algorithm Development and Validation

Ejemplares similares