Cargando…

Performance of a Natural Language Processing (NLP) Tool to Extract Pulmonary Function Test (PFT) Reports from Structured and Semistructured Veteran Affairs (VA) Data

INTRODUCTION/OBJECTIVE: Pulmonary function tests (PFTs) are objective estimates of lung function, but are not reliably stored within the Veteran Health Affairs data systems as structured data. The aim of this study was to validate the natural language processing (NLP) tool we developed—which extract...

Descripción completa

Detalles Bibliográficos
Autores principales:	Sauer, Brian C., Jones, Barbara E., Globe, Gary, Leng, Jianwei, Lu, Chao-Chin, He, Tao, Teng, Chia-Chen, Sullivan, Patrick, Zeng, Qing
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	AcademyHealth 2016
Materias:	Articles
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4909376/ https://www.ncbi.nlm.nih.gov/pubmed/27376095 http://dx.doi.org/10.13063/2327-9214.1217

_version_	1782437830270648320
author	Sauer, Brian C. Jones, Barbara E. Globe, Gary Leng, Jianwei Lu, Chao-Chin He, Tao Teng, Chia-Chen Sullivan, Patrick Zeng, Qing
author_facet	Sauer, Brian C. Jones, Barbara E. Globe, Gary Leng, Jianwei Lu, Chao-Chin He, Tao Teng, Chia-Chen Sullivan, Patrick Zeng, Qing
author_sort	Sauer, Brian C.
collection	PubMed
description	INTRODUCTION/OBJECTIVE: Pulmonary function tests (PFTs) are objective estimates of lung function, but are not reliably stored within the Veteran Health Affairs data systems as structured data. The aim of this study was to validate the natural language processing (NLP) tool we developed—which extracts spirometric values and responses to bronchodilator administration—against expert review, and to estimate the number of additional spirometric tests identified beyond the structured data. METHODS: All patients at seven Veteran Affairs Medical Centers with a diagnostic code for asthma Jan 1, 2006–Dec 31, 2012 were included. Evidence of spirometry with a bronchodilator challenge (BDC) was extracted from structured data as well as clinical documents. NLP’s performance was compared against a human reference standard using a random sample of 1,001 documents. RESULTS: In the validation set NLP demonstrated a precision of 98.9 percent (95 percent confidence intervals (CI): 93.9 percent, 99.7 percent), recall of 97.8 percent (95 percent CI: 92.2 percent, 99.7 percent), and an F-measure of 98.3 percent for the forced vital capacity pre- and post pairs and precision of 100 percent (95 percent CI: 96.6 percent, 100 percent), recall of 100 percent (95 percent CI: 96.6 percent, 100 percent), and an F-measure of 100 percent for the forced expiratory volume in one second pre- and post pairs for bronchodilator administration. Application of the NLP increased the proportion identified with complete bronchodilator challenge by 25 percent. DISCUSSION/CONCLUSION: This technology can improve identification of PFTs for epidemiologic research. Caution must be taken in assuming that a single domain of clinical data can completely capture the scope of a disease, treatment, or clinical test.
format	Online Article Text
id	pubmed-4909376
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	AcademyHealth
record_format	MEDLINE/PubMed
spelling	pubmed-49093762016-07-01 Performance of a Natural Language Processing (NLP) Tool to Extract Pulmonary Function Test (PFT) Reports from Structured and Semistructured Veteran Affairs (VA) Data Sauer, Brian C. Jones, Barbara E. Globe, Gary Leng, Jianwei Lu, Chao-Chin He, Tao Teng, Chia-Chen Sullivan, Patrick Zeng, Qing EGEMS (Wash DC) Articles INTRODUCTION/OBJECTIVE: Pulmonary function tests (PFTs) are objective estimates of lung function, but are not reliably stored within the Veteran Health Affairs data systems as structured data. The aim of this study was to validate the natural language processing (NLP) tool we developed—which extracts spirometric values and responses to bronchodilator administration—against expert review, and to estimate the number of additional spirometric tests identified beyond the structured data. METHODS: All patients at seven Veteran Affairs Medical Centers with a diagnostic code for asthma Jan 1, 2006–Dec 31, 2012 were included. Evidence of spirometry with a bronchodilator challenge (BDC) was extracted from structured data as well as clinical documents. NLP’s performance was compared against a human reference standard using a random sample of 1,001 documents. RESULTS: In the validation set NLP demonstrated a precision of 98.9 percent (95 percent confidence intervals (CI): 93.9 percent, 99.7 percent), recall of 97.8 percent (95 percent CI: 92.2 percent, 99.7 percent), and an F-measure of 98.3 percent for the forced vital capacity pre- and post pairs and precision of 100 percent (95 percent CI: 96.6 percent, 100 percent), recall of 100 percent (95 percent CI: 96.6 percent, 100 percent), and an F-measure of 100 percent for the forced expiratory volume in one second pre- and post pairs for bronchodilator administration. Application of the NLP increased the proportion identified with complete bronchodilator challenge by 25 percent. DISCUSSION/CONCLUSION: This technology can improve identification of PFTs for epidemiologic research. Caution must be taken in assuming that a single domain of clinical data can completely capture the scope of a disease, treatment, or clinical test. AcademyHealth 2016-06-01 /pmc/articles/PMC4909376/ /pubmed/27376095 http://dx.doi.org/10.13063/2327-9214.1217 Text en All eGEMs publications are licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 License http://creativecommons.org/licenses/by-nc-nd/3.0/
spellingShingle	Articles Sauer, Brian C. Jones, Barbara E. Globe, Gary Leng, Jianwei Lu, Chao-Chin He, Tao Teng, Chia-Chen Sullivan, Patrick Zeng, Qing Performance of a Natural Language Processing (NLP) Tool to Extract Pulmonary Function Test (PFT) Reports from Structured and Semistructured Veteran Affairs (VA) Data
title	Performance of a Natural Language Processing (NLP) Tool to Extract Pulmonary Function Test (PFT) Reports from Structured and Semistructured Veteran Affairs (VA) Data
title_full	Performance of a Natural Language Processing (NLP) Tool to Extract Pulmonary Function Test (PFT) Reports from Structured and Semistructured Veteran Affairs (VA) Data
title_fullStr	Performance of a Natural Language Processing (NLP) Tool to Extract Pulmonary Function Test (PFT) Reports from Structured and Semistructured Veteran Affairs (VA) Data
title_full_unstemmed	Performance of a Natural Language Processing (NLP) Tool to Extract Pulmonary Function Test (PFT) Reports from Structured and Semistructured Veteran Affairs (VA) Data
title_short	Performance of a Natural Language Processing (NLP) Tool to Extract Pulmonary Function Test (PFT) Reports from Structured and Semistructured Veteran Affairs (VA) Data
title_sort	performance of a natural language processing (nlp) tool to extract pulmonary function test (pft) reports from structured and semistructured veteran affairs (va) data
topic	Articles
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4909376/ https://www.ncbi.nlm.nih.gov/pubmed/27376095 http://dx.doi.org/10.13063/2327-9214.1217
work_keys_str_mv	AT sauerbrianc performanceofanaturallanguageprocessingnlptooltoextractpulmonaryfunctiontestpftreportsfromstructuredandsemistructuredveteranaffairsvadata AT jonesbarbarae performanceofanaturallanguageprocessingnlptooltoextractpulmonaryfunctiontestpftreportsfromstructuredandsemistructuredveteranaffairsvadata AT globegary performanceofanaturallanguageprocessingnlptooltoextractpulmonaryfunctiontestpftreportsfromstructuredandsemistructuredveteranaffairsvadata AT lengjianwei performanceofanaturallanguageprocessingnlptooltoextractpulmonaryfunctiontestpftreportsfromstructuredandsemistructuredveteranaffairsvadata AT luchaochin performanceofanaturallanguageprocessingnlptooltoextractpulmonaryfunctiontestpftreportsfromstructuredandsemistructuredveteranaffairsvadata AT hetao performanceofanaturallanguageprocessingnlptooltoextractpulmonaryfunctiontestpftreportsfromstructuredandsemistructuredveteranaffairsvadata AT tengchiachen performanceofanaturallanguageprocessingnlptooltoextractpulmonaryfunctiontestpftreportsfromstructuredandsemistructuredveteranaffairsvadata AT sullivanpatrick performanceofanaturallanguageprocessingnlptooltoextractpulmonaryfunctiontestpftreportsfromstructuredandsemistructuredveteranaffairsvadata AT zengqing performanceofanaturallanguageprocessingnlptooltoextractpulmonaryfunctiontestpftreportsfromstructuredandsemistructuredveteranaffairsvadata

Performance of a Natural Language Processing (NLP) Tool to Extract Pulmonary Function Test (PFT) Reports from Structured and Semistructured Veteran Affairs (VA) Data

Ejemplares similares