
Identification of pneumonia and influenza deaths using the death certificate pipeline

BACKGROUND: Death records are a rich source of data, which can be used to assist with public surveillance and/or decision support. However, to use this type of data for such purposes it has to be transformed into a coded format to make it computable. Because the cause of death in the certificates is...


Bibliographic Details
Main authors: Davis, Kailah; Staes, Catherine; Duncan, Jeff; Igo, Sean; Facelli, Julio C
Format: Online Article Text
Language: English
Published: BioMed Central 2012
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3444937/
https://www.ncbi.nlm.nih.gov/pubmed/22569097
http://dx.doi.org/10.1186/1472-6947-12-37
author Davis, Kailah
Staes, Catherine
Duncan, Jeff
Igo, Sean
Facelli, Julio C
collection PubMed
description BACKGROUND: Death records are a rich source of data, which can be used to assist with public surveillance and/or decision support. However, to use this type of data for such purposes, it has to be transformed into a coded format to make it computable. Because the cause of death in the certificates is reported as free text, encoding the data is currently the single largest barrier to using death certificates for surveillance. Therefore, the purpose of this study was to demonstrate the feasibility of using a pipeline, composed of a detection rule and a natural language processor, for the real-time encoding of death certificates, using the identification of pneumonia and influenza cases as an example and demonstrating that its accuracy is comparable to existing methods. RESULTS: A Death Certificates Pipeline (DCP) was developed to automatically code death certificates and identify pneumonia and influenza cases. The pipeline used MetaMap to code death certificates from the Utah Department of Health for the year 2008. The output of MetaMap was then accessed by detection rules, which flagged pneumonia and influenza cases based on the Centers for Disease Control and Prevention (CDC) case definition. The output from the DCP was compared with the current method used by the CDC and with a keyword search. Recall, precision, positive predictive value and F-measure with respect to the CDC method were calculated for the two other methods considered here. Compared with the CDC method, the two techniques showed the following recall/precision results: DCP: 0.998/0.98 and keyword searching: 0.96/0.96. The F-measures were 0.99 and 0.96, respectively (DCP and keyword searching). Both the keyword search and the DCP can run in interactive form with modest computer resources, but the DCP showed superior performance. CONCLUSION: The pipeline proposed here for coding death certificates and detecting cases is feasible and can be extended to other conditions. This method provides an alternative that allows free-text death certificates to be coded in real time, which may increase their utilization not only in the public health domain but also by biomedical researchers and developers. TRIAL REGISTRATION: This study did not involve any clinical trials.
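The F-measures quoted in the abstract can be checked against the reported recall/precision pairs. The sketch below assumes the balanced F-measure (F1, the harmonic mean of precision and recall); the abstract does not state which variant the authors used, and the function and dictionary names are illustrative only.

```python
def f_measure(precision: float, recall: float) -> float:
    """Balanced F-measure (F1): harmonic mean of precision and recall.

    Assumption: the abstract does not say which F-measure variant was used;
    F1 is used here because it is the most common choice.
    """
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Recall/precision pairs as reported in the abstract (relative to the CDC method).
reported = {
    "DCP": {"recall": 0.998, "precision": 0.98},
    "keyword search": {"recall": 0.96, "precision": 0.96},
}

# Compute the F-measure for each method from its reported pair.
f_scores = {
    name: f_measure(m["precision"], m["recall"]) for name, m in reported.items()
}

for name, score in f_scores.items():
    print(f"{name}: F-measure = {score:.2f}")
```

Under this assumption, the computed values round to the 0.99 (DCP) and 0.96 (keyword search) figures given in the abstract.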
format Online
Article
Text
id pubmed-3444937
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-3444937 2012-09-19 Identification of pneumonia and influenza deaths using the death certificate pipeline Davis, Kailah; Staes, Catherine; Duncan, Jeff; Igo, Sean; Facelli, Julio C BMC Med Inform Decis Mak Research Article BioMed Central 2012-05-08 /pmc/articles/PMC3444937/ /pubmed/22569097 http://dx.doi.org/10.1186/1472-6947-12-37 Text en Copyright ©2012 Davis et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Identification of pneumonia and influenza deaths using the death certificate pipeline
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3444937/
https://www.ncbi.nlm.nih.gov/pubmed/22569097
http://dx.doi.org/10.1186/1472-6947-12-37