Identification of pneumonia and influenza deaths using the death certificate pipeline
BACKGROUND: Death records are a rich source of data, which can be used to assist with public surveillance and/or decision support. However, to use this type of data for such purposes it has to be transformed into a coded format to make it computable. Because the cause of death in the certificates is...
Main authors: | Davis, Kailah; Staes, Catherine; Duncan, Jeff; Igo, Sean; Facelli, Julio C |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2012 |
Subjects: | Research Article |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3444937/ https://www.ncbi.nlm.nih.gov/pubmed/22569097 http://dx.doi.org/10.1186/1472-6947-12-37 |
_version_ | 1782243743766675456 |
---|---|
author | Davis, Kailah Staes, Catherine Duncan, Jeff Igo, Sean Facelli, Julio C |
author_sort | Davis, Kailah |
collection | PubMed |
description | BACKGROUND: Death records are a rich source of data, which can be used to assist with public surveillance and/or decision support. However, to use this type of data for such purposes it has to be transformed into a coded format to make it computable. Because the cause of death in the certificates is reported as free text, encoding the data is currently the single largest barrier to using death certificates for surveillance. Therefore, the purpose of this study was to demonstrate the feasibility of using a pipeline, composed of a detection rule and a natural language processor, for the real-time encoding of death certificates, using the identification of pneumonia and influenza cases as an example, and to demonstrate that its accuracy is comparable to that of existing methods. RESULTS: A Death Certificates Pipeline (DCP) was developed to automatically code death certificates and identify pneumonia and influenza cases. The pipeline used MetaMap to code death certificates from the Utah Department of Health for the year 2008. The output of MetaMap was then accessed by detection rules, which flagged pneumonia and influenza cases based on the Centers for Disease Control and Prevention (CDC) case definition. The output from the DCP was compared with the current method used by the CDC and with a keyword search. Recall, precision, positive predictive value and F-measure with respect to the CDC method were calculated for the two other methods considered here. The two techniques compared with the CDC method showed the following recall/precision results: DCP: 0.998/0.98 and keyword searching: 0.96/0.96. The F-measures were 0.99 and 0.96, respectively (DCP and keyword searching). Both the keyword search and the DCP can run in interactive form with modest computer resources, but the DCP showed superior performance. CONCLUSION: The pipeline proposed here for coding death certificates and detecting cases is feasible and can be extended to other conditions. This method provides an alternative that allows free-text death certificates to be coded in real time, which may increase their utilization not only in the public health domain but also by biomedical researchers and developers. TRIAL REGISTRATION: This study did not involve any clinical trials. |
format | Online Article Text |
id | pubmed-3444937 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-3444937 2012-09-19 Identification of pneumonia and influenza deaths using the death certificate pipeline Davis, Kailah Staes, Catherine Duncan, Jeff Igo, Sean Facelli, Julio C BMC Med Inform Decis Mak Research Article BioMed Central 2012-05-08 /pmc/articles/PMC3444937/ /pubmed/22569097 http://dx.doi.org/10.1186/1472-6947-12-37 Text en Copyright ©2012 Davis et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Davis, Kailah Staes, Catherine Duncan, Jeff Igo, Sean Facelli, Julio C Identification of pneumonia and influenza deaths using the death certificate pipeline |
title | Identification of pneumonia and influenza deaths using the death certificate pipeline |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3444937/ https://www.ncbi.nlm.nih.gov/pubmed/22569097 http://dx.doi.org/10.1186/1472-6947-12-37 |
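
The F-measures quoted in the abstract can be reproduced from the reported recall/precision pairs. The sketch below assumes the balanced F-measure (F1, the harmonic mean of precision and recall); the abstract does not state which variant was used, and the function and variable names are illustrative only.

```python
def f_measure(precision: float, recall: float) -> float:
    """Balanced F-measure (F1): harmonic mean of precision and recall.

    Assumption: the record does not say which F-measure variant the
    authors used; F1 is the common default.
    """
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Recall/precision pairs as reported in the abstract (relative to the CDC method).
reported = {
    "DCP": {"recall": 0.998, "precision": 0.98},
    "keyword search": {"recall": 0.96, "precision": 0.96},
}

scores = {
    name: f_measure(vals["precision"], vals["recall"])
    for name, vals in reported.items()
}

for name, score in scores.items():
    print(f"{name}: F = {score:.2f}")
```

Under this assumption the rounded values agree with the 0.99 (DCP) and 0.96 (keyword search) given in the abstract.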