Cargando…

Facilitating Surveillance of Pulmonary Invasive Mold Diseases in Patients with Haematological Malignancies by Screening Computed Tomography Reports Using Natural Language Processing

PURPOSE: Prospective surveillance of invasive mold diseases (IMDs) in haematology patients should be standard of care but is hampered by the absence of a reliable laboratory prompt and the difficulty of manual surveillance. We used a high throughput technology, natural language processing (NLP), to...

Descripción completa

Detalles Bibliográficos
Autores principales: Ananda-Rajah, Michelle R., Martinez, David, Slavin, Monica A., Cavedon, Lawrence, Dooley, Michael, Cheng, Allen, Thursky, Karin A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4175456/
https://www.ncbi.nlm.nih.gov/pubmed/25250675
http://dx.doi.org/10.1371/journal.pone.0107797
_version_ 1782336485550194688
author Ananda-Rajah, Michelle R.
Martinez, David
Slavin, Monica A.
Cavedon, Lawrence
Dooley, Michael
Cheng, Allen
Thursky, Karin A.
author_facet Ananda-Rajah, Michelle R.
Martinez, David
Slavin, Monica A.
Cavedon, Lawrence
Dooley, Michael
Cheng, Allen
Thursky, Karin A.
author_sort Ananda-Rajah, Michelle R.
collection PubMed
description PURPOSE: Prospective surveillance of invasive mold diseases (IMDs) in haematology patients should be standard of care but is hampered by the absence of a reliable laboratory prompt and the difficulty of manual surveillance. We used a high throughput technology, natural language processing (NLP), to develop a classifier based on machine learning techniques to screen computed tomography (CT) reports supportive for IMDs. PATIENTS AND METHODS: We conducted a retrospective case-control study of CT reports from the clinical encounter and up to 12-weeks after, from a random subset of 79 of 270 case patients with 33 probable/proven IMDs by international definitions, and 68 of 257 uninfected-control patients identified from 3 tertiary haematology centres. The classifier was trained and tested on a reference standard of 449 physician annotated reports including a development subset (n = 366), from a total of 1880 reports, using 10-fold cross validation, comparing binary and probabilistic predictions to the reference standard to generate sensitivity, specificity and area under the receiver-operating-curve (ROC). RESULTS: For the development subset, sensitivity/specificity was 91% (95%CI 86% to 94%)/79% (95%CI 71% to 84%) and ROC area was 0.92 (95%CI 89% to 94%). Of 25 (5.6%) missed notifications, only 4 (0.9%) reports were regarded as clinically significant. CONCLUSION: CT reports are a readily available and timely resource that may be exploited by NLP to facilitate continuous prospective IMD surveillance with translational benefits beyond surveillance alone.
format Online
Article
Text
id pubmed-4175456
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-41754562014-10-02 Facilitating Surveillance of Pulmonary Invasive Mold Diseases in Patients with Haematological Malignancies by Screening Computed Tomography Reports Using Natural Language Processing Ananda-Rajah, Michelle R. Martinez, David Slavin, Monica A. Cavedon, Lawrence Dooley, Michael Cheng, Allen Thursky, Karin A. PLoS One Research Article PURPOSE: Prospective surveillance of invasive mold diseases (IMDs) in haematology patients should be standard of care but is hampered by the absence of a reliable laboratory prompt and the difficulty of manual surveillance. We used a high throughput technology, natural language processing (NLP), to develop a classifier based on machine learning techniques to screen computed tomography (CT) reports supportive for IMDs. PATIENTS AND METHODS: We conducted a retrospective case-control study of CT reports from the clinical encounter and up to 12-weeks after, from a random subset of 79 of 270 case patients with 33 probable/proven IMDs by international definitions, and 68 of 257 uninfected-control patients identified from 3 tertiary haematology centres. The classifier was trained and tested on a reference standard of 449 physician annotated reports including a development subset (n = 366), from a total of 1880 reports, using 10-fold cross validation, comparing binary and probabilistic predictions to the reference standard to generate sensitivity, specificity and area under the receiver-operating-curve (ROC). RESULTS: For the development subset, sensitivity/specificity was 91% (95%CI 86% to 94%)/79% (95%CI 71% to 84%) and ROC area was 0.92 (95%CI 89% to 94%). Of 25 (5.6%) missed notifications, only 4 (0.9%) reports were regarded as clinically significant. CONCLUSION: CT reports are a readily available and timely resource that may be exploited by NLP to facilitate continuous prospective IMD surveillance with translational benefits beyond surveillance alone. Public Library of Science 2014-09-24 /pmc/articles/PMC4175456/ /pubmed/25250675 http://dx.doi.org/10.1371/journal.pone.0107797 Text en © 2014 Ananda-Rajah et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Ananda-Rajah, Michelle R.
Martinez, David
Slavin, Monica A.
Cavedon, Lawrence
Dooley, Michael
Cheng, Allen
Thursky, Karin A.
Facilitating Surveillance of Pulmonary Invasive Mold Diseases in Patients with Haematological Malignancies by Screening Computed Tomography Reports Using Natural Language Processing
title Facilitating Surveillance of Pulmonary Invasive Mold Diseases in Patients with Haematological Malignancies by Screening Computed Tomography Reports Using Natural Language Processing
title_full Facilitating Surveillance of Pulmonary Invasive Mold Diseases in Patients with Haematological Malignancies by Screening Computed Tomography Reports Using Natural Language Processing
title_fullStr Facilitating Surveillance of Pulmonary Invasive Mold Diseases in Patients with Haematological Malignancies by Screening Computed Tomography Reports Using Natural Language Processing
title_full_unstemmed Facilitating Surveillance of Pulmonary Invasive Mold Diseases in Patients with Haematological Malignancies by Screening Computed Tomography Reports Using Natural Language Processing
title_short Facilitating Surveillance of Pulmonary Invasive Mold Diseases in Patients with Haematological Malignancies by Screening Computed Tomography Reports Using Natural Language Processing
title_sort facilitating surveillance of pulmonary invasive mold diseases in patients with haematological malignancies by screening computed tomography reports using natural language processing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4175456/
https://www.ncbi.nlm.nih.gov/pubmed/25250675
http://dx.doi.org/10.1371/journal.pone.0107797
work_keys_str_mv AT anandarajahmicheller facilitatingsurveillanceofpulmonaryinvasivemolddiseasesinpatientswithhaematologicalmalignanciesbyscreeningcomputedtomographyreportsusingnaturallanguageprocessing
AT martinezdavid facilitatingsurveillanceofpulmonaryinvasivemolddiseasesinpatientswithhaematologicalmalignanciesbyscreeningcomputedtomographyreportsusingnaturallanguageprocessing
AT slavinmonicaa facilitatingsurveillanceofpulmonaryinvasivemolddiseasesinpatientswithhaematologicalmalignanciesbyscreeningcomputedtomographyreportsusingnaturallanguageprocessing
AT cavedonlawrence facilitatingsurveillanceofpulmonaryinvasivemolddiseasesinpatientswithhaematologicalmalignanciesbyscreeningcomputedtomographyreportsusingnaturallanguageprocessing
AT dooleymichael facilitatingsurveillanceofpulmonaryinvasivemolddiseasesinpatientswithhaematologicalmalignanciesbyscreeningcomputedtomographyreportsusingnaturallanguageprocessing
AT chengallen facilitatingsurveillanceofpulmonaryinvasivemolddiseasesinpatientswithhaematologicalmalignanciesbyscreeningcomputedtomographyreportsusingnaturallanguageprocessing
AT thurskykarina facilitatingsurveillanceofpulmonaryinvasivemolddiseasesinpatientswithhaematologicalmalignanciesbyscreeningcomputedtomographyreportsusingnaturallanguageprocessing