Cargando…

Using case-level context to classify cancer pathology reports

Individual electronic health records (EHRs) and clinical reports are often part of a larger sequence—for example, a single patient may generate multiple reports over the trajectory of a disease. In applications such as cancer pathology reports, it is necessary not only to extract information from in...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Shang, Alawad, Mohammed, Schaefferkoetter, Noah, Penberthy, Lynne, Wu, Xiao-Cheng, Durbin, Eric B., Coyle, Linda, Ramanathan, Arvind, Tourassi, Georgia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7217446/
https://www.ncbi.nlm.nih.gov/pubmed/32396579
http://dx.doi.org/10.1371/journal.pone.0232840
_version_ 1783532599668375552
author Gao, Shang
Alawad, Mohammed
Schaefferkoetter, Noah
Penberthy, Lynne
Wu, Xiao-Cheng
Durbin, Eric B.
Coyle, Linda
Ramanathan, Arvind
Tourassi, Georgia
author_facet Gao, Shang
Alawad, Mohammed
Schaefferkoetter, Noah
Penberthy, Lynne
Wu, Xiao-Cheng
Durbin, Eric B.
Coyle, Linda
Ramanathan, Arvind
Tourassi, Georgia
author_sort Gao, Shang
collection PubMed
description Individual electronic health records (EHRs) and clinical reports are often part of a larger sequence—for example, a single patient may generate multiple reports over the trajectory of a disease. In applications such as cancer pathology reports, it is necessary not only to extract information from individual reports, but also to capture aggregate information regarding the entire cancer case based off case-level context from all reports in the sequence. In this paper, we introduce a simple modular add-on for capturing case-level context that is designed to be compatible with most existing deep learning architectures for text classification on individual reports. We test our approach on a corpus of 431,433 cancer pathology reports, and we show that incorporating case-level context significantly boosts classification accuracy across six classification tasks—site, subsite, laterality, histology, behavior, and grade. We expect that with minimal modifications, our add-on can be applied towards a wide range of other clinical text-based tasks.
format Online
Article
Text
id pubmed-7217446
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-72174462020-05-26 Using case-level context to classify cancer pathology reports Gao, Shang Alawad, Mohammed Schaefferkoetter, Noah Penberthy, Lynne Wu, Xiao-Cheng Durbin, Eric B. Coyle, Linda Ramanathan, Arvind Tourassi, Georgia PLoS One Research Article Individual electronic health records (EHRs) and clinical reports are often part of a larger sequence—for example, a single patient may generate multiple reports over the trajectory of a disease. In applications such as cancer pathology reports, it is necessary not only to extract information from individual reports, but also to capture aggregate information regarding the entire cancer case based off case-level context from all reports in the sequence. In this paper, we introduce a simple modular add-on for capturing case-level context that is designed to be compatible with most existing deep learning architectures for text classification on individual reports. We test our approach on a corpus of 431,433 cancer pathology reports, and we show that incorporating case-level context significantly boosts classification accuracy across six classification tasks—site, subsite, laterality, histology, behavior, and grade. We expect that with minimal modifications, our add-on can be applied towards a wide range of other clinical text-based tasks. Public Library of Science 2020-05-12 /pmc/articles/PMC7217446/ /pubmed/32396579 http://dx.doi.org/10.1371/journal.pone.0232840 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Gao, Shang
Alawad, Mohammed
Schaefferkoetter, Noah
Penberthy, Lynne
Wu, Xiao-Cheng
Durbin, Eric B.
Coyle, Linda
Ramanathan, Arvind
Tourassi, Georgia
Using case-level context to classify cancer pathology reports
title Using case-level context to classify cancer pathology reports
title_full Using case-level context to classify cancer pathology reports
title_fullStr Using case-level context to classify cancer pathology reports
title_full_unstemmed Using case-level context to classify cancer pathology reports
title_short Using case-level context to classify cancer pathology reports
title_sort using case-level context to classify cancer pathology reports
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7217446/
https://www.ncbi.nlm.nih.gov/pubmed/32396579
http://dx.doi.org/10.1371/journal.pone.0232840
work_keys_str_mv AT gaoshang usingcaselevelcontexttoclassifycancerpathologyreports
AT alawadmohammed usingcaselevelcontexttoclassifycancerpathologyreports
AT schaefferkoetternoah usingcaselevelcontexttoclassifycancerpathologyreports
AT penberthylynne usingcaselevelcontexttoclassifycancerpathologyreports
AT wuxiaocheng usingcaselevelcontexttoclassifycancerpathologyreports
AT durbinericb usingcaselevelcontexttoclassifycancerpathologyreports
AT coylelinda usingcaselevelcontexttoclassifycancerpathologyreports
AT ramanathanarvind usingcaselevelcontexttoclassifycancerpathologyreports
AT tourassigeorgia usingcaselevelcontexttoclassifycancerpathologyreports