Cargando…
Using case-level context to classify cancer pathology reports
Individual electronic health records (EHRs) and clinical reports are often part of a larger sequence—for example, a single patient may generate multiple reports over the trajectory of a disease. In applications such as cancer pathology reports, it is necessary not only to extract information from in...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7217446/ https://www.ncbi.nlm.nih.gov/pubmed/32396579 http://dx.doi.org/10.1371/journal.pone.0232840 |
_version_ | 1783532599668375552 |
---|---|
author | Gao, Shang Alawad, Mohammed Schaefferkoetter, Noah Penberthy, Lynne Wu, Xiao-Cheng Durbin, Eric B. Coyle, Linda Ramanathan, Arvind Tourassi, Georgia |
author_facet | Gao, Shang Alawad, Mohammed Schaefferkoetter, Noah Penberthy, Lynne Wu, Xiao-Cheng Durbin, Eric B. Coyle, Linda Ramanathan, Arvind Tourassi, Georgia |
author_sort | Gao, Shang |
collection | PubMed |
description | Individual electronic health records (EHRs) and clinical reports are often part of a larger sequence—for example, a single patient may generate multiple reports over the trajectory of a disease. In applications such as cancer pathology reports, it is necessary not only to extract information from individual reports, but also to capture aggregate information regarding the entire cancer case based off case-level context from all reports in the sequence. In this paper, we introduce a simple modular add-on for capturing case-level context that is designed to be compatible with most existing deep learning architectures for text classification on individual reports. We test our approach on a corpus of 431,433 cancer pathology reports, and we show that incorporating case-level context significantly boosts classification accuracy across six classification tasks—site, subsite, laterality, histology, behavior, and grade. We expect that with minimal modifications, our add-on can be applied towards a wide range of other clinical text-based tasks. |
format | Online Article Text |
id | pubmed-7217446 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-72174462020-05-26 Using case-level context to classify cancer pathology reports Gao, Shang Alawad, Mohammed Schaefferkoetter, Noah Penberthy, Lynne Wu, Xiao-Cheng Durbin, Eric B. Coyle, Linda Ramanathan, Arvind Tourassi, Georgia PLoS One Research Article Individual electronic health records (EHRs) and clinical reports are often part of a larger sequence—for example, a single patient may generate multiple reports over the trajectory of a disease. In applications such as cancer pathology reports, it is necessary not only to extract information from individual reports, but also to capture aggregate information regarding the entire cancer case based off case-level context from all reports in the sequence. In this paper, we introduce a simple modular add-on for capturing case-level context that is designed to be compatible with most existing deep learning architectures for text classification on individual reports. We test our approach on a corpus of 431,433 cancer pathology reports, and we show that incorporating case-level context significantly boosts classification accuracy across six classification tasks—site, subsite, laterality, histology, behavior, and grade. We expect that with minimal modifications, our add-on can be applied towards a wide range of other clinical text-based tasks. Public Library of Science 2020-05-12 /pmc/articles/PMC7217446/ /pubmed/32396579 http://dx.doi.org/10.1371/journal.pone.0232840 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication. |
spellingShingle | Research Article Gao, Shang Alawad, Mohammed Schaefferkoetter, Noah Penberthy, Lynne Wu, Xiao-Cheng Durbin, Eric B. Coyle, Linda Ramanathan, Arvind Tourassi, Georgia Using case-level context to classify cancer pathology reports |
title | Using case-level context to classify cancer pathology reports |
title_full | Using case-level context to classify cancer pathology reports |
title_fullStr | Using case-level context to classify cancer pathology reports |
title_full_unstemmed | Using case-level context to classify cancer pathology reports |
title_short | Using case-level context to classify cancer pathology reports |
title_sort | using case-level context to classify cancer pathology reports |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7217446/ https://www.ncbi.nlm.nih.gov/pubmed/32396579 http://dx.doi.org/10.1371/journal.pone.0232840 |
work_keys_str_mv | AT gaoshang usingcaselevelcontexttoclassifycancerpathologyreports AT alawadmohammed usingcaselevelcontexttoclassifycancerpathologyreports AT schaefferkoetternoah usingcaselevelcontexttoclassifycancerpathologyreports AT penberthylynne usingcaselevelcontexttoclassifycancerpathologyreports AT wuxiaocheng usingcaselevelcontexttoclassifycancerpathologyreports AT durbinericb usingcaselevelcontexttoclassifycancerpathologyreports AT coylelinda usingcaselevelcontexttoclassifycancerpathologyreports AT ramanathanarvind usingcaselevelcontexttoclassifycancerpathologyreports AT tourassigeorgia usingcaselevelcontexttoclassifycancerpathologyreports |