Cargando…

Using case-level context to classify cancer pathology reports

Individual electronic health records (EHRs) and clinical reports are often part of a larger sequence—for example, a single patient may generate multiple reports over the trajectory of a disease. In applications such as cancer pathology reports, it is necessary not only to extract information from in...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Shang, Alawad, Mohammed, Schaefferkoetter, Noah, Penberthy, Lynne, Wu, Xiao-Cheng, Durbin, Eric B., Coyle, Linda, Ramanathan, Arvind, Tourassi, Georgia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7217446/
https://www.ncbi.nlm.nih.gov/pubmed/32396579
http://dx.doi.org/10.1371/journal.pone.0232840
Descripción
Sumario:Individual electronic health records (EHRs) and clinical reports are often part of a larger sequence—for example, a single patient may generate multiple reports over the trajectory of a disease. In applications such as cancer pathology reports, it is necessary not only to extract information from individual reports, but also to capture aggregate information regarding the entire cancer case based off case-level context from all reports in the sequence. In this paper, we introduce a simple modular add-on for capturing case-level context that is designed to be compatible with most existing deep learning architectures for text classification on individual reports. We test our approach on a corpus of 431,433 cancer pathology reports, and we show that incorporating case-level context significantly boosts classification accuracy across six classification tasks—site, subsite, laterality, histology, behavior, and grade. We expect that with minimal modifications, our add-on can be applied towards a wide range of other clinical text-based tasks.