Cargando…

Automated Generation of Synoptic Reports from Narrative Pathology Reports in University Malaya Medical Centre Using Natural Language Processing

Pathology reports represent a primary source of information for cancer registries. University Malaya Medical Centre (UMMC) is a tertiary hospital responsible for training pathologists; thus narrative reporting becomes important. However, the unstructured free-text reports made the information extrac...

Descripción completa

Detalles Bibliográficos
Autores principales: Tan, Wee-Ming, Teoh, Kean-Hooi, Ganggayah, Mogana Darshini, Taib, Nur Aishah, Zaini, Hana Salwani, Dhillon, Sarinder Kaur
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9027647/
https://www.ncbi.nlm.nih.gov/pubmed/35453927
http://dx.doi.org/10.3390/diagnostics12040879
_version_ 1784691419138490368
author Tan, Wee-Ming
Teoh, Kean-Hooi
Ganggayah, Mogana Darshini
Taib, Nur Aishah
Zaini, Hana Salwani
Dhillon, Sarinder Kaur
author_facet Tan, Wee-Ming
Teoh, Kean-Hooi
Ganggayah, Mogana Darshini
Taib, Nur Aishah
Zaini, Hana Salwani
Dhillon, Sarinder Kaur
author_sort Tan, Wee-Ming
collection PubMed
description Pathology reports represent a primary source of information for cancer registries. University Malaya Medical Centre (UMMC) is a tertiary hospital responsible for training pathologists; thus narrative reporting becomes important. However, the unstructured free-text reports made the information extraction process tedious for clinical audits and data analysis-related research. This study aims to develop an automated natural language processing (NLP) algorithm to summarize the existing narrative breast pathology report from UMMC to a narrower structured synoptic pathology report with a checklist-style report template to ease the creation of pathology reports. The development of the rule-based NLP algorithm was based on the R programming language by using 593 pathology specimens from 174 patients provided by the Department of Pathology, UMMC. The pathologist provides specific keywords for data elements to define the semantic rules of the NLP. The system was evaluated by calculating the precision, recall, and F1-score. The proposed NLP algorithm achieved a micro-F1 score of 99.50% and a macro-F1 score of 98.97% on 178 specimens with 25 data elements. This achievement correlated to clinicians’ needs, which could improve communication between pathologists and clinicians. The study presented here is significant, as structured data is easily minable and could generate important insights.
format Online
Article
Text
id pubmed-9027647
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-90276472022-04-23 Automated Generation of Synoptic Reports from Narrative Pathology Reports in University Malaya Medical Centre Using Natural Language Processing Tan, Wee-Ming Teoh, Kean-Hooi Ganggayah, Mogana Darshini Taib, Nur Aishah Zaini, Hana Salwani Dhillon, Sarinder Kaur Diagnostics (Basel) Article Pathology reports represent a primary source of information for cancer registries. University Malaya Medical Centre (UMMC) is a tertiary hospital responsible for training pathologists; thus narrative reporting becomes important. However, the unstructured free-text reports made the information extraction process tedious for clinical audits and data analysis-related research. This study aims to develop an automated natural language processing (NLP) algorithm to summarize the existing narrative breast pathology report from UMMC to a narrower structured synoptic pathology report with a checklist-style report template to ease the creation of pathology reports. The development of the rule-based NLP algorithm was based on the R programming language by using 593 pathology specimens from 174 patients provided by the Department of Pathology, UMMC. The pathologist provides specific keywords for data elements to define the semantic rules of the NLP. The system was evaluated by calculating the precision, recall, and F1-score. The proposed NLP algorithm achieved a micro-F1 score of 99.50% and a macro-F1 score of 98.97% on 178 specimens with 25 data elements. This achievement correlated to clinicians’ needs, which could improve communication between pathologists and clinicians. The study presented here is significant, as structured data is easily minable and could generate important insights. MDPI 2022-04-01 /pmc/articles/PMC9027647/ /pubmed/35453927 http://dx.doi.org/10.3390/diagnostics12040879 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Tan, Wee-Ming
Teoh, Kean-Hooi
Ganggayah, Mogana Darshini
Taib, Nur Aishah
Zaini, Hana Salwani
Dhillon, Sarinder Kaur
Automated Generation of Synoptic Reports from Narrative Pathology Reports in University Malaya Medical Centre Using Natural Language Processing
title Automated Generation of Synoptic Reports from Narrative Pathology Reports in University Malaya Medical Centre Using Natural Language Processing
title_full Automated Generation of Synoptic Reports from Narrative Pathology Reports in University Malaya Medical Centre Using Natural Language Processing
title_fullStr Automated Generation of Synoptic Reports from Narrative Pathology Reports in University Malaya Medical Centre Using Natural Language Processing
title_full_unstemmed Automated Generation of Synoptic Reports from Narrative Pathology Reports in University Malaya Medical Centre Using Natural Language Processing
title_short Automated Generation of Synoptic Reports from Narrative Pathology Reports in University Malaya Medical Centre Using Natural Language Processing
title_sort automated generation of synoptic reports from narrative pathology reports in university malaya medical centre using natural language processing
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9027647/
https://www.ncbi.nlm.nih.gov/pubmed/35453927
http://dx.doi.org/10.3390/diagnostics12040879
work_keys_str_mv AT tanweeming automatedgenerationofsynopticreportsfromnarrativepathologyreportsinuniversitymalayamedicalcentreusingnaturallanguageprocessing
AT teohkeanhooi automatedgenerationofsynopticreportsfromnarrativepathologyreportsinuniversitymalayamedicalcentreusingnaturallanguageprocessing
AT ganggayahmoganadarshini automatedgenerationofsynopticreportsfromnarrativepathologyreportsinuniversitymalayamedicalcentreusingnaturallanguageprocessing
AT taibnuraishah automatedgenerationofsynopticreportsfromnarrativepathologyreportsinuniversitymalayamedicalcentreusingnaturallanguageprocessing
AT zainihanasalwani automatedgenerationofsynopticreportsfromnarrativepathologyreportsinuniversitymalayamedicalcentreusingnaturallanguageprocessing
AT dhillonsarinderkaur automatedgenerationofsynopticreportsfromnarrativepathologyreportsinuniversitymalayamedicalcentreusingnaturallanguageprocessing