Cargando…

Natural Language Processing in Diagnostic Texts from Nephropathology

Introduction: This study investigates whether it is possible to predict a final diagnosis based on a written nephropathological description—as a surrogate for image analysis—using various NLP methods. Methods: For this work, 1107 unlabelled nephropathological reports were included. (i) First, after...

Descripción completa

Detalles Bibliográficos
Autores principales: Legnar, Maximilian, Daumke, Philipp, Hesser, Jürgen, Porubsky, Stefan, Popovic, Zoran, Bindzus, Jan Niklas, Siemoneit, Joern-Helge Heinrich, Weis, Cleo-Aron
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9325286/
https://www.ncbi.nlm.nih.gov/pubmed/35885630
http://dx.doi.org/10.3390/diagnostics12071726
_version_ 1784757012667564032
author Legnar, Maximilian
Daumke, Philipp
Hesser, Jürgen
Porubsky, Stefan
Popovic, Zoran
Bindzus, Jan Niklas
Siemoneit, Joern-Helge Heinrich
Weis, Cleo-Aron
author_facet Legnar, Maximilian
Daumke, Philipp
Hesser, Jürgen
Porubsky, Stefan
Popovic, Zoran
Bindzus, Jan Niklas
Siemoneit, Joern-Helge Heinrich
Weis, Cleo-Aron
author_sort Legnar, Maximilian
collection PubMed
description Introduction: This study investigates whether it is possible to predict a final diagnosis based on a written nephropathological description—as a surrogate for image analysis—using various NLP methods. Methods: For this work, 1107 unlabelled nephropathological reports were included. (i) First, after separating each report into its microscopic description and diagnosis section, the diagnosis sections were clustered unsupervised to less than 20 diagnostic groups using different clustering techniques. (ii) Second, different text classification methods were used to predict the diagnostic group based on the microscopic description section. Results: The best clustering results (i) could be achieved with HDBSCAN, using BoW-based feature extraction methods. Based on keywords, these clusters can be mapped to certain diagnostic groups. A transformer encoder-based approach as well as an SVM worked best regarding diagnosis prediction based on the histomorphological description (ii). Certain diagnosis groups reached F1-scores of up to 0.892 while others achieved weak classification metrics. Conclusion: While textual morphological description alone enables retrieving the correct diagnosis for some entities, it does not work sufficiently for other entities. This is in accordance with a previous image analysis study on glomerular change patterns, where some diagnoses are associated with one pattern, but for others, there exists a complex pattern combination.
format Online
Article
Text
id pubmed-9325286
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-93252862022-07-27 Natural Language Processing in Diagnostic Texts from Nephropathology Legnar, Maximilian Daumke, Philipp Hesser, Jürgen Porubsky, Stefan Popovic, Zoran Bindzus, Jan Niklas Siemoneit, Joern-Helge Heinrich Weis, Cleo-Aron Diagnostics (Basel) Article Introduction: This study investigates whether it is possible to predict a final diagnosis based on a written nephropathological description—as a surrogate for image analysis—using various NLP methods. Methods: For this work, 1107 unlabelled nephropathological reports were included. (i) First, after separating each report into its microscopic description and diagnosis section, the diagnosis sections were clustered unsupervised to less than 20 diagnostic groups using different clustering techniques. (ii) Second, different text classification methods were used to predict the diagnostic group based on the microscopic description section. Results: The best clustering results (i) could be achieved with HDBSCAN, using BoW-based feature extraction methods. Based on keywords, these clusters can be mapped to certain diagnostic groups. A transformer encoder-based approach as well as an SVM worked best regarding diagnosis prediction based on the histomorphological description (ii). Certain diagnosis groups reached F1-scores of up to 0.892 while others achieved weak classification metrics. Conclusion: While textual morphological description alone enables retrieving the correct diagnosis for some entities, it does not work sufficiently for other entities. This is in accordance with a previous image analysis study on glomerular change patterns, where some diagnoses are associated with one pattern, but for others, there exists a complex pattern combination. MDPI 2022-07-15 /pmc/articles/PMC9325286/ /pubmed/35885630 http://dx.doi.org/10.3390/diagnostics12071726 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Legnar, Maximilian
Daumke, Philipp
Hesser, Jürgen
Porubsky, Stefan
Popovic, Zoran
Bindzus, Jan Niklas
Siemoneit, Joern-Helge Heinrich
Weis, Cleo-Aron
Natural Language Processing in Diagnostic Texts from Nephropathology
title Natural Language Processing in Diagnostic Texts from Nephropathology
title_full Natural Language Processing in Diagnostic Texts from Nephropathology
title_fullStr Natural Language Processing in Diagnostic Texts from Nephropathology
title_full_unstemmed Natural Language Processing in Diagnostic Texts from Nephropathology
title_short Natural Language Processing in Diagnostic Texts from Nephropathology
title_sort natural language processing in diagnostic texts from nephropathology
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9325286/
https://www.ncbi.nlm.nih.gov/pubmed/35885630
http://dx.doi.org/10.3390/diagnostics12071726
work_keys_str_mv AT legnarmaximilian naturallanguageprocessingindiagnostictextsfromnephropathology
AT daumkephilipp naturallanguageprocessingindiagnostictextsfromnephropathology
AT hesserjurgen naturallanguageprocessingindiagnostictextsfromnephropathology
AT porubskystefan naturallanguageprocessingindiagnostictextsfromnephropathology
AT popoviczoran naturallanguageprocessingindiagnostictextsfromnephropathology
AT bindzusjanniklas naturallanguageprocessingindiagnostictextsfromnephropathology
AT siemoneitjoernhelgeheinrich naturallanguageprocessingindiagnostictextsfromnephropathology
AT weiscleoaron naturallanguageprocessingindiagnostictextsfromnephropathology