Cargando…

The added value of text from Dutch general practitioner notes in predictive modeling

OBJECTIVE: This work aims to explore the value of Dutch unstructured data, in combination with structured data, for the development of prognostic prediction models in a general practitioner (GP) setting. MATERIALS AND METHODS: We trained and validated prediction models for 4 common clinical predicti...

Descripción completa

Detalles Bibliográficos
Autores principales: Seinen, Tom M, Kors, Jan A, van Mulligen, Erik M, Fridgeirsson, Egill, Rijnbeek, Peter R
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10654855/
https://www.ncbi.nlm.nih.gov/pubmed/37587084
http://dx.doi.org/10.1093/jamia/ocad160
_version_ 1785136706827059200
author Seinen, Tom M
Kors, Jan A
van Mulligen, Erik M
Fridgeirsson, Egill
Rijnbeek, Peter R
author_facet Seinen, Tom M
Kors, Jan A
van Mulligen, Erik M
Fridgeirsson, Egill
Rijnbeek, Peter R
author_sort Seinen, Tom M
collection PubMed
description OBJECTIVE: This work aims to explore the value of Dutch unstructured data, in combination with structured data, for the development of prognostic prediction models in a general practitioner (GP) setting. MATERIALS AND METHODS: We trained and validated prediction models for 4 common clinical prediction problems using various sparse text representations, common prediction algorithms, and observational GP electronic health record (EHR) data. We trained and validated 84 models internally and externally on data from different EHR systems. RESULTS: On average, over all the different text representations and prediction algorithms, models only using text data performed better or similar to models using structured data alone in 2 prediction tasks. Additionally, in these 2 tasks, the combination of structured and text data outperformed models using structured or text data alone. No large performance differences were found between the different text representations and prediction algorithms. DISCUSSION: Our findings indicate that the use of unstructured data alone can result in well-performing prediction models for some clinical prediction problems. Furthermore, the performance improvement achieved by combining structured and text data highlights the added value. Additionally, we demonstrate the significance of clinical natural language processing research in languages other than English and the possibility of validating text-based prediction models across various EHR systems. CONCLUSION: Our study highlights the potential benefits of incorporating unstructured data in clinical prediction models in a GP setting. Although the added value of unstructured data may vary depending on the specific prediction task, our findings suggest that it has the potential to enhance patient care.
format Online
Article
Text
id pubmed-10654855
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-106548552023-08-16 The added value of text from Dutch general practitioner notes in predictive modeling Seinen, Tom M Kors, Jan A van Mulligen, Erik M Fridgeirsson, Egill Rijnbeek, Peter R J Am Med Inform Assoc Research and Applications OBJECTIVE: This work aims to explore the value of Dutch unstructured data, in combination with structured data, for the development of prognostic prediction models in a general practitioner (GP) setting. MATERIALS AND METHODS: We trained and validated prediction models for 4 common clinical prediction problems using various sparse text representations, common prediction algorithms, and observational GP electronic health record (EHR) data. We trained and validated 84 models internally and externally on data from different EHR systems. RESULTS: On average, over all the different text representations and prediction algorithms, models only using text data performed better or similar to models using structured data alone in 2 prediction tasks. Additionally, in these 2 tasks, the combination of structured and text data outperformed models using structured or text data alone. No large performance differences were found between the different text representations and prediction algorithms. DISCUSSION: Our findings indicate that the use of unstructured data alone can result in well-performing prediction models for some clinical prediction problems. Furthermore, the performance improvement achieved by combining structured and text data highlights the added value. Additionally, we demonstrate the significance of clinical natural language processing research in languages other than English and the possibility of validating text-based prediction models across various EHR systems. CONCLUSION: Our study highlights the potential benefits of incorporating unstructured data in clinical prediction models in a GP setting. Although the added value of unstructured data may vary depending on the specific prediction task, our findings suggest that it has the potential to enhance patient care. Oxford University Press 2023-08-16 /pmc/articles/PMC10654855/ /pubmed/37587084 http://dx.doi.org/10.1093/jamia/ocad160 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research and Applications
Seinen, Tom M
Kors, Jan A
van Mulligen, Erik M
Fridgeirsson, Egill
Rijnbeek, Peter R
The added value of text from Dutch general practitioner notes in predictive modeling
title The added value of text from Dutch general practitioner notes in predictive modeling
title_full The added value of text from Dutch general practitioner notes in predictive modeling
title_fullStr The added value of text from Dutch general practitioner notes in predictive modeling
title_full_unstemmed The added value of text from Dutch general practitioner notes in predictive modeling
title_short The added value of text from Dutch general practitioner notes in predictive modeling
title_sort added value of text from dutch general practitioner notes in predictive modeling
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10654855/
https://www.ncbi.nlm.nih.gov/pubmed/37587084
http://dx.doi.org/10.1093/jamia/ocad160
work_keys_str_mv AT seinentomm theaddedvalueoftextfromdutchgeneralpractitionernotesinpredictivemodeling
AT korsjana theaddedvalueoftextfromdutchgeneralpractitionernotesinpredictivemodeling
AT vanmulligenerikm theaddedvalueoftextfromdutchgeneralpractitionernotesinpredictivemodeling
AT fridgeirssonegill theaddedvalueoftextfromdutchgeneralpractitionernotesinpredictivemodeling
AT rijnbeekpeterr theaddedvalueoftextfromdutchgeneralpractitionernotesinpredictivemodeling
AT seinentomm addedvalueoftextfromdutchgeneralpractitionernotesinpredictivemodeling
AT korsjana addedvalueoftextfromdutchgeneralpractitionernotesinpredictivemodeling
AT vanmulligenerikm addedvalueoftextfromdutchgeneralpractitionernotesinpredictivemodeling
AT fridgeirssonegill addedvalueoftextfromdutchgeneralpractitionernotesinpredictivemodeling
AT rijnbeekpeterr addedvalueoftextfromdutchgeneralpractitionernotesinpredictivemodeling