Cargando…

Predicting future falls in older people using natural language processing of general practitioners’ clinical notes

BACKGROUND: Falls in older people are common and morbid. Prediction models can help identifying individuals at higher fall risk. Electronic health records (EHR) offer an opportunity to develop automated prediction tools that may help to identify fall-prone individuals and lower clinical workload. Ho...

Descripción completa

Detalles Bibliográficos
Autores principales: Dormosh, Noman, Schut, Martijn C, Heymans, Martijn W, Maarsingh, Otto, Bouman, Jonathan, van der Velde, Nathalie, Abu-Hanna, Ameen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10071555/
https://www.ncbi.nlm.nih.gov/pubmed/37014000
http://dx.doi.org/10.1093/ageing/afad046
_version_ 1785019221347926016
author Dormosh, Noman
Schut, Martijn C
Heymans, Martijn W
Maarsingh, Otto
Bouman, Jonathan
van der Velde, Nathalie
Abu-Hanna, Ameen
author_facet Dormosh, Noman
Schut, Martijn C
Heymans, Martijn W
Maarsingh, Otto
Bouman, Jonathan
van der Velde, Nathalie
Abu-Hanna, Ameen
author_sort Dormosh, Noman
collection PubMed
description BACKGROUND: Falls in older people are common and morbid. Prediction models can help identifying individuals at higher fall risk. Electronic health records (EHR) offer an opportunity to develop automated prediction tools that may help to identify fall-prone individuals and lower clinical workload. However, existing models primarily utilise structured EHR data and neglect information in unstructured data. Using machine learning and natural language processing (NLP), we aimed to examine the predictive performance provided by unstructured clinical notes, and their incremental performance over structured data to predict falls. METHODS: We used primary care EHR data of people aged 65 or over. We developed three logistic regression models using the least absolute shrinkage and selection operator: one using structured clinical variables (Baseline), one with topics extracted from unstructured clinical notes (Topic-based) and one by adding clinical variables to the extracted topics (Combi). Model performance was assessed in terms of discrimination using the area under the receiver operating characteristic curve (AUC), and calibration by calibration plots. We used 10-fold cross-validation to validate the approach. RESULTS: Data of 35,357 individuals were analysed, of which 4,734 experienced falls. Our NLP topic modelling technique discovered 151 topics from the unstructured clinical notes. AUCs and 95% confidence intervals of the Baseline, Topic-based and Combi models were 0.709 (0.700–0.719), 0.685 (0.676–0.694) and 0.718 (0.708–0.727), respectively. All the models showed good calibration. CONCLUSIONS: Unstructured clinical notes are an additional viable data source to develop and improve prediction models for falls compared to traditional prediction models, but the clinical relevance remains limited.
format Online
Article
Text
id pubmed-10071555
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-100715552023-04-05 Predicting future falls in older people using natural language processing of general practitioners’ clinical notes Dormosh, Noman Schut, Martijn C Heymans, Martijn W Maarsingh, Otto Bouman, Jonathan van der Velde, Nathalie Abu-Hanna, Ameen Age Ageing Research Paper BACKGROUND: Falls in older people are common and morbid. Prediction models can help identifying individuals at higher fall risk. Electronic health records (EHR) offer an opportunity to develop automated prediction tools that may help to identify fall-prone individuals and lower clinical workload. However, existing models primarily utilise structured EHR data and neglect information in unstructured data. Using machine learning and natural language processing (NLP), we aimed to examine the predictive performance provided by unstructured clinical notes, and their incremental performance over structured data to predict falls. METHODS: We used primary care EHR data of people aged 65 or over. We developed three logistic regression models using the least absolute shrinkage and selection operator: one using structured clinical variables (Baseline), one with topics extracted from unstructured clinical notes (Topic-based) and one by adding clinical variables to the extracted topics (Combi). Model performance was assessed in terms of discrimination using the area under the receiver operating characteristic curve (AUC), and calibration by calibration plots. We used 10-fold cross-validation to validate the approach. RESULTS: Data of 35,357 individuals were analysed, of which 4,734 experienced falls. Our NLP topic modelling technique discovered 151 topics from the unstructured clinical notes. AUCs and 95% confidence intervals of the Baseline, Topic-based and Combi models were 0.709 (0.700–0.719), 0.685 (0.676–0.694) and 0.718 (0.708–0.727), respectively. All the models showed good calibration. CONCLUSIONS: Unstructured clinical notes are an additional viable data source to develop and improve prediction models for falls compared to traditional prediction models, but the clinical relevance remains limited. Oxford University Press 2023-04-01 /pmc/articles/PMC10071555/ /pubmed/37014000 http://dx.doi.org/10.1093/ageing/afad046 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the British Geriatrics Society. All rights reserved. For permissions, please email: journals.permissions@oup.com https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Research Paper
Dormosh, Noman
Schut, Martijn C
Heymans, Martijn W
Maarsingh, Otto
Bouman, Jonathan
van der Velde, Nathalie
Abu-Hanna, Ameen
Predicting future falls in older people using natural language processing of general practitioners’ clinical notes
title Predicting future falls in older people using natural language processing of general practitioners’ clinical notes
title_full Predicting future falls in older people using natural language processing of general practitioners’ clinical notes
title_fullStr Predicting future falls in older people using natural language processing of general practitioners’ clinical notes
title_full_unstemmed Predicting future falls in older people using natural language processing of general practitioners’ clinical notes
title_short Predicting future falls in older people using natural language processing of general practitioners’ clinical notes
title_sort predicting future falls in older people using natural language processing of general practitioners’ clinical notes
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10071555/
https://www.ncbi.nlm.nih.gov/pubmed/37014000
http://dx.doi.org/10.1093/ageing/afad046
work_keys_str_mv AT dormoshnoman predictingfuturefallsinolderpeopleusingnaturallanguageprocessingofgeneralpractitionersclinicalnotes
AT schutmartijnc predictingfuturefallsinolderpeopleusingnaturallanguageprocessingofgeneralpractitionersclinicalnotes
AT heymansmartijnw predictingfuturefallsinolderpeopleusingnaturallanguageprocessingofgeneralpractitionersclinicalnotes
AT maarsinghotto predictingfuturefallsinolderpeopleusingnaturallanguageprocessingofgeneralpractitionersclinicalnotes
AT boumanjonathan predictingfuturefallsinolderpeopleusingnaturallanguageprocessingofgeneralpractitionersclinicalnotes
AT vanderveldenathalie predictingfuturefallsinolderpeopleusingnaturallanguageprocessingofgeneralpractitionersclinicalnotes
AT abuhannaameen predictingfuturefallsinolderpeopleusingnaturallanguageprocessingofgeneralpractitionersclinicalnotes