Cargando…
Predicting future falls in older people using natural language processing of general practitioners’ clinical notes
BACKGROUND: Falls in older people are common and morbid. Prediction models can help identifying individuals at higher fall risk. Electronic health records (EHR) offer an opportunity to develop automated prediction tools that may help to identify fall-prone individuals and lower clinical workload. Ho...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10071555/ https://www.ncbi.nlm.nih.gov/pubmed/37014000 http://dx.doi.org/10.1093/ageing/afad046 |
_version_ | 1785019221347926016 |
---|---|
author | Dormosh, Noman Schut, Martijn C Heymans, Martijn W Maarsingh, Otto Bouman, Jonathan van der Velde, Nathalie Abu-Hanna, Ameen |
author_facet | Dormosh, Noman Schut, Martijn C Heymans, Martijn W Maarsingh, Otto Bouman, Jonathan van der Velde, Nathalie Abu-Hanna, Ameen |
author_sort | Dormosh, Noman |
collection | PubMed |
description | BACKGROUND: Falls in older people are common and morbid. Prediction models can help identifying individuals at higher fall risk. Electronic health records (EHR) offer an opportunity to develop automated prediction tools that may help to identify fall-prone individuals and lower clinical workload. However, existing models primarily utilise structured EHR data and neglect information in unstructured data. Using machine learning and natural language processing (NLP), we aimed to examine the predictive performance provided by unstructured clinical notes, and their incremental performance over structured data to predict falls. METHODS: We used primary care EHR data of people aged 65 or over. We developed three logistic regression models using the least absolute shrinkage and selection operator: one using structured clinical variables (Baseline), one with topics extracted from unstructured clinical notes (Topic-based) and one by adding clinical variables to the extracted topics (Combi). Model performance was assessed in terms of discrimination using the area under the receiver operating characteristic curve (AUC), and calibration by calibration plots. We used 10-fold cross-validation to validate the approach. RESULTS: Data of 35,357 individuals were analysed, of which 4,734 experienced falls. Our NLP topic modelling technique discovered 151 topics from the unstructured clinical notes. AUCs and 95% confidence intervals of the Baseline, Topic-based and Combi models were 0.709 (0.700–0.719), 0.685 (0.676–0.694) and 0.718 (0.708–0.727), respectively. All the models showed good calibration. CONCLUSIONS: Unstructured clinical notes are an additional viable data source to develop and improve prediction models for falls compared to traditional prediction models, but the clinical relevance remains limited. |
format | Online Article Text |
id | pubmed-10071555 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-100715552023-04-05 Predicting future falls in older people using natural language processing of general practitioners’ clinical notes Dormosh, Noman Schut, Martijn C Heymans, Martijn W Maarsingh, Otto Bouman, Jonathan van der Velde, Nathalie Abu-Hanna, Ameen Age Ageing Research Paper BACKGROUND: Falls in older people are common and morbid. Prediction models can help identifying individuals at higher fall risk. Electronic health records (EHR) offer an opportunity to develop automated prediction tools that may help to identify fall-prone individuals and lower clinical workload. However, existing models primarily utilise structured EHR data and neglect information in unstructured data. Using machine learning and natural language processing (NLP), we aimed to examine the predictive performance provided by unstructured clinical notes, and their incremental performance over structured data to predict falls. METHODS: We used primary care EHR data of people aged 65 or over. We developed three logistic regression models using the least absolute shrinkage and selection operator: one using structured clinical variables (Baseline), one with topics extracted from unstructured clinical notes (Topic-based) and one by adding clinical variables to the extracted topics (Combi). Model performance was assessed in terms of discrimination using the area under the receiver operating characteristic curve (AUC), and calibration by calibration plots. We used 10-fold cross-validation to validate the approach. RESULTS: Data of 35,357 individuals were analysed, of which 4,734 experienced falls. Our NLP topic modelling technique discovered 151 topics from the unstructured clinical notes. AUCs and 95% confidence intervals of the Baseline, Topic-based and Combi models were 0.709 (0.700–0.719), 0.685 (0.676–0.694) and 0.718 (0.708–0.727), respectively. All the models showed good calibration. CONCLUSIONS: Unstructured clinical notes are an additional viable data source to develop and improve prediction models for falls compared to traditional prediction models, but the clinical relevance remains limited. Oxford University Press 2023-04-01 /pmc/articles/PMC10071555/ /pubmed/37014000 http://dx.doi.org/10.1093/ageing/afad046 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the British Geriatrics Society. All rights reserved. For permissions, please email: journals.permissions@oup.com https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Research Paper Dormosh, Noman Schut, Martijn C Heymans, Martijn W Maarsingh, Otto Bouman, Jonathan van der Velde, Nathalie Abu-Hanna, Ameen Predicting future falls in older people using natural language processing of general practitioners’ clinical notes |
title | Predicting future falls in older people using natural language processing of general practitioners’ clinical notes |
title_full | Predicting future falls in older people using natural language processing of general practitioners’ clinical notes |
title_fullStr | Predicting future falls in older people using natural language processing of general practitioners’ clinical notes |
title_full_unstemmed | Predicting future falls in older people using natural language processing of general practitioners’ clinical notes |
title_short | Predicting future falls in older people using natural language processing of general practitioners’ clinical notes |
title_sort | predicting future falls in older people using natural language processing of general practitioners’ clinical notes |
topic | Research Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10071555/ https://www.ncbi.nlm.nih.gov/pubmed/37014000 http://dx.doi.org/10.1093/ageing/afad046 |
work_keys_str_mv | AT dormoshnoman predictingfuturefallsinolderpeopleusingnaturallanguageprocessingofgeneralpractitionersclinicalnotes AT schutmartijnc predictingfuturefallsinolderpeopleusingnaturallanguageprocessingofgeneralpractitionersclinicalnotes AT heymansmartijnw predictingfuturefallsinolderpeopleusingnaturallanguageprocessingofgeneralpractitionersclinicalnotes AT maarsinghotto predictingfuturefallsinolderpeopleusingnaturallanguageprocessingofgeneralpractitionersclinicalnotes AT boumanjonathan predictingfuturefallsinolderpeopleusingnaturallanguageprocessingofgeneralpractitionersclinicalnotes AT vanderveldenathalie predictingfuturefallsinolderpeopleusingnaturallanguageprocessingofgeneralpractitionersclinicalnotes AT abuhannaameen predictingfuturefallsinolderpeopleusingnaturallanguageprocessingofgeneralpractitionersclinicalnotes |