Cargando…

Prediction models using artificial intelligence and longitudinal data from electronic health records: a systematic methodological review

OBJECTIVE: To describe and appraise the use of artificial intelligence (AI) techniques that can cope with longitudinal data from electronic health records (EHRs) to predict health-related outcomes. METHODS: This review included studies in any language that: EHR was at least one of the data sources,...

Descripción completa

Detalles Bibliográficos
Autores principales: Carrasco-Ribelles, Lucía A, Llanes-Jurado, José, Gallego-Moll, Carlos, Cabrera-Bean, Margarita, Monteagudo-Zaragoza, Mònica, Violán, Concepción, Zabaleta-del-Olmo, Edurne
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10654870/
https://www.ncbi.nlm.nih.gov/pubmed/37659105
http://dx.doi.org/10.1093/jamia/ocad168
_version_ 1785136708513169408
author Carrasco-Ribelles, Lucía A
Llanes-Jurado, José
Gallego-Moll, Carlos
Cabrera-Bean, Margarita
Monteagudo-Zaragoza, Mònica
Violán, Concepción
Zabaleta-del-Olmo, Edurne
author_facet Carrasco-Ribelles, Lucía A
Llanes-Jurado, José
Gallego-Moll, Carlos
Cabrera-Bean, Margarita
Monteagudo-Zaragoza, Mònica
Violán, Concepción
Zabaleta-del-Olmo, Edurne
author_sort Carrasco-Ribelles, Lucía A
collection PubMed
description OBJECTIVE: To describe and appraise the use of artificial intelligence (AI) techniques that can cope with longitudinal data from electronic health records (EHRs) to predict health-related outcomes. METHODS: This review included studies in any language that: EHR was at least one of the data sources, collected longitudinal data, used an AI technique capable of handling longitudinal data, and predicted any health-related outcomes. We searched MEDLINE, Scopus, Web of Science, and IEEE Xplorer from inception to January 3, 2022. Information on the dataset, prediction task, data preprocessing, feature selection, method, validation, performance, and implementation was extracted and summarized using descriptive statistics. Risk of bias and completeness of reporting were assessed using a short form of PROBAST and TRIPOD, respectively. RESULTS: Eighty-one studies were included. Follow-up time and number of registers per patient varied greatly, and most predicted disease development or next event based on diagnoses and drug treatments. Architectures generally were based on Recurrent Neural Networks-like layers, though in recent years combining different layers or transformers has become more popular. About half of the included studies performed hyperparameter tuning and used attention mechanisms. Most performed a single train-test partition and could not correctly assess the variability of the model’s performance. Reporting quality was poor, and a third of the studies were at high risk of bias. CONCLUSIONS: AI models are increasingly using longitudinal data. However, the heterogeneity in reporting methodology and results, and the lack of public EHR datasets and code sharing, complicate the possibility of replication. REGISTRATION: PROSPERO database (CRD42022331388).
format Online
Article
Text
id pubmed-10654870
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-106548702023-09-02 Prediction models using artificial intelligence and longitudinal data from electronic health records: a systematic methodological review Carrasco-Ribelles, Lucía A Llanes-Jurado, José Gallego-Moll, Carlos Cabrera-Bean, Margarita Monteagudo-Zaragoza, Mònica Violán, Concepción Zabaleta-del-Olmo, Edurne J Am Med Inform Assoc Review OBJECTIVE: To describe and appraise the use of artificial intelligence (AI) techniques that can cope with longitudinal data from electronic health records (EHRs) to predict health-related outcomes. METHODS: This review included studies in any language that: EHR was at least one of the data sources, collected longitudinal data, used an AI technique capable of handling longitudinal data, and predicted any health-related outcomes. We searched MEDLINE, Scopus, Web of Science, and IEEE Xplorer from inception to January 3, 2022. Information on the dataset, prediction task, data preprocessing, feature selection, method, validation, performance, and implementation was extracted and summarized using descriptive statistics. Risk of bias and completeness of reporting were assessed using a short form of PROBAST and TRIPOD, respectively. RESULTS: Eighty-one studies were included. Follow-up time and number of registers per patient varied greatly, and most predicted disease development or next event based on diagnoses and drug treatments. Architectures generally were based on Recurrent Neural Networks-like layers, though in recent years combining different layers or transformers has become more popular. About half of the included studies performed hyperparameter tuning and used attention mechanisms. Most performed a single train-test partition and could not correctly assess the variability of the model’s performance. Reporting quality was poor, and a third of the studies were at high risk of bias. CONCLUSIONS: AI models are increasingly using longitudinal data. However, the heterogeneity in reporting methodology and results, and the lack of public EHR datasets and code sharing, complicate the possibility of replication. REGISTRATION: PROSPERO database (CRD42022331388). Oxford University Press 2023-09-02 /pmc/articles/PMC10654870/ /pubmed/37659105 http://dx.doi.org/10.1093/jamia/ocad168 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Review
Carrasco-Ribelles, Lucía A
Llanes-Jurado, José
Gallego-Moll, Carlos
Cabrera-Bean, Margarita
Monteagudo-Zaragoza, Mònica
Violán, Concepción
Zabaleta-del-Olmo, Edurne
Prediction models using artificial intelligence and longitudinal data from electronic health records: a systematic methodological review
title Prediction models using artificial intelligence and longitudinal data from electronic health records: a systematic methodological review
title_full Prediction models using artificial intelligence and longitudinal data from electronic health records: a systematic methodological review
title_fullStr Prediction models using artificial intelligence and longitudinal data from electronic health records: a systematic methodological review
title_full_unstemmed Prediction models using artificial intelligence and longitudinal data from electronic health records: a systematic methodological review
title_short Prediction models using artificial intelligence and longitudinal data from electronic health records: a systematic methodological review
title_sort prediction models using artificial intelligence and longitudinal data from electronic health records: a systematic methodological review
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10654870/
https://www.ncbi.nlm.nih.gov/pubmed/37659105
http://dx.doi.org/10.1093/jamia/ocad168
work_keys_str_mv AT carrascoribellesluciaa predictionmodelsusingartificialintelligenceandlongitudinaldatafromelectronichealthrecordsasystematicmethodologicalreview
AT llanesjuradojose predictionmodelsusingartificialintelligenceandlongitudinaldatafromelectronichealthrecordsasystematicmethodologicalreview
AT gallegomollcarlos predictionmodelsusingartificialintelligenceandlongitudinaldatafromelectronichealthrecordsasystematicmethodologicalreview
AT cabrerabeanmargarita predictionmodelsusingartificialintelligenceandlongitudinaldatafromelectronichealthrecordsasystematicmethodologicalreview
AT monteagudozaragozamonica predictionmodelsusingartificialintelligenceandlongitudinaldatafromelectronichealthrecordsasystematicmethodologicalreview
AT violanconcepcion predictionmodelsusingartificialintelligenceandlongitudinaldatafromelectronichealthrecordsasystematicmethodologicalreview
AT zabaletadelolmoedurne predictionmodelsusingartificialintelligenceandlongitudinaldatafromelectronichealthrecordsasystematicmethodologicalreview