Cargando…
Prediction models using artificial intelligence and longitudinal data from electronic health records: a systematic methodological review
OBJECTIVE: To describe and appraise the use of artificial intelligence (AI) techniques that can cope with longitudinal data from electronic health records (EHRs) to predict health-related outcomes. METHODS: This review included studies in any language that: EHR was at least one of the data sources,...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10654870/ https://www.ncbi.nlm.nih.gov/pubmed/37659105 http://dx.doi.org/10.1093/jamia/ocad168 |
_version_ | 1785136708513169408 |
---|---|
author | Carrasco-Ribelles, Lucía A Llanes-Jurado, José Gallego-Moll, Carlos Cabrera-Bean, Margarita Monteagudo-Zaragoza, Mònica Violán, Concepción Zabaleta-del-Olmo, Edurne |
author_facet | Carrasco-Ribelles, Lucía A Llanes-Jurado, José Gallego-Moll, Carlos Cabrera-Bean, Margarita Monteagudo-Zaragoza, Mònica Violán, Concepción Zabaleta-del-Olmo, Edurne |
author_sort | Carrasco-Ribelles, Lucía A |
collection | PubMed |
description | OBJECTIVE: To describe and appraise the use of artificial intelligence (AI) techniques that can cope with longitudinal data from electronic health records (EHRs) to predict health-related outcomes. METHODS: This review included studies in any language that: EHR was at least one of the data sources, collected longitudinal data, used an AI technique capable of handling longitudinal data, and predicted any health-related outcomes. We searched MEDLINE, Scopus, Web of Science, and IEEE Xplorer from inception to January 3, 2022. Information on the dataset, prediction task, data preprocessing, feature selection, method, validation, performance, and implementation was extracted and summarized using descriptive statistics. Risk of bias and completeness of reporting were assessed using a short form of PROBAST and TRIPOD, respectively. RESULTS: Eighty-one studies were included. Follow-up time and number of registers per patient varied greatly, and most predicted disease development or next event based on diagnoses and drug treatments. Architectures generally were based on Recurrent Neural Networks-like layers, though in recent years combining different layers or transformers has become more popular. About half of the included studies performed hyperparameter tuning and used attention mechanisms. Most performed a single train-test partition and could not correctly assess the variability of the model’s performance. Reporting quality was poor, and a third of the studies were at high risk of bias. CONCLUSIONS: AI models are increasingly using longitudinal data. However, the heterogeneity in reporting methodology and results, and the lack of public EHR datasets and code sharing, complicate the possibility of replication. REGISTRATION: PROSPERO database (CRD42022331388). |
format | Online Article Text |
id | pubmed-10654870 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-106548702023-09-02 Prediction models using artificial intelligence and longitudinal data from electronic health records: a systematic methodological review Carrasco-Ribelles, Lucía A Llanes-Jurado, José Gallego-Moll, Carlos Cabrera-Bean, Margarita Monteagudo-Zaragoza, Mònica Violán, Concepción Zabaleta-del-Olmo, Edurne J Am Med Inform Assoc Review OBJECTIVE: To describe and appraise the use of artificial intelligence (AI) techniques that can cope with longitudinal data from electronic health records (EHRs) to predict health-related outcomes. METHODS: This review included studies in any language that: EHR was at least one of the data sources, collected longitudinal data, used an AI technique capable of handling longitudinal data, and predicted any health-related outcomes. We searched MEDLINE, Scopus, Web of Science, and IEEE Xplorer from inception to January 3, 2022. Information on the dataset, prediction task, data preprocessing, feature selection, method, validation, performance, and implementation was extracted and summarized using descriptive statistics. Risk of bias and completeness of reporting were assessed using a short form of PROBAST and TRIPOD, respectively. RESULTS: Eighty-one studies were included. Follow-up time and number of registers per patient varied greatly, and most predicted disease development or next event based on diagnoses and drug treatments. Architectures generally were based on Recurrent Neural Networks-like layers, though in recent years combining different layers or transformers has become more popular. About half of the included studies performed hyperparameter tuning and used attention mechanisms. Most performed a single train-test partition and could not correctly assess the variability of the model’s performance. Reporting quality was poor, and a third of the studies were at high risk of bias. CONCLUSIONS: AI models are increasingly using longitudinal data. However, the heterogeneity in reporting methodology and results, and the lack of public EHR datasets and code sharing, complicate the possibility of replication. REGISTRATION: PROSPERO database (CRD42022331388). Oxford University Press 2023-09-02 /pmc/articles/PMC10654870/ /pubmed/37659105 http://dx.doi.org/10.1093/jamia/ocad168 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Review Carrasco-Ribelles, Lucía A Llanes-Jurado, José Gallego-Moll, Carlos Cabrera-Bean, Margarita Monteagudo-Zaragoza, Mònica Violán, Concepción Zabaleta-del-Olmo, Edurne Prediction models using artificial intelligence and longitudinal data from electronic health records: a systematic methodological review |
title | Prediction models using artificial intelligence and longitudinal data from electronic health records: a systematic methodological review |
title_full | Prediction models using artificial intelligence and longitudinal data from electronic health records: a systematic methodological review |
title_fullStr | Prediction models using artificial intelligence and longitudinal data from electronic health records: a systematic methodological review |
title_full_unstemmed | Prediction models using artificial intelligence and longitudinal data from electronic health records: a systematic methodological review |
title_short | Prediction models using artificial intelligence and longitudinal data from electronic health records: a systematic methodological review |
title_sort | prediction models using artificial intelligence and longitudinal data from electronic health records: a systematic methodological review |
topic | Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10654870/ https://www.ncbi.nlm.nih.gov/pubmed/37659105 http://dx.doi.org/10.1093/jamia/ocad168 |
work_keys_str_mv | AT carrascoribellesluciaa predictionmodelsusingartificialintelligenceandlongitudinaldatafromelectronichealthrecordsasystematicmethodologicalreview AT llanesjuradojose predictionmodelsusingartificialintelligenceandlongitudinaldatafromelectronichealthrecordsasystematicmethodologicalreview AT gallegomollcarlos predictionmodelsusingartificialintelligenceandlongitudinaldatafromelectronichealthrecordsasystematicmethodologicalreview AT cabrerabeanmargarita predictionmodelsusingartificialintelligenceandlongitudinaldatafromelectronichealthrecordsasystematicmethodologicalreview AT monteagudozaragozamonica predictionmodelsusingartificialintelligenceandlongitudinaldatafromelectronichealthrecordsasystematicmethodologicalreview AT violanconcepcion predictionmodelsusingartificialintelligenceandlongitudinaldatafromelectronichealthrecordsasystematicmethodologicalreview AT zabaletadelolmoedurne predictionmodelsusingartificialintelligenceandlongitudinaldatafromelectronichealthrecordsasystematicmethodologicalreview |