Cargando…

Ensemble machine learning methods in screening electronic health records: A scoping review

BACKGROUND: Electronic health records provide the opportunity to identify undiagnosed individuals likely to have a given disease using machine learning techniques, and who could then benefit from more medical screening and case finding, reducing the number needed to screen with convenience and healt...

Descripción completa

Detalles Bibliográficos
Autores principales: Stevens, Christophe AT, Lyons, Alexander RM, Dharmayat, Kanika I, Mahani, Alireza, Ray, Kausik K, Vallejo-Vaz, Antonio J, Sharabiani, Mansour TA
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10176785/
https://www.ncbi.nlm.nih.gov/pubmed/37188075
http://dx.doi.org/10.1177/20552076231173225
_version_ 1785040499034292224
author Stevens, Christophe AT
Lyons, Alexander RM
Dharmayat, Kanika I
Mahani, Alireza
Ray, Kausik K
Vallejo-Vaz, Antonio J
Sharabiani, Mansour TA
author_facet Stevens, Christophe AT
Lyons, Alexander RM
Dharmayat, Kanika I
Mahani, Alireza
Ray, Kausik K
Vallejo-Vaz, Antonio J
Sharabiani, Mansour TA
author_sort Stevens, Christophe AT
collection PubMed
description BACKGROUND: Electronic health records provide the opportunity to identify undiagnosed individuals likely to have a given disease using machine learning techniques, and who could then benefit from more medical screening and case finding, reducing the number needed to screen with convenience and healthcare cost savings. Ensemble machine learning models combining multiple prediction estimates into one are often said to provide better predictive performances than non-ensemble models. Yet, to our knowledge, no literature review summarises the use and performances of different types of ensemble machine learning models in the context of medical pre-screening. METHOD: We aimed to conduct a scoping review of the literature reporting the derivation of ensemble machine learning models for screening of electronic health records. We searched EMBASE and MEDLINE databases across all years applying a formal search strategy using terms related to medical screening, electronic health records and machine learning. Data were collected, analysed, and reported in accordance with the PRISMA scoping review guideline. RESULTS: A total of 3355 articles were retrieved, of which 145 articles met our inclusion criteria and were included in this study. Ensemble machine learning models were increasingly employed across several medical specialties and often outperformed non-ensemble approaches. Ensemble machine learning models with complex combination strategies and heterogeneous classifiers often outperformed other types of ensemble machine learning models but were also less used. Ensemble machine learning models methodologies, processing steps and data sources were often not clearly described. CONCLUSIONS: Our work highlights the importance of deriving and comparing the performances of different types of ensemble machine learning models when screening electronic health records and underscores the need for more comprehensive reporting of machine learning methodologies employed in clinical research.
format Online
Article
Text
id pubmed-10176785
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-101767852023-05-13 Ensemble machine learning methods in screening electronic health records: A scoping review Stevens, Christophe AT Lyons, Alexander RM Dharmayat, Kanika I Mahani, Alireza Ray, Kausik K Vallejo-Vaz, Antonio J Sharabiani, Mansour TA Digit Health Review Article BACKGROUND: Electronic health records provide the opportunity to identify undiagnosed individuals likely to have a given disease using machine learning techniques, and who could then benefit from more medical screening and case finding, reducing the number needed to screen with convenience and healthcare cost savings. Ensemble machine learning models combining multiple prediction estimates into one are often said to provide better predictive performances than non-ensemble models. Yet, to our knowledge, no literature review summarises the use and performances of different types of ensemble machine learning models in the context of medical pre-screening. METHOD: We aimed to conduct a scoping review of the literature reporting the derivation of ensemble machine learning models for screening of electronic health records. We searched EMBASE and MEDLINE databases across all years applying a formal search strategy using terms related to medical screening, electronic health records and machine learning. Data were collected, analysed, and reported in accordance with the PRISMA scoping review guideline. RESULTS: A total of 3355 articles were retrieved, of which 145 articles met our inclusion criteria and were included in this study. Ensemble machine learning models were increasingly employed across several medical specialties and often outperformed non-ensemble approaches. Ensemble machine learning models with complex combination strategies and heterogeneous classifiers often outperformed other types of ensemble machine learning models but were also less used. Ensemble machine learning models methodologies, processing steps and data sources were often not clearly described. CONCLUSIONS: Our work highlights the importance of deriving and comparing the performances of different types of ensemble machine learning models when screening electronic health records and underscores the need for more comprehensive reporting of machine learning methodologies employed in clinical research. SAGE Publications 2023-05-09 /pmc/articles/PMC10176785/ /pubmed/37188075 http://dx.doi.org/10.1177/20552076231173225 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/This article is distributed under the terms of the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Review Article
Stevens, Christophe AT
Lyons, Alexander RM
Dharmayat, Kanika I
Mahani, Alireza
Ray, Kausik K
Vallejo-Vaz, Antonio J
Sharabiani, Mansour TA
Ensemble machine learning methods in screening electronic health records: A scoping review
title Ensemble machine learning methods in screening electronic health records: A scoping review
title_full Ensemble machine learning methods in screening electronic health records: A scoping review
title_fullStr Ensemble machine learning methods in screening electronic health records: A scoping review
title_full_unstemmed Ensemble machine learning methods in screening electronic health records: A scoping review
title_short Ensemble machine learning methods in screening electronic health records: A scoping review
title_sort ensemble machine learning methods in screening electronic health records: a scoping review
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10176785/
https://www.ncbi.nlm.nih.gov/pubmed/37188075
http://dx.doi.org/10.1177/20552076231173225
work_keys_str_mv AT stevenschristopheat ensemblemachinelearningmethodsinscreeningelectronichealthrecordsascopingreview
AT lyonsalexanderrm ensemblemachinelearningmethodsinscreeningelectronichealthrecordsascopingreview
AT dharmayatkanikai ensemblemachinelearningmethodsinscreeningelectronichealthrecordsascopingreview
AT mahanialireza ensemblemachinelearningmethodsinscreeningelectronichealthrecordsascopingreview
AT raykausikk ensemblemachinelearningmethodsinscreeningelectronichealthrecordsascopingreview
AT vallejovazantonioj ensemblemachinelearningmethodsinscreeningelectronichealthrecordsascopingreview
AT sharabianimansourta ensemblemachinelearningmethodsinscreeningelectronichealthrecordsascopingreview