Cargando…

Reporting of Model Performance and Statistical Methods in Studies That Use Machine Learning to Develop Clinical Prediction Models: Protocol for a Systematic Review

BACKGROUND: With the growing excitement of the potential benefits of using machine learning and artificial intelligence in medicine, the number of published clinical prediction models that use these approaches has increased. However, there is evidence (albeit limited) that suggests that the reportin...

Descripción completa

Detalles Bibliográficos
Autores principales: Weaver, Colin George Wyllie, Basmadjian, Robert B, Williamson, Tyler, McBrien, Kerry, Sajobi, Tolu, Boyne, Devon, Yusuf, Mohamed, Ronksley, Paul Everett
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8931652/
https://www.ncbi.nlm.nih.gov/pubmed/35238322
http://dx.doi.org/10.2196/30956
_version_ 1784671307722391552
author Weaver, Colin George Wyllie
Basmadjian, Robert B
Williamson, Tyler
McBrien, Kerry
Sajobi, Tolu
Boyne, Devon
Yusuf, Mohamed
Ronksley, Paul Everett
author_facet Weaver, Colin George Wyllie
Basmadjian, Robert B
Williamson, Tyler
McBrien, Kerry
Sajobi, Tolu
Boyne, Devon
Yusuf, Mohamed
Ronksley, Paul Everett
author_sort Weaver, Colin George Wyllie
collection PubMed
description BACKGROUND: With the growing excitement of the potential benefits of using machine learning and artificial intelligence in medicine, the number of published clinical prediction models that use these approaches has increased. However, there is evidence (albeit limited) that suggests that the reporting of machine learning–specific aspects in these studies is poor. Further, there are no reviews assessing the reporting quality or broadly accepted reporting guidelines for these aspects. OBJECTIVE: This paper presents the protocol for a systematic review that will assess the reporting quality of machine learning–specific aspects in studies that use machine learning to develop clinical prediction models. METHODS: We will include studies that use a supervised machine learning algorithm to develop a prediction model for use in clinical practice (ie, for diagnosis or prognosis of a condition or identification of candidates for health care interventions). We will search MEDLINE for studies published in 2019, pseudorandomly sort the records, and screen until we obtain 100 studies that meet our inclusion criteria. We will assess reporting quality with a novel checklist developed in parallel with this review, which includes content derived from existing reporting guidelines, textbooks, and consultations with experts. The checklist will cover 4 key areas where the reporting of machine learning studies is unique: modelling steps (order and data used for each step), model performance (eg, reporting the performance of each model compared), statistical methods (eg, describing the tuning approach), and presentation of models (eg, specifying the predictors that contributed to the final model). RESULTS: We completed data analysis in August 2021 and are writing the manuscript. We expect to submit the results to a peer-reviewed journal in early 2022. CONCLUSIONS: This review will contribute to more standardized and complete reporting in the field by identifying areas where reporting is poor and can be improved. TRIAL REGISTRATION: PROSPERO International Prospective Register of Systematic Reviews CRD42020206167; https://www.crd.york.ac.uk/PROSPERO/display_record.php?RecordID=206167 INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID): RR1-10.2196/30956
format Online
Article
Text
id pubmed-8931652
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-89316522022-03-19 Reporting of Model Performance and Statistical Methods in Studies That Use Machine Learning to Develop Clinical Prediction Models: Protocol for a Systematic Review Weaver, Colin George Wyllie Basmadjian, Robert B Williamson, Tyler McBrien, Kerry Sajobi, Tolu Boyne, Devon Yusuf, Mohamed Ronksley, Paul Everett JMIR Res Protoc Protocol BACKGROUND: With the growing excitement of the potential benefits of using machine learning and artificial intelligence in medicine, the number of published clinical prediction models that use these approaches has increased. However, there is evidence (albeit limited) that suggests that the reporting of machine learning–specific aspects in these studies is poor. Further, there are no reviews assessing the reporting quality or broadly accepted reporting guidelines for these aspects. OBJECTIVE: This paper presents the protocol for a systematic review that will assess the reporting quality of machine learning–specific aspects in studies that use machine learning to develop clinical prediction models. METHODS: We will include studies that use a supervised machine learning algorithm to develop a prediction model for use in clinical practice (ie, for diagnosis or prognosis of a condition or identification of candidates for health care interventions). We will search MEDLINE for studies published in 2019, pseudorandomly sort the records, and screen until we obtain 100 studies that meet our inclusion criteria. We will assess reporting quality with a novel checklist developed in parallel with this review, which includes content derived from existing reporting guidelines, textbooks, and consultations with experts. The checklist will cover 4 key areas where the reporting of machine learning studies is unique: modelling steps (order and data used for each step), model performance (eg, reporting the performance of each model compared), statistical methods (eg, describing the tuning approach), and presentation of models (eg, specifying the predictors that contributed to the final model). RESULTS: We completed data analysis in August 2021 and are writing the manuscript. We expect to submit the results to a peer-reviewed journal in early 2022. CONCLUSIONS: This review will contribute to more standardized and complete reporting in the field by identifying areas where reporting is poor and can be improved. TRIAL REGISTRATION: PROSPERO International Prospective Register of Systematic Reviews CRD42020206167; https://www.crd.york.ac.uk/PROSPERO/display_record.php?RecordID=206167 INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID): RR1-10.2196/30956 JMIR Publications 2022-03-03 /pmc/articles/PMC8931652/ /pubmed/35238322 http://dx.doi.org/10.2196/30956 Text en ©Colin George Wyllie Weaver, Robert B Basmadjian, Tyler Williamson, Kerry McBrien, Tolu Sajobi, Devon Boyne, Mohamed Yusuf, Paul Everett Ronksley. Originally published in JMIR Research Protocols (https://www.researchprotocols.org), 03.03.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Research Protocols, is properly cited. The complete bibliographic information, a link to the original publication on https://www.researchprotocols.org, as well as this copyright and license information must be included.
spellingShingle Protocol
Weaver, Colin George Wyllie
Basmadjian, Robert B
Williamson, Tyler
McBrien, Kerry
Sajobi, Tolu
Boyne, Devon
Yusuf, Mohamed
Ronksley, Paul Everett
Reporting of Model Performance and Statistical Methods in Studies That Use Machine Learning to Develop Clinical Prediction Models: Protocol for a Systematic Review
title Reporting of Model Performance and Statistical Methods in Studies That Use Machine Learning to Develop Clinical Prediction Models: Protocol for a Systematic Review
title_full Reporting of Model Performance and Statistical Methods in Studies That Use Machine Learning to Develop Clinical Prediction Models: Protocol for a Systematic Review
title_fullStr Reporting of Model Performance and Statistical Methods in Studies That Use Machine Learning to Develop Clinical Prediction Models: Protocol for a Systematic Review
title_full_unstemmed Reporting of Model Performance and Statistical Methods in Studies That Use Machine Learning to Develop Clinical Prediction Models: Protocol for a Systematic Review
title_short Reporting of Model Performance and Statistical Methods in Studies That Use Machine Learning to Develop Clinical Prediction Models: Protocol for a Systematic Review
title_sort reporting of model performance and statistical methods in studies that use machine learning to develop clinical prediction models: protocol for a systematic review
topic Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8931652/
https://www.ncbi.nlm.nih.gov/pubmed/35238322
http://dx.doi.org/10.2196/30956
work_keys_str_mv AT weavercolingeorgewyllie reportingofmodelperformanceandstatisticalmethodsinstudiesthatusemachinelearningtodevelopclinicalpredictionmodelsprotocolforasystematicreview
AT basmadjianrobertb reportingofmodelperformanceandstatisticalmethodsinstudiesthatusemachinelearningtodevelopclinicalpredictionmodelsprotocolforasystematicreview
AT williamsontyler reportingofmodelperformanceandstatisticalmethodsinstudiesthatusemachinelearningtodevelopclinicalpredictionmodelsprotocolforasystematicreview
AT mcbrienkerry reportingofmodelperformanceandstatisticalmethodsinstudiesthatusemachinelearningtodevelopclinicalpredictionmodelsprotocolforasystematicreview
AT sajobitolu reportingofmodelperformanceandstatisticalmethodsinstudiesthatusemachinelearningtodevelopclinicalpredictionmodelsprotocolforasystematicreview
AT boynedevon reportingofmodelperformanceandstatisticalmethodsinstudiesthatusemachinelearningtodevelopclinicalpredictionmodelsprotocolforasystematicreview
AT yusufmohamed reportingofmodelperformanceandstatisticalmethodsinstudiesthatusemachinelearningtodevelopclinicalpredictionmodelsprotocolforasystematicreview
AT ronksleypauleverett reportingofmodelperformanceandstatisticalmethodsinstudiesthatusemachinelearningtodevelopclinicalpredictionmodelsprotocolforasystematicreview