Cargando…

Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View

BACKGROUND: As more and more researchers are turning to big data for new opportunities of biomedical discoveries, machine learning models, as the backbone of big data analysis, are mentioned more often in biomedical journals. However, owing to the inherent complexity of machine learning methods, the...

Descripción completa

Detalles Bibliográficos
Autores principales: Luo, Wei, Phung, Dinh, Tran, Truyen, Gupta, Sunil, Rana, Santu, Karmakar, Chandan, Shilton, Alistair, Yearwood, John, Dimitrova, Nevenka, Ho, Tu Bao, Venkatesh, Svetha, Berk, Michael
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5238707/
https://www.ncbi.nlm.nih.gov/pubmed/27986644
http://dx.doi.org/10.2196/jmir.5870
_version_ 1782495757911195648
author Luo, Wei
Phung, Dinh
Tran, Truyen
Gupta, Sunil
Rana, Santu
Karmakar, Chandan
Shilton, Alistair
Yearwood, John
Dimitrova, Nevenka
Ho, Tu Bao
Venkatesh, Svetha
Berk, Michael
author_facet Luo, Wei
Phung, Dinh
Tran, Truyen
Gupta, Sunil
Rana, Santu
Karmakar, Chandan
Shilton, Alistair
Yearwood, John
Dimitrova, Nevenka
Ho, Tu Bao
Venkatesh, Svetha
Berk, Michael
author_sort Luo, Wei
collection PubMed
description BACKGROUND: As more and more researchers are turning to big data for new opportunities of biomedical discoveries, machine learning models, as the backbone of big data analysis, are mentioned more often in biomedical journals. However, owing to the inherent complexity of machine learning methods, they are prone to misuse. Because of the flexibility in specifying machine learning models, the results are often insufficiently reported in research articles, hindering reliable assessment of model validity and consistent interpretation of model outputs. OBJECTIVE: To attain a set of guidelines on the use of machine learning predictive models within clinical settings to make sure the models are correctly applied and sufficiently reported so that true discoveries can be distinguished from random coincidence. METHODS: A multidisciplinary panel of machine learning experts, clinicians, and traditional statisticians were interviewed, using an iterative process in accordance with the Delphi method. RESULTS: The process produced a set of guidelines that consists of (1) a list of reporting items to be included in a research article and (2) a set of practical sequential steps for developing predictive models. CONCLUSIONS: A set of guidelines was generated to enable correct application of machine learning models and consistent reporting of model specifications and results in biomedical research. We believe that such guidelines will accelerate the adoption of big data analysis, particularly with machine learning methods, in the biomedical research community.
format Online
Article
Text
id pubmed-5238707
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-52387072017-01-30 Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View Luo, Wei Phung, Dinh Tran, Truyen Gupta, Sunil Rana, Santu Karmakar, Chandan Shilton, Alistair Yearwood, John Dimitrova, Nevenka Ho, Tu Bao Venkatesh, Svetha Berk, Michael J Med Internet Res Original Paper BACKGROUND: As more and more researchers are turning to big data for new opportunities of biomedical discoveries, machine learning models, as the backbone of big data analysis, are mentioned more often in biomedical journals. However, owing to the inherent complexity of machine learning methods, they are prone to misuse. Because of the flexibility in specifying machine learning models, the results are often insufficiently reported in research articles, hindering reliable assessment of model validity and consistent interpretation of model outputs. OBJECTIVE: To attain a set of guidelines on the use of machine learning predictive models within clinical settings to make sure the models are correctly applied and sufficiently reported so that true discoveries can be distinguished from random coincidence. METHODS: A multidisciplinary panel of machine learning experts, clinicians, and traditional statisticians were interviewed, using an iterative process in accordance with the Delphi method. RESULTS: The process produced a set of guidelines that consists of (1) a list of reporting items to be included in a research article and (2) a set of practical sequential steps for developing predictive models. CONCLUSIONS: A set of guidelines was generated to enable correct application of machine learning models and consistent reporting of model specifications and results in biomedical research. We believe that such guidelines will accelerate the adoption of big data analysis, particularly with machine learning methods, in the biomedical research community. JMIR Publications 2016-12-16 /pmc/articles/PMC5238707/ /pubmed/27986644 http://dx.doi.org/10.2196/jmir.5870 Text en ©Wei Luo, Dinh Phung, Truyen Tran, Sunil Gupta, Santu Rana, Chandan Karmakar, Alistair Shilton, John Yearwood, Nevenka Dimitrova, Tu Bao Ho, Svetha Venkatesh, Michael Berk. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 16.12.2016. http://creativecommons.org/licenses/by/2.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Luo, Wei
Phung, Dinh
Tran, Truyen
Gupta, Sunil
Rana, Santu
Karmakar, Chandan
Shilton, Alistair
Yearwood, John
Dimitrova, Nevenka
Ho, Tu Bao
Venkatesh, Svetha
Berk, Michael
Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View
title Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View
title_full Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View
title_fullStr Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View
title_full_unstemmed Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View
title_short Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View
title_sort guidelines for developing and reporting machine learning predictive models in biomedical research: a multidisciplinary view
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5238707/
https://www.ncbi.nlm.nih.gov/pubmed/27986644
http://dx.doi.org/10.2196/jmir.5870
work_keys_str_mv AT luowei guidelinesfordevelopingandreportingmachinelearningpredictivemodelsinbiomedicalresearchamultidisciplinaryview
AT phungdinh guidelinesfordevelopingandreportingmachinelearningpredictivemodelsinbiomedicalresearchamultidisciplinaryview
AT trantruyen guidelinesfordevelopingandreportingmachinelearningpredictivemodelsinbiomedicalresearchamultidisciplinaryview
AT guptasunil guidelinesfordevelopingandreportingmachinelearningpredictivemodelsinbiomedicalresearchamultidisciplinaryview
AT ranasantu guidelinesfordevelopingandreportingmachinelearningpredictivemodelsinbiomedicalresearchamultidisciplinaryview
AT karmakarchandan guidelinesfordevelopingandreportingmachinelearningpredictivemodelsinbiomedicalresearchamultidisciplinaryview
AT shiltonalistair guidelinesfordevelopingandreportingmachinelearningpredictivemodelsinbiomedicalresearchamultidisciplinaryview
AT yearwoodjohn guidelinesfordevelopingandreportingmachinelearningpredictivemodelsinbiomedicalresearchamultidisciplinaryview
AT dimitrovanevenka guidelinesfordevelopingandreportingmachinelearningpredictivemodelsinbiomedicalresearchamultidisciplinaryview
AT hotubao guidelinesfordevelopingandreportingmachinelearningpredictivemodelsinbiomedicalresearchamultidisciplinaryview
AT venkateshsvetha guidelinesfordevelopingandreportingmachinelearningpredictivemodelsinbiomedicalresearchamultidisciplinaryview
AT berkmichael guidelinesfordevelopingandreportingmachinelearningpredictivemodelsinbiomedicalresearchamultidisciplinaryview