Cargando…

Technical Report: Machine-Learning Pipeline for Medical Research and Quality-Improvement Initiatives

Machine-learning techniques have been increasing in popularity within medicine during the past decade. However, these computational techniques are not presented in statistical lectures throughout medical school and are perceived to have a high barrier to entry. The objective is to develop a concise...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Alexander A, Huang, Samuel Y
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cureus 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10625496/
https://www.ncbi.nlm.nih.gov/pubmed/37933338
http://dx.doi.org/10.7759/cureus.46549
_version_ 1785131143276789760
author Huang, Alexander A
Huang, Samuel Y
author_facet Huang, Alexander A
Huang, Samuel Y
author_sort Huang, Alexander A
collection PubMed
description Machine-learning techniques have been increasing in popularity within medicine during the past decade. However, these computational techniques are not presented in statistical lectures throughout medical school and are perceived to have a high barrier to entry. The objective is to develop a concise pipeline with publicly available data to decrease the learning time towards using machine learning for medical research and quality-improvement initiatives. This report utilized a publicly available machine-learning data package in R (MLDataR) and computational packages (XGBoost) to highlight techniques for machine-learning model development and visualization with SHaply Additive exPlanations (SHAP). A simple six-step process along with example code was constructed to build and visualize machine-learning models. A concrete set of three steps was developed to help with interpretation. Further teaching of these methods could benefit researchers by providing alternative methods for data analysis in medical studies. These could help researchers without computational experience to get a feel for machine learning to better understand the literature and technique.
format Online
Article
Text
id pubmed-10625496
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cureus
record_format MEDLINE/PubMed
spelling pubmed-106254962023-11-06 Technical Report: Machine-Learning Pipeline for Medical Research and Quality-Improvement Initiatives Huang, Alexander A Huang, Samuel Y Cureus Internal Medicine Machine-learning techniques have been increasing in popularity within medicine during the past decade. However, these computational techniques are not presented in statistical lectures throughout medical school and are perceived to have a high barrier to entry. The objective is to develop a concise pipeline with publicly available data to decrease the learning time towards using machine learning for medical research and quality-improvement initiatives. This report utilized a publicly available machine-learning data package in R (MLDataR) and computational packages (XGBoost) to highlight techniques for machine-learning model development and visualization with SHaply Additive exPlanations (SHAP). A simple six-step process along with example code was constructed to build and visualize machine-learning models. A concrete set of three steps was developed to help with interpretation. Further teaching of these methods could benefit researchers by providing alternative methods for data analysis in medical studies. These could help researchers without computational experience to get a feel for machine learning to better understand the literature and technique. Cureus 2023-10-05 /pmc/articles/PMC10625496/ /pubmed/37933338 http://dx.doi.org/10.7759/cureus.46549 Text en Copyright © 2023, Huang et al. https://creativecommons.org/licenses/by/3.0/This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Internal Medicine
Huang, Alexander A
Huang, Samuel Y
Technical Report: Machine-Learning Pipeline for Medical Research and Quality-Improvement Initiatives
title Technical Report: Machine-Learning Pipeline for Medical Research and Quality-Improvement Initiatives
title_full Technical Report: Machine-Learning Pipeline for Medical Research and Quality-Improvement Initiatives
title_fullStr Technical Report: Machine-Learning Pipeline for Medical Research and Quality-Improvement Initiatives
title_full_unstemmed Technical Report: Machine-Learning Pipeline for Medical Research and Quality-Improvement Initiatives
title_short Technical Report: Machine-Learning Pipeline for Medical Research and Quality-Improvement Initiatives
title_sort technical report: machine-learning pipeline for medical research and quality-improvement initiatives
topic Internal Medicine
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10625496/
https://www.ncbi.nlm.nih.gov/pubmed/37933338
http://dx.doi.org/10.7759/cureus.46549
work_keys_str_mv AT huangalexandera technicalreportmachinelearningpipelineformedicalresearchandqualityimprovementinitiatives
AT huangsamuely technicalreportmachinelearningpipelineformedicalresearchandqualityimprovementinitiatives