Cargando…
Technical Report: Machine-Learning Pipeline for Medical Research and Quality-Improvement Initiatives
Machine-learning techniques have been increasing in popularity within medicine during the past decade. However, these computational techniques are not presented in statistical lectures throughout medical school and are perceived to have a high barrier to entry. The objective is to develop a concise...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cureus
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10625496/ https://www.ncbi.nlm.nih.gov/pubmed/37933338 http://dx.doi.org/10.7759/cureus.46549 |
_version_ | 1785131143276789760 |
---|---|
author | Huang, Alexander A Huang, Samuel Y |
author_facet | Huang, Alexander A Huang, Samuel Y |
author_sort | Huang, Alexander A |
collection | PubMed |
description | Machine-learning techniques have been increasing in popularity within medicine during the past decade. However, these computational techniques are not presented in statistical lectures throughout medical school and are perceived to have a high barrier to entry. The objective is to develop a concise pipeline with publicly available data to decrease the learning time towards using machine learning for medical research and quality-improvement initiatives. This report utilized a publicly available machine-learning data package in R (MLDataR) and computational packages (XGBoost) to highlight techniques for machine-learning model development and visualization with SHaply Additive exPlanations (SHAP). A simple six-step process along with example code was constructed to build and visualize machine-learning models. A concrete set of three steps was developed to help with interpretation. Further teaching of these methods could benefit researchers by providing alternative methods for data analysis in medical studies. These could help researchers without computational experience to get a feel for machine learning to better understand the literature and technique. |
format | Online Article Text |
id | pubmed-10625496 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cureus |
record_format | MEDLINE/PubMed |
spelling | pubmed-106254962023-11-06 Technical Report: Machine-Learning Pipeline for Medical Research and Quality-Improvement Initiatives Huang, Alexander A Huang, Samuel Y Cureus Internal Medicine Machine-learning techniques have been increasing in popularity within medicine during the past decade. However, these computational techniques are not presented in statistical lectures throughout medical school and are perceived to have a high barrier to entry. The objective is to develop a concise pipeline with publicly available data to decrease the learning time towards using machine learning for medical research and quality-improvement initiatives. This report utilized a publicly available machine-learning data package in R (MLDataR) and computational packages (XGBoost) to highlight techniques for machine-learning model development and visualization with SHaply Additive exPlanations (SHAP). A simple six-step process along with example code was constructed to build and visualize machine-learning models. A concrete set of three steps was developed to help with interpretation. Further teaching of these methods could benefit researchers by providing alternative methods for data analysis in medical studies. These could help researchers without computational experience to get a feel for machine learning to better understand the literature and technique. Cureus 2023-10-05 /pmc/articles/PMC10625496/ /pubmed/37933338 http://dx.doi.org/10.7759/cureus.46549 Text en Copyright © 2023, Huang et al. https://creativecommons.org/licenses/by/3.0/This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Internal Medicine Huang, Alexander A Huang, Samuel Y Technical Report: Machine-Learning Pipeline for Medical Research and Quality-Improvement Initiatives |
title | Technical Report: Machine-Learning Pipeline for Medical Research and Quality-Improvement Initiatives |
title_full | Technical Report: Machine-Learning Pipeline for Medical Research and Quality-Improvement Initiatives |
title_fullStr | Technical Report: Machine-Learning Pipeline for Medical Research and Quality-Improvement Initiatives |
title_full_unstemmed | Technical Report: Machine-Learning Pipeline for Medical Research and Quality-Improvement Initiatives |
title_short | Technical Report: Machine-Learning Pipeline for Medical Research and Quality-Improvement Initiatives |
title_sort | technical report: machine-learning pipeline for medical research and quality-improvement initiatives |
topic | Internal Medicine |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10625496/ https://www.ncbi.nlm.nih.gov/pubmed/37933338 http://dx.doi.org/10.7759/cureus.46549 |
work_keys_str_mv | AT huangalexandera technicalreportmachinelearningpipelineformedicalresearchandqualityimprovementinitiatives AT huangsamuely technicalreportmachinelearningpipelineformedicalresearchandqualityimprovementinitiatives |