Cargando…

PredicT-ML: a tool for automating machine learning model building with big clinical data

BACKGROUND: Predictive modeling is fundamental to transforming large clinical data sets, or “big clinical data,” into actionable knowledge for various healthcare applications. Machine learning is a major predictive modeling approach, but two barriers make its use in healthcare challenging. First, a...

Descripción completa

Detalles Bibliográficos
Autor principal: Luo, Gang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4897944/
https://www.ncbi.nlm.nih.gov/pubmed/27280018
http://dx.doi.org/10.1186/s13755-016-0018-1
_version_ 1782436267534843904
author Luo, Gang
author_facet Luo, Gang
author_sort Luo, Gang
collection PubMed
description BACKGROUND: Predictive modeling is fundamental to transforming large clinical data sets, or “big clinical data,” into actionable knowledge for various healthcare applications. Machine learning is a major predictive modeling approach, but two barriers make its use in healthcare challenging. First, a machine learning tool user must choose an algorithm and assign one or more model parameters called hyper-parameters before model training. The algorithm and hyper-parameter values used typically impact model accuracy by over 40 %, but their selection requires many labor-intensive manual iterations that can be difficult even for computer scientists. Second, many clinical attributes are repeatedly recorded over time, requiring temporal aggregation before predictive modeling can be performed. Many labor-intensive manual iterations are required to identify a good pair of aggregation period and operator for each clinical attribute. Both barriers result in time and human resource bottlenecks, and preclude healthcare administrators and researchers from asking a series of what-if questions when probing opportunities to use predictive models to improve outcomes and reduce costs. METHODS: This paper describes our design of and vision for PredicT-ML (prediction tool using machine learning), a software system that aims to overcome these barriers and automate machine learning model building with big clinical data. RESULTS: The paper presents the detailed design of PredicT-ML. CONCLUSIONS: PredicT-ML will open the use of big clinical data to thousands of healthcare administrators and researchers and increase the ability to advance clinical research and improve healthcare.
format Online
Article
Text
id pubmed-4897944
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-48979442016-06-09 PredicT-ML: a tool for automating machine learning model building with big clinical data Luo, Gang Health Inf Sci Syst Research BACKGROUND: Predictive modeling is fundamental to transforming large clinical data sets, or “big clinical data,” into actionable knowledge for various healthcare applications. Machine learning is a major predictive modeling approach, but two barriers make its use in healthcare challenging. First, a machine learning tool user must choose an algorithm and assign one or more model parameters called hyper-parameters before model training. The algorithm and hyper-parameter values used typically impact model accuracy by over 40 %, but their selection requires many labor-intensive manual iterations that can be difficult even for computer scientists. Second, many clinical attributes are repeatedly recorded over time, requiring temporal aggregation before predictive modeling can be performed. Many labor-intensive manual iterations are required to identify a good pair of aggregation period and operator for each clinical attribute. Both barriers result in time and human resource bottlenecks, and preclude healthcare administrators and researchers from asking a series of what-if questions when probing opportunities to use predictive models to improve outcomes and reduce costs. METHODS: This paper describes our design of and vision for PredicT-ML (prediction tool using machine learning), a software system that aims to overcome these barriers and automate machine learning model building with big clinical data. RESULTS: The paper presents the detailed design of PredicT-ML. CONCLUSIONS: PredicT-ML will open the use of big clinical data to thousands of healthcare administrators and researchers and increase the ability to advance clinical research and improve healthcare. BioMed Central 2016-06-08 /pmc/articles/PMC4897944/ /pubmed/27280018 http://dx.doi.org/10.1186/s13755-016-0018-1 Text en © The Author(s) 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Luo, Gang
PredicT-ML: a tool for automating machine learning model building with big clinical data
title PredicT-ML: a tool for automating machine learning model building with big clinical data
title_full PredicT-ML: a tool for automating machine learning model building with big clinical data
title_fullStr PredicT-ML: a tool for automating machine learning model building with big clinical data
title_full_unstemmed PredicT-ML: a tool for automating machine learning model building with big clinical data
title_short PredicT-ML: a tool for automating machine learning model building with big clinical data
title_sort predict-ml: a tool for automating machine learning model building with big clinical data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4897944/
https://www.ncbi.nlm.nih.gov/pubmed/27280018
http://dx.doi.org/10.1186/s13755-016-0018-1
work_keys_str_mv AT luogang predictmlatoolforautomatingmachinelearningmodelbuildingwithbigclinicaldata