Cargando…

Time Series Analysis and Forecasting with Automated Machine Learning on a National ICD-10 Database

The application of machine learning (ML) for use in generating insights and making predictions on new records continues to expand within the medical community. Despite this progress to date, the application of time series analysis has remained underexplored due to complexity of the underlying techni...

Descripción completa

Detalles Bibliográficos
Autores principales: Olsavszky, Victor, Dosius, Mihnea, Vladescu, Cristian, Benecke, Johannes
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7400312/
https://www.ncbi.nlm.nih.gov/pubmed/32664331
http://dx.doi.org/10.3390/ijerph17144979
_version_ 1783566334738563072
author Olsavszky, Victor
Dosius, Mihnea
Vladescu, Cristian
Benecke, Johannes
author_facet Olsavszky, Victor
Dosius, Mihnea
Vladescu, Cristian
Benecke, Johannes
author_sort Olsavszky, Victor
collection PubMed
description The application of machine learning (ML) for use in generating insights and making predictions on new records continues to expand within the medical community. Despite this progress to date, the application of time series analysis has remained underexplored due to complexity of the underlying techniques. In this study, we have deployed a novel ML, called automated time series (AutoTS) machine learning, to automate data processing and the application of a multitude of models to assess which best forecasts future values. This rapid experimentation allows for and enables the selection of the most accurate model in order to perform time series predictions. By using the nation-wide ICD-10 (International Classification of Diseases, Tenth Revision) dataset of hospitalized patients of Romania, we have generated time series datasets over the period of 2008–2018 and performed highly accurate AutoTS predictions for the ten deadliest diseases. Forecast results for the years 2019 and 2020 were generated on a NUTS 2 (Nomenclature of Territorial Units for Statistics) regional level. This is the first study to our knowledge to perform time series forecasting of multiple diseases at a regional level using automated time series machine learning on a national ICD-10 dataset. The deployment of AutoTS technology can help decision makers in implementing targeted national health policies more efficiently.
format Online
Article
Text
id pubmed-7400312
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-74003122020-08-23 Time Series Analysis and Forecasting with Automated Machine Learning on a National ICD-10 Database Olsavszky, Victor Dosius, Mihnea Vladescu, Cristian Benecke, Johannes Int J Environ Res Public Health Article The application of machine learning (ML) for use in generating insights and making predictions on new records continues to expand within the medical community. Despite this progress to date, the application of time series analysis has remained underexplored due to complexity of the underlying techniques. In this study, we have deployed a novel ML, called automated time series (AutoTS) machine learning, to automate data processing and the application of a multitude of models to assess which best forecasts future values. This rapid experimentation allows for and enables the selection of the most accurate model in order to perform time series predictions. By using the nation-wide ICD-10 (International Classification of Diseases, Tenth Revision) dataset of hospitalized patients of Romania, we have generated time series datasets over the period of 2008–2018 and performed highly accurate AutoTS predictions for the ten deadliest diseases. Forecast results for the years 2019 and 2020 were generated on a NUTS 2 (Nomenclature of Territorial Units for Statistics) regional level. This is the first study to our knowledge to perform time series forecasting of multiple diseases at a regional level using automated time series machine learning on a national ICD-10 dataset. The deployment of AutoTS technology can help decision makers in implementing targeted national health policies more efficiently. MDPI 2020-07-10 2020-07 /pmc/articles/PMC7400312/ /pubmed/32664331 http://dx.doi.org/10.3390/ijerph17144979 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Olsavszky, Victor
Dosius, Mihnea
Vladescu, Cristian
Benecke, Johannes
Time Series Analysis and Forecasting with Automated Machine Learning on a National ICD-10 Database
title Time Series Analysis and Forecasting with Automated Machine Learning on a National ICD-10 Database
title_full Time Series Analysis and Forecasting with Automated Machine Learning on a National ICD-10 Database
title_fullStr Time Series Analysis and Forecasting with Automated Machine Learning on a National ICD-10 Database
title_full_unstemmed Time Series Analysis and Forecasting with Automated Machine Learning on a National ICD-10 Database
title_short Time Series Analysis and Forecasting with Automated Machine Learning on a National ICD-10 Database
title_sort time series analysis and forecasting with automated machine learning on a national icd-10 database
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7400312/
https://www.ncbi.nlm.nih.gov/pubmed/32664331
http://dx.doi.org/10.3390/ijerph17144979
work_keys_str_mv AT olsavszkyvictor timeseriesanalysisandforecastingwithautomatedmachinelearningonanationalicd10database
AT dosiusmihnea timeseriesanalysisandforecastingwithautomatedmachinelearningonanationalicd10database
AT vladescucristian timeseriesanalysisandforecastingwithautomatedmachinelearningonanationalicd10database
AT beneckejohannes timeseriesanalysisandforecastingwithautomatedmachinelearningonanationalicd10database