Cargando…

Segmentation of High Dimensional Time-Series Data Using Mixture of Sparse Principal Component Regression Model with Information Complexity

This paper presents a new and novel hybrid modeling method for the segmentation of high dimensional time-series data using the mixture of the sparse principal components regression (MIX-SPCR) model with information complexity ([Formula: see text]) criterion as the fitness function. Our approach enco...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Yaojin, Bozdogan, Hamparsum
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7597341/
https://www.ncbi.nlm.nih.gov/pubmed/33286939
http://dx.doi.org/10.3390/e22101170
_version_ 1783602326697672704
author Sun, Yaojin
Bozdogan, Hamparsum
author_facet Sun, Yaojin
Bozdogan, Hamparsum
author_sort Sun, Yaojin
collection PubMed
description This paper presents a new and novel hybrid modeling method for the segmentation of high dimensional time-series data using the mixture of the sparse principal components regression (MIX-SPCR) model with information complexity ([Formula: see text]) criterion as the fitness function. Our approach encompasses dimension reduction in high dimensional time-series data and, at the same time, determines the number of component clusters (i.e., number of segments across time-series data) and selects the best subset of predictors. A large-scale Monte Carlo simulation is performed to show the capability of the MIX-SPCR model to identify the correct structure of the time-series data successfully. MIX-SPCR model is also applied to a high dimensional Standard & Poor’s 500 (S&P 500) index data to uncover the time-series’s hidden structure and identify the structure change points. The approach presented in this paper determines both the relationships among the predictor variables and how various predictor variables contribute to the explanatory power of the response variable through the sparsity settings cluster wise.
format Online
Article
Text
id pubmed-7597341
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75973412020-11-09 Segmentation of High Dimensional Time-Series Data Using Mixture of Sparse Principal Component Regression Model with Information Complexity Sun, Yaojin Bozdogan, Hamparsum Entropy (Basel) Article This paper presents a new and novel hybrid modeling method for the segmentation of high dimensional time-series data using the mixture of the sparse principal components regression (MIX-SPCR) model with information complexity ([Formula: see text]) criterion as the fitness function. Our approach encompasses dimension reduction in high dimensional time-series data and, at the same time, determines the number of component clusters (i.e., number of segments across time-series data) and selects the best subset of predictors. A large-scale Monte Carlo simulation is performed to show the capability of the MIX-SPCR model to identify the correct structure of the time-series data successfully. MIX-SPCR model is also applied to a high dimensional Standard & Poor’s 500 (S&P 500) index data to uncover the time-series’s hidden structure and identify the structure change points. The approach presented in this paper determines both the relationships among the predictor variables and how various predictor variables contribute to the explanatory power of the response variable through the sparsity settings cluster wise. MDPI 2020-10-17 /pmc/articles/PMC7597341/ /pubmed/33286939 http://dx.doi.org/10.3390/e22101170 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Sun, Yaojin
Bozdogan, Hamparsum
Segmentation of High Dimensional Time-Series Data Using Mixture of Sparse Principal Component Regression Model with Information Complexity
title Segmentation of High Dimensional Time-Series Data Using Mixture of Sparse Principal Component Regression Model with Information Complexity
title_full Segmentation of High Dimensional Time-Series Data Using Mixture of Sparse Principal Component Regression Model with Information Complexity
title_fullStr Segmentation of High Dimensional Time-Series Data Using Mixture of Sparse Principal Component Regression Model with Information Complexity
title_full_unstemmed Segmentation of High Dimensional Time-Series Data Using Mixture of Sparse Principal Component Regression Model with Information Complexity
title_short Segmentation of High Dimensional Time-Series Data Using Mixture of Sparse Principal Component Regression Model with Information Complexity
title_sort segmentation of high dimensional time-series data using mixture of sparse principal component regression model with information complexity
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7597341/
https://www.ncbi.nlm.nih.gov/pubmed/33286939
http://dx.doi.org/10.3390/e22101170
work_keys_str_mv AT sunyaojin segmentationofhighdimensionaltimeseriesdatausingmixtureofsparseprincipalcomponentregressionmodelwithinformationcomplexity
AT bozdoganhamparsum segmentationofhighdimensionaltimeseriesdatausingmixtureofsparseprincipalcomponentregressionmodelwithinformationcomplexity