Cargando…

A Multiattention-Based Supervised Feature Selection Method for Multivariate Time Series

Feature selection is a known technique to preprocess the data before performing any data mining task. In multivariate time series (MTS) prediction, feature selection needs to find both the most related variables and their corresponding delays. Both aspects, to a certain extent, represent essential c...

Descripción completa

Detalles Bibliográficos
Autores principales: Cao, Li, Chen, Yanting, Zhang, Zhiyang, Gui, Ning
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8318748/
https://www.ncbi.nlm.nih.gov/pubmed/34335722
http://dx.doi.org/10.1155/2021/6911192
_version_ 1783730307521839104
author Cao, Li
Chen, Yanting
Zhang, Zhiyang
Gui, Ning
author_facet Cao, Li
Chen, Yanting
Zhang, Zhiyang
Gui, Ning
author_sort Cao, Li
collection PubMed
description Feature selection is a known technique to preprocess the data before performing any data mining task. In multivariate time series (MTS) prediction, feature selection needs to find both the most related variables and their corresponding delays. Both aspects, to a certain extent, represent essential characteristics of system dynamics. However, the variable and delay selection for MTS is a challenging task when the system is nonlinear and noisy. In this paper, a multiattention-based supervised feature selection method is proposed. It translates the feature weight generation problem into a bidirectional attention generation problem with two parallel placed attention modules. The input 2D data are sliced into 1D data from two orthogonal directions, and each attention module generates attention weights from their respective dimensions. To facilitate the feature selection from the global perspective, we proposed a global weight generation method that calculates a dot product operation on the weight values of the two dimensions. To avoid the disturbance of attention weights due to noise and duplicated features, the final feature weight matrix is calculated based on the statistics of the entire training set. Experimental results show that this proposed method achieves the best performance on compared synthesized, small, medium, and practical industrial datasets, compared to several state-of-the-art baseline feature selection methods.
format Online
Article
Text
id pubmed-8318748
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-83187482021-07-31 A Multiattention-Based Supervised Feature Selection Method for Multivariate Time Series Cao, Li Chen, Yanting Zhang, Zhiyang Gui, Ning Comput Intell Neurosci Research Article Feature selection is a known technique to preprocess the data before performing any data mining task. In multivariate time series (MTS) prediction, feature selection needs to find both the most related variables and their corresponding delays. Both aspects, to a certain extent, represent essential characteristics of system dynamics. However, the variable and delay selection for MTS is a challenging task when the system is nonlinear and noisy. In this paper, a multiattention-based supervised feature selection method is proposed. It translates the feature weight generation problem into a bidirectional attention generation problem with two parallel placed attention modules. The input 2D data are sliced into 1D data from two orthogonal directions, and each attention module generates attention weights from their respective dimensions. To facilitate the feature selection from the global perspective, we proposed a global weight generation method that calculates a dot product operation on the weight values of the two dimensions. To avoid the disturbance of attention weights due to noise and duplicated features, the final feature weight matrix is calculated based on the statistics of the entire training set. Experimental results show that this proposed method achieves the best performance on compared synthesized, small, medium, and practical industrial datasets, compared to several state-of-the-art baseline feature selection methods. Hindawi 2021-07-20 /pmc/articles/PMC8318748/ /pubmed/34335722 http://dx.doi.org/10.1155/2021/6911192 Text en Copyright © 2021 Li Cao et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Cao, Li
Chen, Yanting
Zhang, Zhiyang
Gui, Ning
A Multiattention-Based Supervised Feature Selection Method for Multivariate Time Series
title A Multiattention-Based Supervised Feature Selection Method for Multivariate Time Series
title_full A Multiattention-Based Supervised Feature Selection Method for Multivariate Time Series
title_fullStr A Multiattention-Based Supervised Feature Selection Method for Multivariate Time Series
title_full_unstemmed A Multiattention-Based Supervised Feature Selection Method for Multivariate Time Series
title_short A Multiattention-Based Supervised Feature Selection Method for Multivariate Time Series
title_sort multiattention-based supervised feature selection method for multivariate time series
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8318748/
https://www.ncbi.nlm.nih.gov/pubmed/34335722
http://dx.doi.org/10.1155/2021/6911192
work_keys_str_mv AT caoli amultiattentionbasedsupervisedfeatureselectionmethodformultivariatetimeseries
AT chenyanting amultiattentionbasedsupervisedfeatureselectionmethodformultivariatetimeseries
AT zhangzhiyang amultiattentionbasedsupervisedfeatureselectionmethodformultivariatetimeseries
AT guining amultiattentionbasedsupervisedfeatureselectionmethodformultivariatetimeseries
AT caoli multiattentionbasedsupervisedfeatureselectionmethodformultivariatetimeseries
AT chenyanting multiattentionbasedsupervisedfeatureselectionmethodformultivariatetimeseries
AT zhangzhiyang multiattentionbasedsupervisedfeatureselectionmethodformultivariatetimeseries
AT guining multiattentionbasedsupervisedfeatureselectionmethodformultivariatetimeseries