Cargando…

Representing the dynamics of high-dimensional data with non-redundant wavelets

A crucial question in data science is to extract meaningful information embedded in high-dimensional data into a low-dimensional set of features that can represent the original data at different levels. Wavelet analysis is a pervasive method for decomposing time-series signals into a few levels with...

Descripción completa

Detalles Bibliográficos
Autores principales: Jia, Shanshan, Li, Xingyi, Huang, Tiejun, Liu, Jian K., Yu, Zhaofei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9058841/
https://www.ncbi.nlm.nih.gov/pubmed/35510192
http://dx.doi.org/10.1016/j.patter.2021.100424
_version_ 1784698200405311488
author Jia, Shanshan
Li, Xingyi
Huang, Tiejun
Liu, Jian K.
Yu, Zhaofei
author_facet Jia, Shanshan
Li, Xingyi
Huang, Tiejun
Liu, Jian K.
Yu, Zhaofei
author_sort Jia, Shanshan
collection PubMed
description A crucial question in data science is to extract meaningful information embedded in high-dimensional data into a low-dimensional set of features that can represent the original data at different levels. Wavelet analysis is a pervasive method for decomposing time-series signals into a few levels with detailed temporal resolution. However, obtained wavelets are intertwined and over-represented across levels for each sample and across different samples within one population. Here, using neuroscience data of simulated spikes, experimental spikes, calcium imaging signals, and human electrocorticography signals, we leveraged conditional mutual information between wavelets for feature selection. The meaningfulness of selected features was verified to decode stimulus or condition with high accuracy yet using only a small set of features. These results provide a new way of wavelet analysis for extracting essential features of the dynamics of spatiotemporal neural data, which then enables to support novel model design of machine learning with representative features.
format Online
Article
Text
id pubmed-9058841
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-90588412022-05-03 Representing the dynamics of high-dimensional data with non-redundant wavelets Jia, Shanshan Li, Xingyi Huang, Tiejun Liu, Jian K. Yu, Zhaofei Patterns (N Y) Article A crucial question in data science is to extract meaningful information embedded in high-dimensional data into a low-dimensional set of features that can represent the original data at different levels. Wavelet analysis is a pervasive method for decomposing time-series signals into a few levels with detailed temporal resolution. However, obtained wavelets are intertwined and over-represented across levels for each sample and across different samples within one population. Here, using neuroscience data of simulated spikes, experimental spikes, calcium imaging signals, and human electrocorticography signals, we leveraged conditional mutual information between wavelets for feature selection. The meaningfulness of selected features was verified to decode stimulus or condition with high accuracy yet using only a small set of features. These results provide a new way of wavelet analysis for extracting essential features of the dynamics of spatiotemporal neural data, which then enables to support novel model design of machine learning with representative features. Elsevier 2022-01-06 /pmc/articles/PMC9058841/ /pubmed/35510192 http://dx.doi.org/10.1016/j.patter.2021.100424 Text en © 2021 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Jia, Shanshan
Li, Xingyi
Huang, Tiejun
Liu, Jian K.
Yu, Zhaofei
Representing the dynamics of high-dimensional data with non-redundant wavelets
title Representing the dynamics of high-dimensional data with non-redundant wavelets
title_full Representing the dynamics of high-dimensional data with non-redundant wavelets
title_fullStr Representing the dynamics of high-dimensional data with non-redundant wavelets
title_full_unstemmed Representing the dynamics of high-dimensional data with non-redundant wavelets
title_short Representing the dynamics of high-dimensional data with non-redundant wavelets
title_sort representing the dynamics of high-dimensional data with non-redundant wavelets
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9058841/
https://www.ncbi.nlm.nih.gov/pubmed/35510192
http://dx.doi.org/10.1016/j.patter.2021.100424
work_keys_str_mv AT jiashanshan representingthedynamicsofhighdimensionaldatawithnonredundantwavelets
AT lixingyi representingthedynamicsofhighdimensionaldatawithnonredundantwavelets
AT huangtiejun representingthedynamicsofhighdimensionaldatawithnonredundantwavelets
AT liujiank representingthedynamicsofhighdimensionaldatawithnonredundantwavelets
AT yuzhaofei representingthedynamicsofhighdimensionaldatawithnonredundantwavelets