Cargando…
Manifold-adaptive dimension estimation revisited
Data dimensionality informs us about data complexity and sets limit on the structure of successful signal processing pipelines. In this work we revisit and improve the manifold adaptive Farahmand-Szepesvári-Audibert (FSA) dimension estimator, making it one of the best nearest neighbor-based dimensio...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8771813/ https://www.ncbi.nlm.nih.gov/pubmed/35111907 http://dx.doi.org/10.7717/peerj-cs.790 |
_version_ | 1784635697000349696 |
---|---|
author | Benkő, Zsigmond Stippinger, Marcell Rehus, Roberta Bencze, Attila Fabó, Dániel Hajnal, Boglárka Eröss, Loránd G. Telcs, András Somogyvári, Zoltán |
author_facet | Benkő, Zsigmond Stippinger, Marcell Rehus, Roberta Bencze, Attila Fabó, Dániel Hajnal, Boglárka Eröss, Loránd G. Telcs, András Somogyvári, Zoltán |
author_sort | Benkő, Zsigmond |
collection | PubMed |
description | Data dimensionality informs us about data complexity and sets limit on the structure of successful signal processing pipelines. In this work we revisit and improve the manifold adaptive Farahmand-Szepesvári-Audibert (FSA) dimension estimator, making it one of the best nearest neighbor-based dimension estimators available. We compute the probability density function of local FSA estimates, if the local manifold density is uniform. Based on the probability density function, we propose to use the median of local estimates as a basic global measure of intrinsic dimensionality, and we demonstrate the advantages of this asymptotically unbiased estimator over the previously proposed statistics: the mode and the mean. Additionally, from the probability density function, we derive the maximum likelihood formula for global intrinsic dimensionality, if i.i.d. holds. We tackle edge and finite-sample effects with an exponential correction formula, calibrated on hypercube datasets. We compare the performance of the corrected median-FSA estimator with kNN estimators: maximum likelihood (Levina-Bickel), the 2NN and two implementations of DANCo (R and MATLAB). We show that corrected median-FSA estimator beats the maximum likelihood estimator and it is on equal footing with DANCo for standard synthetic benchmarks according to mean percentage error and error rate metrics. With the median-FSA algorithm, we reveal diverse changes in the neural dynamics while resting state and during epileptic seizures. We identify brain areas with lower-dimensional dynamics that are possible causal sources and candidates for being seizure onset zones. |
format | Online Article Text |
id | pubmed-8771813 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-87718132022-02-01 Manifold-adaptive dimension estimation revisited Benkő, Zsigmond Stippinger, Marcell Rehus, Roberta Bencze, Attila Fabó, Dániel Hajnal, Boglárka Eröss, Loránd G. Telcs, András Somogyvári, Zoltán PeerJ Comput Sci Brain-Computer Interface Data dimensionality informs us about data complexity and sets limit on the structure of successful signal processing pipelines. In this work we revisit and improve the manifold adaptive Farahmand-Szepesvári-Audibert (FSA) dimension estimator, making it one of the best nearest neighbor-based dimension estimators available. We compute the probability density function of local FSA estimates, if the local manifold density is uniform. Based on the probability density function, we propose to use the median of local estimates as a basic global measure of intrinsic dimensionality, and we demonstrate the advantages of this asymptotically unbiased estimator over the previously proposed statistics: the mode and the mean. Additionally, from the probability density function, we derive the maximum likelihood formula for global intrinsic dimensionality, if i.i.d. holds. We tackle edge and finite-sample effects with an exponential correction formula, calibrated on hypercube datasets. We compare the performance of the corrected median-FSA estimator with kNN estimators: maximum likelihood (Levina-Bickel), the 2NN and two implementations of DANCo (R and MATLAB). We show that corrected median-FSA estimator beats the maximum likelihood estimator and it is on equal footing with DANCo for standard synthetic benchmarks according to mean percentage error and error rate metrics. With the median-FSA algorithm, we reveal diverse changes in the neural dynamics while resting state and during epileptic seizures. We identify brain areas with lower-dimensional dynamics that are possible causal sources and candidates for being seizure onset zones. PeerJ Inc. 2022-01-06 /pmc/articles/PMC8771813/ /pubmed/35111907 http://dx.doi.org/10.7717/peerj-cs.790 Text en ©2022 Benkő et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Brain-Computer Interface Benkő, Zsigmond Stippinger, Marcell Rehus, Roberta Bencze, Attila Fabó, Dániel Hajnal, Boglárka Eröss, Loránd G. Telcs, András Somogyvári, Zoltán Manifold-adaptive dimension estimation revisited |
title | Manifold-adaptive dimension estimation revisited |
title_full | Manifold-adaptive dimension estimation revisited |
title_fullStr | Manifold-adaptive dimension estimation revisited |
title_full_unstemmed | Manifold-adaptive dimension estimation revisited |
title_short | Manifold-adaptive dimension estimation revisited |
title_sort | manifold-adaptive dimension estimation revisited |
topic | Brain-Computer Interface |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8771813/ https://www.ncbi.nlm.nih.gov/pubmed/35111907 http://dx.doi.org/10.7717/peerj-cs.790 |
work_keys_str_mv | AT benkozsigmond manifoldadaptivedimensionestimationrevisited AT stippingermarcell manifoldadaptivedimensionestimationrevisited AT rehusroberta manifoldadaptivedimensionestimationrevisited AT benczeattila manifoldadaptivedimensionestimationrevisited AT fabodaniel manifoldadaptivedimensionestimationrevisited AT hajnalboglarka manifoldadaptivedimensionestimationrevisited AT erosslorandg manifoldadaptivedimensionestimationrevisited AT telcsandras manifoldadaptivedimensionestimationrevisited AT somogyvarizoltan manifoldadaptivedimensionestimationrevisited |