Cargando…
Euler characteristic curves and profiles: a stable shape invariant for big data problems
Tools of topological data analysis provide stable summaries encapsulating the shape of the considered data. Persistent homology, the most standard and well-studied data summary, suffers a number of limitations; its computations are hard to distribute, and it is hard to generalize to multifiltrations...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10646871/ https://www.ncbi.nlm.nih.gov/pubmed/37966428 http://dx.doi.org/10.1093/gigascience/giad094 |
_version_ | 1785147467523686400 |
---|---|
author | Dłotko, Paweł Gurnari, Davide |
author_facet | Dłotko, Paweł Gurnari, Davide |
author_sort | Dłotko, Paweł |
collection | PubMed |
description | Tools of topological data analysis provide stable summaries encapsulating the shape of the considered data. Persistent homology, the most standard and well-studied data summary, suffers a number of limitations; its computations are hard to distribute, and it is hard to generalize to multifiltrations and is computationally prohibitive for big datasets. In this article, we study the concept of Euler characteristics curves for 1-parameter filtrations and Euler characteristic profiles for multiparameter filtrations. While being a weaker invariant in one dimension, we show that Euler characteristic–based approaches do not possess some handicaps of persistent homology; we show efficient algorithms to compute them in a distributed way, their generalization to multifiltrations, and practical applicability for big data problems. In addition, we show that the Euler curves and profiles enjoy a certain type of stability, which makes them robust tools for data analysis. Lastly, to show their practical applicability, multiple use cases are considered. |
format | Online Article Text |
id | pubmed-10646871 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-106468712023-11-15 Euler characteristic curves and profiles: a stable shape invariant for big data problems Dłotko, Paweł Gurnari, Davide Gigascience Research Tools of topological data analysis provide stable summaries encapsulating the shape of the considered data. Persistent homology, the most standard and well-studied data summary, suffers a number of limitations; its computations are hard to distribute, and it is hard to generalize to multifiltrations and is computationally prohibitive for big datasets. In this article, we study the concept of Euler characteristics curves for 1-parameter filtrations and Euler characteristic profiles for multiparameter filtrations. While being a weaker invariant in one dimension, we show that Euler characteristic–based approaches do not possess some handicaps of persistent homology; we show efficient algorithms to compute them in a distributed way, their generalization to multifiltrations, and practical applicability for big data problems. In addition, we show that the Euler curves and profiles enjoy a certain type of stability, which makes them robust tools for data analysis. Lastly, to show their practical applicability, multiple use cases are considered. Oxford University Press 2023-11-15 /pmc/articles/PMC10646871/ /pubmed/37966428 http://dx.doi.org/10.1093/gigascience/giad094 Text en © The Author(s) 2023. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Dłotko, Paweł Gurnari, Davide Euler characteristic curves and profiles: a stable shape invariant for big data problems |
title | Euler characteristic curves and profiles: a stable shape invariant for big data problems |
title_full | Euler characteristic curves and profiles: a stable shape invariant for big data problems |
title_fullStr | Euler characteristic curves and profiles: a stable shape invariant for big data problems |
title_full_unstemmed | Euler characteristic curves and profiles: a stable shape invariant for big data problems |
title_short | Euler characteristic curves and profiles: a stable shape invariant for big data problems |
title_sort | euler characteristic curves and profiles: a stable shape invariant for big data problems |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10646871/ https://www.ncbi.nlm.nih.gov/pubmed/37966428 http://dx.doi.org/10.1093/gigascience/giad094 |
work_keys_str_mv | AT dłotkopaweł eulercharacteristiccurvesandprofilesastableshapeinvariantforbigdataproblems AT gurnaridavide eulercharacteristiccurvesandprofilesastableshapeinvariantforbigdataproblems |