Cargando…

Persistent homology classification algorithm

Data classification is an important aspect of machine learning, as it is utilized to solve issues in a wide variety of contexts. There are numerous classifiers, but there is no single best-performing classifier for all types of data, as the no free lunch theorem implies. Topological data analysis is...

Descripción completa

Detalles Bibliográficos
Autor principal: De Lara, Mark Lexter D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280283/
https://www.ncbi.nlm.nih.gov/pubmed/37346603
http://dx.doi.org/10.7717/peerj-cs.1195
_version_ 1785060764252372992
author De Lara, Mark Lexter D.
author_facet De Lara, Mark Lexter D.
author_sort De Lara, Mark Lexter D.
collection PubMed
description Data classification is an important aspect of machine learning, as it is utilized to solve issues in a wide variety of contexts. There are numerous classifiers, but there is no single best-performing classifier for all types of data, as the no free lunch theorem implies. Topological data analysis is an emerging topic concerned with the shape of data. One of the key tools in this field for analyzing the shape or topological properties of a dataset is persistent homology, an algebraic topology-based method for estimating the topological features of a space of points that persists across several resolutions. This study proposes a supervised learning classification algorithm that makes use of persistent homology between training data classes in the form of persistence diagrams to predict the output category of new observations. Validation of the developed algorithm was performed on real-world and synthetic datasets. The performance of the proposed classification algorithm on these datasets was compared to that of the most widely used classifiers. Validation runs demonstrated that the proposed persistent homology classification algorithm performed at par if not better than the majority of classifiers considered.
format Online
Article
Text
id pubmed-10280283
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-102802832023-06-21 Persistent homology classification algorithm De Lara, Mark Lexter D. PeerJ Comput Sci Algorithms and Analysis of Algorithms Data classification is an important aspect of machine learning, as it is utilized to solve issues in a wide variety of contexts. There are numerous classifiers, but there is no single best-performing classifier for all types of data, as the no free lunch theorem implies. Topological data analysis is an emerging topic concerned with the shape of data. One of the key tools in this field for analyzing the shape or topological properties of a dataset is persistent homology, an algebraic topology-based method for estimating the topological features of a space of points that persists across several resolutions. This study proposes a supervised learning classification algorithm that makes use of persistent homology between training data classes in the form of persistence diagrams to predict the output category of new observations. Validation of the developed algorithm was performed on real-world and synthetic datasets. The performance of the proposed classification algorithm on these datasets was compared to that of the most widely used classifiers. Validation runs demonstrated that the proposed persistent homology classification algorithm performed at par if not better than the majority of classifiers considered. PeerJ Inc. 2023-01-10 /pmc/articles/PMC10280283/ /pubmed/37346603 http://dx.doi.org/10.7717/peerj-cs.1195 Text en © 2023 De Lara https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Algorithms and Analysis of Algorithms
De Lara, Mark Lexter D.
Persistent homology classification algorithm
title Persistent homology classification algorithm
title_full Persistent homology classification algorithm
title_fullStr Persistent homology classification algorithm
title_full_unstemmed Persistent homology classification algorithm
title_short Persistent homology classification algorithm
title_sort persistent homology classification algorithm
topic Algorithms and Analysis of Algorithms
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280283/
https://www.ncbi.nlm.nih.gov/pubmed/37346603
http://dx.doi.org/10.7717/peerj-cs.1195
work_keys_str_mv AT delaramarklexterd persistenthomologyclassificationalgorithm