Cargando…
Persistent homology classification algorithm
Data classification is an important aspect of machine learning, as it is utilized to solve issues in a wide variety of contexts. There are numerous classifiers, but there is no single best-performing classifier for all types of data, as the no free lunch theorem implies. Topological data analysis is...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280283/ https://www.ncbi.nlm.nih.gov/pubmed/37346603 http://dx.doi.org/10.7717/peerj-cs.1195 |
_version_ | 1785060764252372992 |
---|---|
author | De Lara, Mark Lexter D. |
author_facet | De Lara, Mark Lexter D. |
author_sort | De Lara, Mark Lexter D. |
collection | PubMed |
description | Data classification is an important aspect of machine learning, as it is utilized to solve issues in a wide variety of contexts. There are numerous classifiers, but there is no single best-performing classifier for all types of data, as the no free lunch theorem implies. Topological data analysis is an emerging topic concerned with the shape of data. One of the key tools in this field for analyzing the shape or topological properties of a dataset is persistent homology, an algebraic topology-based method for estimating the topological features of a space of points that persists across several resolutions. This study proposes a supervised learning classification algorithm that makes use of persistent homology between training data classes in the form of persistence diagrams to predict the output category of new observations. Validation of the developed algorithm was performed on real-world and synthetic datasets. The performance of the proposed classification algorithm on these datasets was compared to that of the most widely used classifiers. Validation runs demonstrated that the proposed persistent homology classification algorithm performed at par if not better than the majority of classifiers considered. |
format | Online Article Text |
id | pubmed-10280283 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-102802832023-06-21 Persistent homology classification algorithm De Lara, Mark Lexter D. PeerJ Comput Sci Algorithms and Analysis of Algorithms Data classification is an important aspect of machine learning, as it is utilized to solve issues in a wide variety of contexts. There are numerous classifiers, but there is no single best-performing classifier for all types of data, as the no free lunch theorem implies. Topological data analysis is an emerging topic concerned with the shape of data. One of the key tools in this field for analyzing the shape or topological properties of a dataset is persistent homology, an algebraic topology-based method for estimating the topological features of a space of points that persists across several resolutions. This study proposes a supervised learning classification algorithm that makes use of persistent homology between training data classes in the form of persistence diagrams to predict the output category of new observations. Validation of the developed algorithm was performed on real-world and synthetic datasets. The performance of the proposed classification algorithm on these datasets was compared to that of the most widely used classifiers. Validation runs demonstrated that the proposed persistent homology classification algorithm performed at par if not better than the majority of classifiers considered. PeerJ Inc. 2023-01-10 /pmc/articles/PMC10280283/ /pubmed/37346603 http://dx.doi.org/10.7717/peerj-cs.1195 Text en © 2023 De Lara https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Algorithms and Analysis of Algorithms De Lara, Mark Lexter D. Persistent homology classification algorithm |
title | Persistent homology classification algorithm |
title_full | Persistent homology classification algorithm |
title_fullStr | Persistent homology classification algorithm |
title_full_unstemmed | Persistent homology classification algorithm |
title_short | Persistent homology classification algorithm |
title_sort | persistent homology classification algorithm |
topic | Algorithms and Analysis of Algorithms |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280283/ https://www.ncbi.nlm.nih.gov/pubmed/37346603 http://dx.doi.org/10.7717/peerj-cs.1195 |
work_keys_str_mv | AT delaramarklexterd persistenthomologyclassificationalgorithm |