Cargando…
On parsimony and clustering
This work is motivated by applications of parsimonious cladograms for the purpose of analyzing non-biological data. Parsimonious cladograms were introduced as a means to help understanding the tree of life, and are now used in fields related to biological sciences at large, e.g., to analyze viruses...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280621/ https://www.ncbi.nlm.nih.gov/pubmed/37346541 http://dx.doi.org/10.7717/peerj-cs.1339 |
_version_ | 1785060837263671296 |
---|---|
author | Oggier, Frédérique Datta, Anwitaman |
author_facet | Oggier, Frédérique Datta, Anwitaman |
author_sort | Oggier, Frédérique |
collection | PubMed |
description | This work is motivated by applications of parsimonious cladograms for the purpose of analyzing non-biological data. Parsimonious cladograms were introduced as a means to help understanding the tree of life, and are now used in fields related to biological sciences at large, e.g., to analyze viruses or to predict the structure of proteins. We revisit parsimonious cladograms through the lens of clustering and compare cladograms optimized for parsimony with dendograms obtained from single linkage hierarchical clustering. We show that despite similarities in both approaches, there exist datasets whose clustering dendogram is incompatible with parsimony optimization. Furthermore, we provide numerical examples to compare via F-scores the clustering obtained through both parsimonious cladograms and single linkage hierarchical dendograms. |
format | Online Article Text |
id | pubmed-10280621 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-102806212023-06-21 On parsimony and clustering Oggier, Frédérique Datta, Anwitaman PeerJ Comput Sci Bioinformatics This work is motivated by applications of parsimonious cladograms for the purpose of analyzing non-biological data. Parsimonious cladograms were introduced as a means to help understanding the tree of life, and are now used in fields related to biological sciences at large, e.g., to analyze viruses or to predict the structure of proteins. We revisit parsimonious cladograms through the lens of clustering and compare cladograms optimized for parsimony with dendograms obtained from single linkage hierarchical clustering. We show that despite similarities in both approaches, there exist datasets whose clustering dendogram is incompatible with parsimony optimization. Furthermore, we provide numerical examples to compare via F-scores the clustering obtained through both parsimonious cladograms and single linkage hierarchical dendograms. PeerJ Inc. 2023-04-20 /pmc/articles/PMC10280621/ /pubmed/37346541 http://dx.doi.org/10.7717/peerj-cs.1339 Text en © 2023 Oggier and Datta https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Bioinformatics Oggier, Frédérique Datta, Anwitaman On parsimony and clustering |
title | On parsimony and clustering |
title_full | On parsimony and clustering |
title_fullStr | On parsimony and clustering |
title_full_unstemmed | On parsimony and clustering |
title_short | On parsimony and clustering |
title_sort | on parsimony and clustering |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280621/ https://www.ncbi.nlm.nih.gov/pubmed/37346541 http://dx.doi.org/10.7717/peerj-cs.1339 |
work_keys_str_mv | AT oggierfrederique onparsimonyandclustering AT dattaanwitaman onparsimonyandclustering |