Cargando…

How Fitch-Margoliash Algorithm can Benefit from Multi Dimensional Scaling

Whatever the phylogenetic method, genetic sequences are often described as strings of characters, thus molecular sequences can be viewed as elements of a multi-dimensional space. As a consequence, studying motion in this space (ie, the evolutionary process) must deal with the amazing features of hig...

Descripción completa

Detalles Bibliográficos
Autores principales: Lespinats, Sylvain, Grando, Delphine, Maréchal, Eric, Hakimi, Mohamed-Ali, Tenaillon, Olivier, Bastien, Olivier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Libertas Academica 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3118699/
https://www.ncbi.nlm.nih.gov/pubmed/21697992
http://dx.doi.org/10.4137/EBO.S7048
_version_ 1782206497365688320
author Lespinats, Sylvain
Grando, Delphine
Maréchal, Eric
Hakimi, Mohamed-Ali
Tenaillon, Olivier
Bastien, Olivier
author_facet Lespinats, Sylvain
Grando, Delphine
Maréchal, Eric
Hakimi, Mohamed-Ali
Tenaillon, Olivier
Bastien, Olivier
author_sort Lespinats, Sylvain
collection PubMed
description Whatever the phylogenetic method, genetic sequences are often described as strings of characters, thus molecular sequences can be viewed as elements of a multi-dimensional space. As a consequence, studying motion in this space (ie, the evolutionary process) must deal with the amazing features of high-dimensional spaces like concentration of measured phenomenon. To study how these features might influence phylogeny reconstructions, we examined a particular popular method: the Fitch-Margoliash algorithm, which belongs to the Least Squares methods. We show that the Least Squares methods are closely related to Multi Dimensional Scaling. Indeed, criteria for Fitch-Margoliash and Sammon’s mapping are somewhat similar. However, the prolific research in Multi Dimensional Scaling has definitely allowed outclassing Sammon’s mapping. Least Square methods for tree reconstruction can now take advantage of these improvements. However, “false neighborhood” and “tears” are the two main risks in dimensionality reduction field: “false neighborhood” corresponds to a widely separated data in the original space that are found close in representation space, and neighbor data that are displayed in remote positions constitute a “tear”. To address this problem, we took advantage of the concepts of “continuity” and “trustworthiness” in the tree reconstruction field, which limit the risk of “false neighborhood” and “tears”. We also point out the concentration of measured phenomenon as a source of error and introduce here new criteria to build phylogenies with improved preservation of distances and robustness. The authors and the Evolutionary Bioinformatics Journal dedicate this article to the memory of Professor W.M. Fitch (1929–2011).
format Online
Article
Text
id pubmed-3118699
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-31186992011-06-22 How Fitch-Margoliash Algorithm can Benefit from Multi Dimensional Scaling Lespinats, Sylvain Grando, Delphine Maréchal, Eric Hakimi, Mohamed-Ali Tenaillon, Olivier Bastien, Olivier Evol Bioinform Online Original Research Whatever the phylogenetic method, genetic sequences are often described as strings of characters, thus molecular sequences can be viewed as elements of a multi-dimensional space. As a consequence, studying motion in this space (ie, the evolutionary process) must deal with the amazing features of high-dimensional spaces like concentration of measured phenomenon. To study how these features might influence phylogeny reconstructions, we examined a particular popular method: the Fitch-Margoliash algorithm, which belongs to the Least Squares methods. We show that the Least Squares methods are closely related to Multi Dimensional Scaling. Indeed, criteria for Fitch-Margoliash and Sammon’s mapping are somewhat similar. However, the prolific research in Multi Dimensional Scaling has definitely allowed outclassing Sammon’s mapping. Least Square methods for tree reconstruction can now take advantage of these improvements. However, “false neighborhood” and “tears” are the two main risks in dimensionality reduction field: “false neighborhood” corresponds to a widely separated data in the original space that are found close in representation space, and neighbor data that are displayed in remote positions constitute a “tear”. To address this problem, we took advantage of the concepts of “continuity” and “trustworthiness” in the tree reconstruction field, which limit the risk of “false neighborhood” and “tears”. We also point out the concentration of measured phenomenon as a source of error and introduce here new criteria to build phylogenies with improved preservation of distances and robustness. The authors and the Evolutionary Bioinformatics Journal dedicate this article to the memory of Professor W.M. Fitch (1929–2011). Libertas Academica 2011-06-07 /pmc/articles/PMC3118699/ /pubmed/21697992 http://dx.doi.org/10.4137/EBO.S7048 Text en © the author(s), publisher and licensee Libertas Academica Ltd. This is an open access article. Unrestricted non-commercial use is permitted provided the original work is properly cited.
spellingShingle Original Research
Lespinats, Sylvain
Grando, Delphine
Maréchal, Eric
Hakimi, Mohamed-Ali
Tenaillon, Olivier
Bastien, Olivier
How Fitch-Margoliash Algorithm can Benefit from Multi Dimensional Scaling
title How Fitch-Margoliash Algorithm can Benefit from Multi Dimensional Scaling
title_full How Fitch-Margoliash Algorithm can Benefit from Multi Dimensional Scaling
title_fullStr How Fitch-Margoliash Algorithm can Benefit from Multi Dimensional Scaling
title_full_unstemmed How Fitch-Margoliash Algorithm can Benefit from Multi Dimensional Scaling
title_short How Fitch-Margoliash Algorithm can Benefit from Multi Dimensional Scaling
title_sort how fitch-margoliash algorithm can benefit from multi dimensional scaling
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3118699/
https://www.ncbi.nlm.nih.gov/pubmed/21697992
http://dx.doi.org/10.4137/EBO.S7048
work_keys_str_mv AT lespinatssylvain howfitchmargoliashalgorithmcanbenefitfrommultidimensionalscaling
AT grandodelphine howfitchmargoliashalgorithmcanbenefitfrommultidimensionalscaling
AT marechaleric howfitchmargoliashalgorithmcanbenefitfrommultidimensionalscaling
AT hakimimohamedali howfitchmargoliashalgorithmcanbenefitfrommultidimensionalscaling
AT tenaillonolivier howfitchmargoliashalgorithmcanbenefitfrommultidimensionalscaling
AT bastienolivier howfitchmargoliashalgorithmcanbenefitfrommultidimensionalscaling