Cargando…

Computational Reproducibility of Molecular Phylogenies

Repeated runs of the same program can generate different molecular phylogenies from identical data sets under the same analytical conditions. This lack of reproducibility of inferred phylogenies casts a long shadow on downstream research employing these phylogenies in areas such as comparative genom...

Descripción completa

Detalles Bibliográficos
Autores principales: Kumar, Sudhir, Tao, Qiqing, Lamarca, Alessandra P, Tamura, Koichiro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10370456/
https://www.ncbi.nlm.nih.gov/pubmed/37467477
http://dx.doi.org/10.1093/molbev/msad165
_version_ 1785077938501189632
author Kumar, Sudhir
Tao, Qiqing
Lamarca, Alessandra P
Tamura, Koichiro
author_facet Kumar, Sudhir
Tao, Qiqing
Lamarca, Alessandra P
Tamura, Koichiro
author_sort Kumar, Sudhir
collection PubMed
description Repeated runs of the same program can generate different molecular phylogenies from identical data sets under the same analytical conditions. This lack of reproducibility of inferred phylogenies casts a long shadow on downstream research employing these phylogenies in areas such as comparative genomics, systematics, and functional biology. We have assessed the relative accuracies and log-likelihoods of alternative phylogenies generated for computer-simulated and empirical data sets. Our findings indicate that these alternative phylogenies reconstruct evolutionary relationships with comparable accuracy. They also have similar log-likelihoods that are not inferior to the log-likelihoods of the true tree. We determined that the direct relationship between irreproducibility and inaccuracy is due to their common dependence on the amount of phylogenetic information in the data. While computational reproducibility can be enhanced through more extensive heuristic searches for the maximum likelihood tree, this does not lead to higher accuracy. We conclude that computational irreproducibility plays a minor role in molecular phylogenetics.
format Online
Article
Text
id pubmed-10370456
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-103704562023-07-27 Computational Reproducibility of Molecular Phylogenies Kumar, Sudhir Tao, Qiqing Lamarca, Alessandra P Tamura, Koichiro Mol Biol Evol Letter Repeated runs of the same program can generate different molecular phylogenies from identical data sets under the same analytical conditions. This lack of reproducibility of inferred phylogenies casts a long shadow on downstream research employing these phylogenies in areas such as comparative genomics, systematics, and functional biology. We have assessed the relative accuracies and log-likelihoods of alternative phylogenies generated for computer-simulated and empirical data sets. Our findings indicate that these alternative phylogenies reconstruct evolutionary relationships with comparable accuracy. They also have similar log-likelihoods that are not inferior to the log-likelihoods of the true tree. We determined that the direct relationship between irreproducibility and inaccuracy is due to their common dependence on the amount of phylogenetic information in the data. While computational reproducibility can be enhanced through more extensive heuristic searches for the maximum likelihood tree, this does not lead to higher accuracy. We conclude that computational irreproducibility plays a minor role in molecular phylogenetics. Oxford University Press 2023-07-19 /pmc/articles/PMC10370456/ /pubmed/37467477 http://dx.doi.org/10.1093/molbev/msad165 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Letter
Kumar, Sudhir
Tao, Qiqing
Lamarca, Alessandra P
Tamura, Koichiro
Computational Reproducibility of Molecular Phylogenies
title Computational Reproducibility of Molecular Phylogenies
title_full Computational Reproducibility of Molecular Phylogenies
title_fullStr Computational Reproducibility of Molecular Phylogenies
title_full_unstemmed Computational Reproducibility of Molecular Phylogenies
title_short Computational Reproducibility of Molecular Phylogenies
title_sort computational reproducibility of molecular phylogenies
topic Letter
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10370456/
https://www.ncbi.nlm.nih.gov/pubmed/37467477
http://dx.doi.org/10.1093/molbev/msad165
work_keys_str_mv AT kumarsudhir computationalreproducibilityofmolecularphylogenies
AT taoqiqing computationalreproducibilityofmolecularphylogenies
AT lamarcaalessandrap computationalreproducibilityofmolecularphylogenies
AT tamurakoichiro computationalreproducibilityofmolecularphylogenies