Cargando…
Computational Reproducibility of Molecular Phylogenies
Repeated runs of the same program can generate different molecular phylogenies from identical data sets under the same analytical conditions. This lack of reproducibility of inferred phylogenies casts a long shadow on downstream research employing these phylogenies in areas such as comparative genom...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10370456/ https://www.ncbi.nlm.nih.gov/pubmed/37467477 http://dx.doi.org/10.1093/molbev/msad165 |
_version_ | 1785077938501189632 |
---|---|
author | Kumar, Sudhir Tao, Qiqing Lamarca, Alessandra P Tamura, Koichiro |
author_facet | Kumar, Sudhir Tao, Qiqing Lamarca, Alessandra P Tamura, Koichiro |
author_sort | Kumar, Sudhir |
collection | PubMed |
description | Repeated runs of the same program can generate different molecular phylogenies from identical data sets under the same analytical conditions. This lack of reproducibility of inferred phylogenies casts a long shadow on downstream research employing these phylogenies in areas such as comparative genomics, systematics, and functional biology. We have assessed the relative accuracies and log-likelihoods of alternative phylogenies generated for computer-simulated and empirical data sets. Our findings indicate that these alternative phylogenies reconstruct evolutionary relationships with comparable accuracy. They also have similar log-likelihoods that are not inferior to the log-likelihoods of the true tree. We determined that the direct relationship between irreproducibility and inaccuracy is due to their common dependence on the amount of phylogenetic information in the data. While computational reproducibility can be enhanced through more extensive heuristic searches for the maximum likelihood tree, this does not lead to higher accuracy. We conclude that computational irreproducibility plays a minor role in molecular phylogenetics. |
format | Online Article Text |
id | pubmed-10370456 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-103704562023-07-27 Computational Reproducibility of Molecular Phylogenies Kumar, Sudhir Tao, Qiqing Lamarca, Alessandra P Tamura, Koichiro Mol Biol Evol Letter Repeated runs of the same program can generate different molecular phylogenies from identical data sets under the same analytical conditions. This lack of reproducibility of inferred phylogenies casts a long shadow on downstream research employing these phylogenies in areas such as comparative genomics, systematics, and functional biology. We have assessed the relative accuracies and log-likelihoods of alternative phylogenies generated for computer-simulated and empirical data sets. Our findings indicate that these alternative phylogenies reconstruct evolutionary relationships with comparable accuracy. They also have similar log-likelihoods that are not inferior to the log-likelihoods of the true tree. We determined that the direct relationship between irreproducibility and inaccuracy is due to their common dependence on the amount of phylogenetic information in the data. While computational reproducibility can be enhanced through more extensive heuristic searches for the maximum likelihood tree, this does not lead to higher accuracy. We conclude that computational irreproducibility plays a minor role in molecular phylogenetics. Oxford University Press 2023-07-19 /pmc/articles/PMC10370456/ /pubmed/37467477 http://dx.doi.org/10.1093/molbev/msad165 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Letter Kumar, Sudhir Tao, Qiqing Lamarca, Alessandra P Tamura, Koichiro Computational Reproducibility of Molecular Phylogenies |
title | Computational Reproducibility of Molecular Phylogenies |
title_full | Computational Reproducibility of Molecular Phylogenies |
title_fullStr | Computational Reproducibility of Molecular Phylogenies |
title_full_unstemmed | Computational Reproducibility of Molecular Phylogenies |
title_short | Computational Reproducibility of Molecular Phylogenies |
title_sort | computational reproducibility of molecular phylogenies |
topic | Letter |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10370456/ https://www.ncbi.nlm.nih.gov/pubmed/37467477 http://dx.doi.org/10.1093/molbev/msad165 |
work_keys_str_mv | AT kumarsudhir computationalreproducibilityofmolecularphylogenies AT taoqiqing computationalreproducibilityofmolecularphylogenies AT lamarcaalessandrap computationalreproducibilityofmolecularphylogenies AT tamurakoichiro computationalreproducibilityofmolecularphylogenies |