Cargando…

Lagrange-NG: The next generation of Lagrange

Computing ancestral ranges via the Dispersion Extinction and Cladogensis (DEC) model of biogeography is characterized by an exponential number of states relative to the number of regions considered. This is because the DEC model requires computing a large matrix exponential, which typically accounts...

Descripción completa

Detalles Bibliográficos
Autores principales: Bettisworth, Ben, Smith, Stephen A, Stamatakis, Alexandros
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10198646/
https://www.ncbi.nlm.nih.gov/pubmed/36705582
http://dx.doi.org/10.1093/sysbio/syad002
_version_ 1785044777588228096
author Bettisworth, Ben
Smith, Stephen A
Stamatakis, Alexandros
author_facet Bettisworth, Ben
Smith, Stephen A
Stamatakis, Alexandros
author_sort Bettisworth, Ben
collection PubMed
description Computing ancestral ranges via the Dispersion Extinction and Cladogensis (DEC) model of biogeography is characterized by an exponential number of states relative to the number of regions considered. This is because the DEC model requires computing a large matrix exponential, which typically accounts for up to 80% of overall runtime. Therefore, the kinds of biogeographical analyses that can be conducted under the DEC model are limited by the number of regions under consideration. In this work, we present a completely redesigned efficient version of the popular tool Lagrange which is up to 49 times faster with multithreading enabled, and is also 26 times faster when using only one thread. We call this new version Lagrange-NG (Lagrange-Next Generation). The increased computational efficiency allows Lagrange-NG to analyze datasets with a large number of regions in a reasonable amount of time, up to 12 regions in approximately 18 min. We achieve these speedups using a relatively new method of computing the matrix exponential based on Krylov subspaces. In order to validate the correctness of Lagrange-NG, we also introduce a novel metric on range distributions for trees so that researchers can assess the difference between any two range inferences. Finally, Lagrange-NG exhibits substantially higher adherence to coding quality standards. It improves a respective software quality indicator as implemented in the SoftWipe tool from average (5.5; Lagrange) to high (7.8; Lagrange-NG). Lagrange-NG is freely available under GPL2. [Biogeography; Phylogenetics; DEC Model.]
format Online
Article
Text
id pubmed-10198646
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101986462023-05-20 Lagrange-NG: The next generation of Lagrange Bettisworth, Ben Smith, Stephen A Stamatakis, Alexandros Syst Biol Software for Systematics and Evolution Computing ancestral ranges via the Dispersion Extinction and Cladogensis (DEC) model of biogeography is characterized by an exponential number of states relative to the number of regions considered. This is because the DEC model requires computing a large matrix exponential, which typically accounts for up to 80% of overall runtime. Therefore, the kinds of biogeographical analyses that can be conducted under the DEC model are limited by the number of regions under consideration. In this work, we present a completely redesigned efficient version of the popular tool Lagrange which is up to 49 times faster with multithreading enabled, and is also 26 times faster when using only one thread. We call this new version Lagrange-NG (Lagrange-Next Generation). The increased computational efficiency allows Lagrange-NG to analyze datasets with a large number of regions in a reasonable amount of time, up to 12 regions in approximately 18 min. We achieve these speedups using a relatively new method of computing the matrix exponential based on Krylov subspaces. In order to validate the correctness of Lagrange-NG, we also introduce a novel metric on range distributions for trees so that researchers can assess the difference between any two range inferences. Finally, Lagrange-NG exhibits substantially higher adherence to coding quality standards. It improves a respective software quality indicator as implemented in the SoftWipe tool from average (5.5; Lagrange) to high (7.8; Lagrange-NG). Lagrange-NG is freely available under GPL2. [Biogeography; Phylogenetics; DEC Model.] Oxford University Press 2023-01-27 /pmc/articles/PMC10198646/ /pubmed/36705582 http://dx.doi.org/10.1093/sysbio/syad002 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the Society of Systematic Biologists. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Software for Systematics and Evolution
Bettisworth, Ben
Smith, Stephen A
Stamatakis, Alexandros
Lagrange-NG: The next generation of Lagrange
title Lagrange-NG: The next generation of Lagrange
title_full Lagrange-NG: The next generation of Lagrange
title_fullStr Lagrange-NG: The next generation of Lagrange
title_full_unstemmed Lagrange-NG: The next generation of Lagrange
title_short Lagrange-NG: The next generation of Lagrange
title_sort lagrange-ng: the next generation of lagrange
topic Software for Systematics and Evolution
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10198646/
https://www.ncbi.nlm.nih.gov/pubmed/36705582
http://dx.doi.org/10.1093/sysbio/syad002
work_keys_str_mv AT bettisworthben lagrangengthenextgenerationoflagrange
AT smithstephena lagrangengthenextgenerationoflagrange
AT stamatakisalexandros lagrangengthenextgenerationoflagrange