Cargando…

Measuring the relative contribution to predictive power of modern nucleotide substitution modeling approaches

Traditional approaches to probabilistic phylogenetic inference have relied on information-theoretic criteria to select among a relatively small set of substitution models. These model selection criteria have recently been called into question when applied to richer models, including models that invo...

Descripción completa

Detalles Bibliográficos
Autores principales: Bujaki, Thomas, Van Looyen, Katharine, Rodrigue, Nicolas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10371494/
https://www.ncbi.nlm.nih.gov/pubmed/37502274
http://dx.doi.org/10.1093/bioadv/vbad091
_version_ 1785078158622457856
author Bujaki, Thomas
Van Looyen, Katharine
Rodrigue, Nicolas
author_facet Bujaki, Thomas
Van Looyen, Katharine
Rodrigue, Nicolas
author_sort Bujaki, Thomas
collection PubMed
description Traditional approaches to probabilistic phylogenetic inference have relied on information-theoretic criteria to select among a relatively small set of substitution models. These model selection criteria have recently been called into question when applied to richer models, including models that invoke mixtures of nucleotide frequency profiles. At the nucleotide level, we are therefore left without a clear picture of mixture models’ contribution to overall predictive power relative to other modeling approaches. Here, we utilize a Bayesian cross-validation method to directly measure the predictive performance of a wide range of nucleotide substitution models. We compare the relative contributions of free nucleotide exchangeability parameters, gamma-distributed rates across sites, and mixtures of nucleotide frequencies with both finite and infinite mixture frameworks. We find that the most important contributor to a model’s predictive power is the use of a sufficiently rich mixture of nucleotide frequencies. These results suggest that mixture models should be given greater consideration in nucleotide-level phylogenetic inference.
format Online
Article
Text
id pubmed-10371494
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-103714942023-07-27 Measuring the relative contribution to predictive power of modern nucleotide substitution modeling approaches Bujaki, Thomas Van Looyen, Katharine Rodrigue, Nicolas Bioinform Adv Original Article Traditional approaches to probabilistic phylogenetic inference have relied on information-theoretic criteria to select among a relatively small set of substitution models. These model selection criteria have recently been called into question when applied to richer models, including models that invoke mixtures of nucleotide frequency profiles. At the nucleotide level, we are therefore left without a clear picture of mixture models’ contribution to overall predictive power relative to other modeling approaches. Here, we utilize a Bayesian cross-validation method to directly measure the predictive performance of a wide range of nucleotide substitution models. We compare the relative contributions of free nucleotide exchangeability parameters, gamma-distributed rates across sites, and mixtures of nucleotide frequencies with both finite and infinite mixture frameworks. We find that the most important contributor to a model’s predictive power is the use of a sufficiently rich mixture of nucleotide frequencies. These results suggest that mixture models should be given greater consideration in nucleotide-level phylogenetic inference. Oxford University Press 2023-07-14 /pmc/articles/PMC10371494/ /pubmed/37502274 http://dx.doi.org/10.1093/bioadv/vbad091 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Bujaki, Thomas
Van Looyen, Katharine
Rodrigue, Nicolas
Measuring the relative contribution to predictive power of modern nucleotide substitution modeling approaches
title Measuring the relative contribution to predictive power of modern nucleotide substitution modeling approaches
title_full Measuring the relative contribution to predictive power of modern nucleotide substitution modeling approaches
title_fullStr Measuring the relative contribution to predictive power of modern nucleotide substitution modeling approaches
title_full_unstemmed Measuring the relative contribution to predictive power of modern nucleotide substitution modeling approaches
title_short Measuring the relative contribution to predictive power of modern nucleotide substitution modeling approaches
title_sort measuring the relative contribution to predictive power of modern nucleotide substitution modeling approaches
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10371494/
https://www.ncbi.nlm.nih.gov/pubmed/37502274
http://dx.doi.org/10.1093/bioadv/vbad091
work_keys_str_mv AT bujakithomas measuringtherelativecontributiontopredictivepowerofmodernnucleotidesubstitutionmodelingapproaches
AT vanlooyenkatharine measuringtherelativecontributiontopredictivepowerofmodernnucleotidesubstitutionmodelingapproaches
AT rodriguenicolas measuringtherelativecontributiontopredictivepowerofmodernnucleotidesubstitutionmodelingapproaches