Cargando…
Measuring the relative contribution to predictive power of modern nucleotide substitution modeling approaches
Traditional approaches to probabilistic phylogenetic inference have relied on information-theoretic criteria to select among a relatively small set of substitution models. These model selection criteria have recently been called into question when applied to richer models, including models that invo...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10371494/ https://www.ncbi.nlm.nih.gov/pubmed/37502274 http://dx.doi.org/10.1093/bioadv/vbad091 |
_version_ | 1785078158622457856 |
---|---|
author | Bujaki, Thomas Van Looyen, Katharine Rodrigue, Nicolas |
author_facet | Bujaki, Thomas Van Looyen, Katharine Rodrigue, Nicolas |
author_sort | Bujaki, Thomas |
collection | PubMed |
description | Traditional approaches to probabilistic phylogenetic inference have relied on information-theoretic criteria to select among a relatively small set of substitution models. These model selection criteria have recently been called into question when applied to richer models, including models that invoke mixtures of nucleotide frequency profiles. At the nucleotide level, we are therefore left without a clear picture of mixture models’ contribution to overall predictive power relative to other modeling approaches. Here, we utilize a Bayesian cross-validation method to directly measure the predictive performance of a wide range of nucleotide substitution models. We compare the relative contributions of free nucleotide exchangeability parameters, gamma-distributed rates across sites, and mixtures of nucleotide frequencies with both finite and infinite mixture frameworks. We find that the most important contributor to a model’s predictive power is the use of a sufficiently rich mixture of nucleotide frequencies. These results suggest that mixture models should be given greater consideration in nucleotide-level phylogenetic inference. |
format | Online Article Text |
id | pubmed-10371494 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-103714942023-07-27 Measuring the relative contribution to predictive power of modern nucleotide substitution modeling approaches Bujaki, Thomas Van Looyen, Katharine Rodrigue, Nicolas Bioinform Adv Original Article Traditional approaches to probabilistic phylogenetic inference have relied on information-theoretic criteria to select among a relatively small set of substitution models. These model selection criteria have recently been called into question when applied to richer models, including models that invoke mixtures of nucleotide frequency profiles. At the nucleotide level, we are therefore left without a clear picture of mixture models’ contribution to overall predictive power relative to other modeling approaches. Here, we utilize a Bayesian cross-validation method to directly measure the predictive performance of a wide range of nucleotide substitution models. We compare the relative contributions of free nucleotide exchangeability parameters, gamma-distributed rates across sites, and mixtures of nucleotide frequencies with both finite and infinite mixture frameworks. We find that the most important contributor to a model’s predictive power is the use of a sufficiently rich mixture of nucleotide frequencies. These results suggest that mixture models should be given greater consideration in nucleotide-level phylogenetic inference. Oxford University Press 2023-07-14 /pmc/articles/PMC10371494/ /pubmed/37502274 http://dx.doi.org/10.1093/bioadv/vbad091 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Bujaki, Thomas Van Looyen, Katharine Rodrigue, Nicolas Measuring the relative contribution to predictive power of modern nucleotide substitution modeling approaches |
title | Measuring the relative contribution to predictive power of modern nucleotide substitution modeling approaches |
title_full | Measuring the relative contribution to predictive power of modern nucleotide substitution modeling approaches |
title_fullStr | Measuring the relative contribution to predictive power of modern nucleotide substitution modeling approaches |
title_full_unstemmed | Measuring the relative contribution to predictive power of modern nucleotide substitution modeling approaches |
title_short | Measuring the relative contribution to predictive power of modern nucleotide substitution modeling approaches |
title_sort | measuring the relative contribution to predictive power of modern nucleotide substitution modeling approaches |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10371494/ https://www.ncbi.nlm.nih.gov/pubmed/37502274 http://dx.doi.org/10.1093/bioadv/vbad091 |
work_keys_str_mv | AT bujakithomas measuringtherelativecontributiontopredictivepowerofmodernnucleotidesubstitutionmodelingapproaches AT vanlooyenkatharine measuringtherelativecontributiontopredictivepowerofmodernnucleotidesubstitutionmodelingapproaches AT rodriguenicolas measuringtherelativecontributiontopredictivepowerofmodernnucleotidesubstitutionmodelingapproaches |