Cargando…

Context-dependent codon partition models provide significant increases in model fit in atpB and rbcL protein-coding genes

BACKGROUND: Accurate modelling of substitution processes in protein-coding sequences is often hampered by the computational burdens associated with full codon models. Lately, codon partition models have been proposed as a viable alternative, mimicking the substitution behaviour of codon models at a...

Descripción completa

Detalles Bibliográficos
Autores principales: Baele, Guy, Van de Peer, Yves, Vansteelandt, Stijn
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3126739/
https://www.ncbi.nlm.nih.gov/pubmed/21619569
http://dx.doi.org/10.1186/1471-2148-11-145
_version_ 1782207280567025664
author Baele, Guy
Van de Peer, Yves
Vansteelandt, Stijn
author_facet Baele, Guy
Van de Peer, Yves
Vansteelandt, Stijn
author_sort Baele, Guy
collection PubMed
description BACKGROUND: Accurate modelling of substitution processes in protein-coding sequences is often hampered by the computational burdens associated with full codon models. Lately, codon partition models have been proposed as a viable alternative, mimicking the substitution behaviour of codon models at a low computational cost. Such codon partition models however impose independent evolution of the different codon positions, which is overly restrictive from a biological point of view. Given that empirical research has provided indications of context-dependent substitution patterns at four-fold degenerate sites, we take those indications into account in this paper. RESULTS: We present so-called context-dependent codon partition models to assess previous empirical claims that the evolution of four-fold degenerate sites is strongly dependent on the composition of its two flanking bases. To this end, we have estimated and compared various existing independent models, codon models, codon partition models and context-dependent codon partition models for the atpB and rbcL genes of the chloroplast genome, which are frequently used in plant systematics. Such context-dependent codon partition models employ a full dependency scheme for four-fold degenerate sites, whilst maintaining the independence assumption for the first and second codon positions. CONCLUSIONS: We show that, both in the atpB and rbcL alignments of a collection of land plants, these context-dependent codon partition models significantly improve model fit over existing codon partition models. Using Bayes factors based on thermodynamic integration, we show that in both datasets the same context-dependent codon partition model yields the largest increase in model fit compared to an independent evolutionary model. Context-dependent codon partition models hence perform closer to codon models, which remain the best performing models at a drastically increased computational cost, compared to codon partition models, but remain computationally interesting alternatives to codon models. Finally, we observe that the substitution patterns in both datasets are drastically different, leading to the conclusion that combined analysis of these two genes using a single model may not be advisable from a context-dependent point of view.
format Online
Article
Text
id pubmed-3126739
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31267392011-06-30 Context-dependent codon partition models provide significant increases in model fit in atpB and rbcL protein-coding genes Baele, Guy Van de Peer, Yves Vansteelandt, Stijn BMC Evol Biol Research Article BACKGROUND: Accurate modelling of substitution processes in protein-coding sequences is often hampered by the computational burdens associated with full codon models. Lately, codon partition models have been proposed as a viable alternative, mimicking the substitution behaviour of codon models at a low computational cost. Such codon partition models however impose independent evolution of the different codon positions, which is overly restrictive from a biological point of view. Given that empirical research has provided indications of context-dependent substitution patterns at four-fold degenerate sites, we take those indications into account in this paper. RESULTS: We present so-called context-dependent codon partition models to assess previous empirical claims that the evolution of four-fold degenerate sites is strongly dependent on the composition of its two flanking bases. To this end, we have estimated and compared various existing independent models, codon models, codon partition models and context-dependent codon partition models for the atpB and rbcL genes of the chloroplast genome, which are frequently used in plant systematics. Such context-dependent codon partition models employ a full dependency scheme for four-fold degenerate sites, whilst maintaining the independence assumption for the first and second codon positions. CONCLUSIONS: We show that, both in the atpB and rbcL alignments of a collection of land plants, these context-dependent codon partition models significantly improve model fit over existing codon partition models. Using Bayes factors based on thermodynamic integration, we show that in both datasets the same context-dependent codon partition model yields the largest increase in model fit compared to an independent evolutionary model. Context-dependent codon partition models hence perform closer to codon models, which remain the best performing models at a drastically increased computational cost, compared to codon partition models, but remain computationally interesting alternatives to codon models. Finally, we observe that the substitution patterns in both datasets are drastically different, leading to the conclusion that combined analysis of these two genes using a single model may not be advisable from a context-dependent point of view. BioMed Central 2011-05-27 /pmc/articles/PMC3126739/ /pubmed/21619569 http://dx.doi.org/10.1186/1471-2148-11-145 Text en Copyright ©2011 Baele et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Baele, Guy
Van de Peer, Yves
Vansteelandt, Stijn
Context-dependent codon partition models provide significant increases in model fit in atpB and rbcL protein-coding genes
title Context-dependent codon partition models provide significant increases in model fit in atpB and rbcL protein-coding genes
title_full Context-dependent codon partition models provide significant increases in model fit in atpB and rbcL protein-coding genes
title_fullStr Context-dependent codon partition models provide significant increases in model fit in atpB and rbcL protein-coding genes
title_full_unstemmed Context-dependent codon partition models provide significant increases in model fit in atpB and rbcL protein-coding genes
title_short Context-dependent codon partition models provide significant increases in model fit in atpB and rbcL protein-coding genes
title_sort context-dependent codon partition models provide significant increases in model fit in atpb and rbcl protein-coding genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3126739/
https://www.ncbi.nlm.nih.gov/pubmed/21619569
http://dx.doi.org/10.1186/1471-2148-11-145
work_keys_str_mv AT baeleguy contextdependentcodonpartitionmodelsprovidesignificantincreasesinmodelfitinatpbandrbclproteincodinggenes
AT vandepeeryves contextdependentcodonpartitionmodelsprovidesignificantincreasesinmodelfitinatpbandrbclproteincodinggenes
AT vansteelandtstijn contextdependentcodonpartitionmodelsprovidesignificantincreasesinmodelfitinatpbandrbclproteincodinggenes