Consequences of Substitution Model Selection on Protein Ancestral Sequence Reconstruction

The selection of the best-fitting substitution model of molecular evolution is a traditional step for phylogenetic inferences, including ancestral sequence reconstruction (ASR). However, a few recent studies suggested that applying this procedure does not affect the accuracy of phylogenetic tree rec...

Bibliographic Details
Main authors: Del Amparo, Roberto; Arenas, Miguel
Format: Online Article Text
Language: English
Published: Oxford University Press 2022
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9254009/
https://www.ncbi.nlm.nih.gov/pubmed/35789388
http://dx.doi.org/10.1093/molbev/msac144
_version_ 1784740617908125696
author Del Amparo, Roberto
Arenas, Miguel
collection PubMed
description The selection of the best-fitting substitution model of molecular evolution is a traditional step for phylogenetic inferences, including ancestral sequence reconstruction (ASR). However, a few recent studies suggested that applying this procedure does not affect the accuracy of phylogenetic tree reconstruction. Here, we revisited this debate topic by analyzing the influence of selection among substitution models of protein evolution, with focus on exchangeability matrices, on the accuracy of ASR using simulated and real data. We found that the selected best-fitting substitution model produces the most accurate ancestral sequences, especially if the data present large genetic diversity. Indeed, ancestral sequences reconstructed under substitution models with similar exchangeability matrices were similar, suggesting that if the selected best-fitting model cannot be used for the reconstruction, applying a model similar to the selected one is preferred. We conclude that selecting among substitution models of protein evolution is recommended for reconstructing accurate ancestral sequences.
format Online
Article
Text
id pubmed-9254009
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-92540092022-07-05 Consequences of Substitution Model Selection on Protein Ancestral Sequence Reconstruction Del Amparo, Roberto Arenas, Miguel Mol Biol Evol Methods The selection of the best-fitting substitution model of molecular evolution is a traditional step for phylogenetic inferences, including ancestral sequence reconstruction (ASR). However, a few recent studies suggested that applying this procedure does not affect the accuracy of phylogenetic tree reconstruction. Here, we revisited this debate topic by analyzing the influence of selection among substitution models of protein evolution, with focus on exchangeability matrices, on the accuracy of ASR using simulated and real data. We found that the selected best-fitting substitution model produces the most accurate ancestral sequences, especially if the data present large genetic diversity. Indeed, ancestral sequences reconstructed under substitution models with similar exchangeability matrices were similar, suggesting that if the selected best-fitting model cannot be used for the reconstruction, applying a model similar to the selected one is preferred. We conclude that selecting among substitution models of protein evolution is recommended for reconstructing accurate ancestral sequences. Oxford University Press 2022-07-05 /pmc/articles/PMC9254009/ /pubmed/35789388 http://dx.doi.org/10.1093/molbev/msac144 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
title Consequences of Substitution Model Selection on Protein Ancestral Sequence Reconstruction
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9254009/
https://www.ncbi.nlm.nih.gov/pubmed/35789388
http://dx.doi.org/10.1093/molbev/msac144