Cargando…

ProteinEvolverABC: coestimation of recombination and substitution rates in protein sequences by approximate Bayesian computation

MOTIVATION: The evolutionary processes of mutation and recombination, upon which selection operates, are fundamental to understand the observed molecular diversity. Unlike nucleotide sequences, the estimation of the recombination rate in protein sequences has been little explored, neither implemente...

Descripción completa

Detalles Bibliográficos
Autor principal: Arenas, Miguel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8696103/
https://www.ncbi.nlm.nih.gov/pubmed/34450622
http://dx.doi.org/10.1093/bioinformatics/btab617
_version_ 1784619732214743040
author Arenas, Miguel
author_facet Arenas, Miguel
author_sort Arenas, Miguel
collection PubMed
description MOTIVATION: The evolutionary processes of mutation and recombination, upon which selection operates, are fundamental to understand the observed molecular diversity. Unlike nucleotide sequences, the estimation of the recombination rate in protein sequences has been little explored, neither implemented in evolutionary frameworks, despite protein sequencing methods are largely used. RESULTS: In order to accommodate this need, here I present a computational framework, called ProteinEvolverABC, to jointly estimate recombination and substitution rates from alignments of protein sequences. The framework implements the approximate Bayesian computation approach, with and without regression adjustments and includes a variety of substitution models of protein evolution, demographics and longitudinal sampling. It also implements several nuisance parameters such as heterogeneous amino acid frequencies and rate of change among sites and, proportion of invariable sites. The framework produces accurate coestimation of recombination and substitution rates under diverse evolutionary scenarios. As illustrative examples of usage, I applied it to several viral protein families, including coronaviruses, showing heterogeneous substitution and recombination rates. AVAILABILITY AND IMPLEMENTATION: ProteinEvolverABC is freely available from https://github.com/miguelarenas/proteinevolverabc, includes a graphical user interface for helping the specification of the input settings, extensive documentation and ready-to-use examples. Conveniently, the simulations can run in parallel on multicore machines. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8696103
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-86961032022-01-04 ProteinEvolverABC: coestimation of recombination and substitution rates in protein sequences by approximate Bayesian computation Arenas, Miguel Bioinformatics Original Papers MOTIVATION: The evolutionary processes of mutation and recombination, upon which selection operates, are fundamental to understand the observed molecular diversity. Unlike nucleotide sequences, the estimation of the recombination rate in protein sequences has been little explored, neither implemented in evolutionary frameworks, despite protein sequencing methods are largely used. RESULTS: In order to accommodate this need, here I present a computational framework, called ProteinEvolverABC, to jointly estimate recombination and substitution rates from alignments of protein sequences. The framework implements the approximate Bayesian computation approach, with and without regression adjustments and includes a variety of substitution models of protein evolution, demographics and longitudinal sampling. It also implements several nuisance parameters such as heterogeneous amino acid frequencies and rate of change among sites and, proportion of invariable sites. The framework produces accurate coestimation of recombination and substitution rates under diverse evolutionary scenarios. As illustrative examples of usage, I applied it to several viral protein families, including coronaviruses, showing heterogeneous substitution and recombination rates. AVAILABILITY AND IMPLEMENTATION: ProteinEvolverABC is freely available from https://github.com/miguelarenas/proteinevolverabc, includes a graphical user interface for helping the specification of the input settings, extensive documentation and ready-to-use examples. Conveniently, the simulations can run in parallel on multicore machines. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-08-27 /pmc/articles/PMC8696103/ /pubmed/34450622 http://dx.doi.org/10.1093/bioinformatics/btab617 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Arenas, Miguel
ProteinEvolverABC: coestimation of recombination and substitution rates in protein sequences by approximate Bayesian computation
title ProteinEvolverABC: coestimation of recombination and substitution rates in protein sequences by approximate Bayesian computation
title_full ProteinEvolverABC: coestimation of recombination and substitution rates in protein sequences by approximate Bayesian computation
title_fullStr ProteinEvolverABC: coestimation of recombination and substitution rates in protein sequences by approximate Bayesian computation
title_full_unstemmed ProteinEvolverABC: coestimation of recombination and substitution rates in protein sequences by approximate Bayesian computation
title_short ProteinEvolverABC: coestimation of recombination and substitution rates in protein sequences by approximate Bayesian computation
title_sort proteinevolverabc: coestimation of recombination and substitution rates in protein sequences by approximate bayesian computation
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8696103/
https://www.ncbi.nlm.nih.gov/pubmed/34450622
http://dx.doi.org/10.1093/bioinformatics/btab617
work_keys_str_mv AT arenasmiguel proteinevolverabccoestimationofrecombinationandsubstitutionratesinproteinsequencesbyapproximatebayesiancomputation