Cargando…

Beginner's Guide on the Use of PAML to Detect Positive Selection

The CODEML program in the PAML package has been widely used to analyze protein-coding gene sequences to estimate the synonymous and nonsynonymous rates (d(S) and d(N)) and to detect positive Darwinian selection driving protein evolution. For users not familiar with molecular evolutionary analysis, t...

Descripción completa

Detalles Bibliográficos
Autores principales: Álvarez-Carretero, Sandra, Kapli, Paschalia, Yang, Ziheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10127084/
https://www.ncbi.nlm.nih.gov/pubmed/37096789
http://dx.doi.org/10.1093/molbev/msad041
_version_ 1785030388866875392
author Álvarez-Carretero, Sandra
Kapli, Paschalia
Yang, Ziheng
author_facet Álvarez-Carretero, Sandra
Kapli, Paschalia
Yang, Ziheng
author_sort Álvarez-Carretero, Sandra
collection PubMed
description The CODEML program in the PAML package has been widely used to analyze protein-coding gene sequences to estimate the synonymous and nonsynonymous rates (d(S) and d(N)) and to detect positive Darwinian selection driving protein evolution. For users not familiar with molecular evolutionary analysis, the program is known to have a steep learning curve. Here, we provide a step-by-step protocol to illustrate the commonly used tests available in the program, including the branch models, the site models, and the branch-site models, which can be used to detect positive selection driving adaptive protein evolution affecting particular lineages of the species phylogeny, affecting a subset of amino acid residues in the protein, and affecting a subset of sites along prespecified lineages, respectively. A data set of the myxovirus (Mx) genes from ten mammal and two bird species is used as an example. We discuss a new feature in CODEML that allows users to perform positive selection tests for multiple genes for the same set of taxa, as is common in modern genome-sequencing projects. The PAML package is distributed at https://github.com/abacus-gene/paml under the GNU license, with support provided at its discussion site (https://groups.google.com/g/pamlsoftware). Data files used in this protocol are available at https://github.com/abacus-gene/paml-tutorial.
format Online
Article
Text
id pubmed-10127084
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101270842023-04-26 Beginner's Guide on the Use of PAML to Detect Positive Selection Álvarez-Carretero, Sandra Kapli, Paschalia Yang, Ziheng Mol Biol Evol Protocols The CODEML program in the PAML package has been widely used to analyze protein-coding gene sequences to estimate the synonymous and nonsynonymous rates (d(S) and d(N)) and to detect positive Darwinian selection driving protein evolution. For users not familiar with molecular evolutionary analysis, the program is known to have a steep learning curve. Here, we provide a step-by-step protocol to illustrate the commonly used tests available in the program, including the branch models, the site models, and the branch-site models, which can be used to detect positive selection driving adaptive protein evolution affecting particular lineages of the species phylogeny, affecting a subset of amino acid residues in the protein, and affecting a subset of sites along prespecified lineages, respectively. A data set of the myxovirus (Mx) genes from ten mammal and two bird species is used as an example. We discuss a new feature in CODEML that allows users to perform positive selection tests for multiple genes for the same set of taxa, as is common in modern genome-sequencing projects. The PAML package is distributed at https://github.com/abacus-gene/paml under the GNU license, with support provided at its discussion site (https://groups.google.com/g/pamlsoftware). Data files used in this protocol are available at https://github.com/abacus-gene/paml-tutorial. Oxford University Press 2023-04-25 /pmc/articles/PMC10127084/ /pubmed/37096789 http://dx.doi.org/10.1093/molbev/msad041 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Protocols
Álvarez-Carretero, Sandra
Kapli, Paschalia
Yang, Ziheng
Beginner's Guide on the Use of PAML to Detect Positive Selection
title Beginner's Guide on the Use of PAML to Detect Positive Selection
title_full Beginner's Guide on the Use of PAML to Detect Positive Selection
title_fullStr Beginner's Guide on the Use of PAML to Detect Positive Selection
title_full_unstemmed Beginner's Guide on the Use of PAML to Detect Positive Selection
title_short Beginner's Guide on the Use of PAML to Detect Positive Selection
title_sort beginner's guide on the use of paml to detect positive selection
topic Protocols
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10127084/
https://www.ncbi.nlm.nih.gov/pubmed/37096789
http://dx.doi.org/10.1093/molbev/msad041
work_keys_str_mv AT alvarezcarreterosandra beginnersguideontheuseofpamltodetectpositiveselection
AT kaplipaschalia beginnersguideontheuseofpamltodetectpositiveselection
AT yangziheng beginnersguideontheuseofpamltodetectpositiveselection