Cargando…
Beginner's Guide on the Use of PAML to Detect Positive Selection
The CODEML program in the PAML package has been widely used to analyze protein-coding gene sequences to estimate the synonymous and nonsynonymous rates (d(S) and d(N)) and to detect positive Darwinian selection driving protein evolution. For users not familiar with molecular evolutionary analysis, t...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10127084/ https://www.ncbi.nlm.nih.gov/pubmed/37096789 http://dx.doi.org/10.1093/molbev/msad041 |
_version_ | 1785030388866875392 |
---|---|
author | Álvarez-Carretero, Sandra Kapli, Paschalia Yang, Ziheng |
author_facet | Álvarez-Carretero, Sandra Kapli, Paschalia Yang, Ziheng |
author_sort | Álvarez-Carretero, Sandra |
collection | PubMed |
description | The CODEML program in the PAML package has been widely used to analyze protein-coding gene sequences to estimate the synonymous and nonsynonymous rates (d(S) and d(N)) and to detect positive Darwinian selection driving protein evolution. For users not familiar with molecular evolutionary analysis, the program is known to have a steep learning curve. Here, we provide a step-by-step protocol to illustrate the commonly used tests available in the program, including the branch models, the site models, and the branch-site models, which can be used to detect positive selection driving adaptive protein evolution affecting particular lineages of the species phylogeny, affecting a subset of amino acid residues in the protein, and affecting a subset of sites along prespecified lineages, respectively. A data set of the myxovirus (Mx) genes from ten mammal and two bird species is used as an example. We discuss a new feature in CODEML that allows users to perform positive selection tests for multiple genes for the same set of taxa, as is common in modern genome-sequencing projects. The PAML package is distributed at https://github.com/abacus-gene/paml under the GNU license, with support provided at its discussion site (https://groups.google.com/g/pamlsoftware). Data files used in this protocol are available at https://github.com/abacus-gene/paml-tutorial. |
format | Online Article Text |
id | pubmed-10127084 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-101270842023-04-26 Beginner's Guide on the Use of PAML to Detect Positive Selection Álvarez-Carretero, Sandra Kapli, Paschalia Yang, Ziheng Mol Biol Evol Protocols The CODEML program in the PAML package has been widely used to analyze protein-coding gene sequences to estimate the synonymous and nonsynonymous rates (d(S) and d(N)) and to detect positive Darwinian selection driving protein evolution. For users not familiar with molecular evolutionary analysis, the program is known to have a steep learning curve. Here, we provide a step-by-step protocol to illustrate the commonly used tests available in the program, including the branch models, the site models, and the branch-site models, which can be used to detect positive selection driving adaptive protein evolution affecting particular lineages of the species phylogeny, affecting a subset of amino acid residues in the protein, and affecting a subset of sites along prespecified lineages, respectively. A data set of the myxovirus (Mx) genes from ten mammal and two bird species is used as an example. We discuss a new feature in CODEML that allows users to perform positive selection tests for multiple genes for the same set of taxa, as is common in modern genome-sequencing projects. The PAML package is distributed at https://github.com/abacus-gene/paml under the GNU license, with support provided at its discussion site (https://groups.google.com/g/pamlsoftware). Data files used in this protocol are available at https://github.com/abacus-gene/paml-tutorial. Oxford University Press 2023-04-25 /pmc/articles/PMC10127084/ /pubmed/37096789 http://dx.doi.org/10.1093/molbev/msad041 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Protocols Álvarez-Carretero, Sandra Kapli, Paschalia Yang, Ziheng Beginner's Guide on the Use of PAML to Detect Positive Selection |
title | Beginner's Guide on the Use of PAML to Detect Positive Selection |
title_full | Beginner's Guide on the Use of PAML to Detect Positive Selection |
title_fullStr | Beginner's Guide on the Use of PAML to Detect Positive Selection |
title_full_unstemmed | Beginner's Guide on the Use of PAML to Detect Positive Selection |
title_short | Beginner's Guide on the Use of PAML to Detect Positive Selection |
title_sort | beginner's guide on the use of paml to detect positive selection |
topic | Protocols |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10127084/ https://www.ncbi.nlm.nih.gov/pubmed/37096789 http://dx.doi.org/10.1093/molbev/msad041 |
work_keys_str_mv | AT alvarezcarreterosandra beginnersguideontheuseofpamltodetectpositiveselection AT kaplipaschalia beginnersguideontheuseofpamltodetectpositiveselection AT yangziheng beginnersguideontheuseofpamltodetectpositiveselection |