Cargando…
AlexandrusPS: A User-Friendly Pipeline for the Automated Detection of Orthologous Gene Clusters and Subsequent Positive Selection Analysis
The detection of adaptive selection in a system approach considering all protein-coding genes allows for the identification of mechanisms and pathways that enabled adaptation to different environments. Currently, available programs for the estimation of positive selection signals can be divided into...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10612477/ https://www.ncbi.nlm.nih.gov/pubmed/37831426 http://dx.doi.org/10.1093/gbe/evad187 |
_version_ | 1785128709507776512 |
---|---|
author | Ceron-Noriega, Alejandro Schoonenberg, Vivien A C Butter, Falk Levin, Michal |
author_facet | Ceron-Noriega, Alejandro Schoonenberg, Vivien A C Butter, Falk Levin, Michal |
author_sort | Ceron-Noriega, Alejandro |
collection | PubMed |
description | The detection of adaptive selection in a system approach considering all protein-coding genes allows for the identification of mechanisms and pathways that enabled adaptation to different environments. Currently, available programs for the estimation of positive selection signals can be divided into two groups. They are either easy to apply but can analyze only one gene family at a time, restricting system analysis; or they can handle larger cohorts of gene families, but require considerable prerequisite data such as orthology associations, codon alignments, phylogenetic trees, and proper configuration files. All these steps require extensive computational expertise, restricting this endeavor to specialists. Here, we introduce AlexandrusPS, a high-throughput pipeline that overcomes technical challenges when conducting transcriptome-wide positive selection analyses on large sets of nucleotide and protein sequences. The pipeline streamlines 1) the execution of an accurate orthology prediction as a precondition for positive selection analysis, 2) preparing and organizing configuration files for CodeML, 3) performing positive selection analysis using CodeML, and 4) generating an output that is easy to interpret, including all maximum likelihood and log-likelihood test results. The only input needed from the user is the CDS and peptide FASTA files of proteins of interest. The pipeline is provided in a Docker image, requiring no program or module installation, enabling the application of the pipeline in any computing environment. AlexandrusPS and its documentation are available via GitHub (https://github.com/alejocn5/AlexandrusPS). |
format | Online Article Text |
id | pubmed-10612477 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-106124772023-10-29 AlexandrusPS: A User-Friendly Pipeline for the Automated Detection of Orthologous Gene Clusters and Subsequent Positive Selection Analysis Ceron-Noriega, Alejandro Schoonenberg, Vivien A C Butter, Falk Levin, Michal Genome Biol Evol Letter The detection of adaptive selection in a system approach considering all protein-coding genes allows for the identification of mechanisms and pathways that enabled adaptation to different environments. Currently, available programs for the estimation of positive selection signals can be divided into two groups. They are either easy to apply but can analyze only one gene family at a time, restricting system analysis; or they can handle larger cohorts of gene families, but require considerable prerequisite data such as orthology associations, codon alignments, phylogenetic trees, and proper configuration files. All these steps require extensive computational expertise, restricting this endeavor to specialists. Here, we introduce AlexandrusPS, a high-throughput pipeline that overcomes technical challenges when conducting transcriptome-wide positive selection analyses on large sets of nucleotide and protein sequences. The pipeline streamlines 1) the execution of an accurate orthology prediction as a precondition for positive selection analysis, 2) preparing and organizing configuration files for CodeML, 3) performing positive selection analysis using CodeML, and 4) generating an output that is easy to interpret, including all maximum likelihood and log-likelihood test results. The only input needed from the user is the CDS and peptide FASTA files of proteins of interest. The pipeline is provided in a Docker image, requiring no program or module installation, enabling the application of the pipeline in any computing environment. AlexandrusPS and its documentation are available via GitHub (https://github.com/alejocn5/AlexandrusPS). Oxford University Press 2023-10-13 /pmc/articles/PMC10612477/ /pubmed/37831426 http://dx.doi.org/10.1093/gbe/evad187 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Letter Ceron-Noriega, Alejandro Schoonenberg, Vivien A C Butter, Falk Levin, Michal AlexandrusPS: A User-Friendly Pipeline for the Automated Detection of Orthologous Gene Clusters and Subsequent Positive Selection Analysis |
title | AlexandrusPS: A User-Friendly Pipeline for the Automated Detection of Orthologous Gene Clusters and Subsequent Positive Selection Analysis |
title_full | AlexandrusPS: A User-Friendly Pipeline for the Automated Detection of Orthologous Gene Clusters and Subsequent Positive Selection Analysis |
title_fullStr | AlexandrusPS: A User-Friendly Pipeline for the Automated Detection of Orthologous Gene Clusters and Subsequent Positive Selection Analysis |
title_full_unstemmed | AlexandrusPS: A User-Friendly Pipeline for the Automated Detection of Orthologous Gene Clusters and Subsequent Positive Selection Analysis |
title_short | AlexandrusPS: A User-Friendly Pipeline for the Automated Detection of Orthologous Gene Clusters and Subsequent Positive Selection Analysis |
title_sort | alexandrusps: a user-friendly pipeline for the automated detection of orthologous gene clusters and subsequent positive selection analysis |
topic | Letter |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10612477/ https://www.ncbi.nlm.nih.gov/pubmed/37831426 http://dx.doi.org/10.1093/gbe/evad187 |
work_keys_str_mv | AT ceronnoriegaalejandro alexandruspsauserfriendlypipelinefortheautomateddetectionoforthologousgeneclustersandsubsequentpositiveselectionanalysis AT schoonenbergvivienac alexandruspsauserfriendlypipelinefortheautomateddetectionoforthologousgeneclustersandsubsequentpositiveselectionanalysis AT butterfalk alexandruspsauserfriendlypipelinefortheautomateddetectionoforthologousgeneclustersandsubsequentpositiveselectionanalysis AT levinmichal alexandruspsauserfriendlypipelinefortheautomateddetectionoforthologousgeneclustersandsubsequentpositiveselectionanalysis |