Cargando…

AlexandrusPS: A User-Friendly Pipeline for the Automated Detection of Orthologous Gene Clusters and Subsequent Positive Selection Analysis

The detection of adaptive selection in a system approach considering all protein-coding genes allows for the identification of mechanisms and pathways that enabled adaptation to different environments. Currently, available programs for the estimation of positive selection signals can be divided into...

Descripción completa

Detalles Bibliográficos
Autores principales: Ceron-Noriega, Alejandro, Schoonenberg, Vivien A C, Butter, Falk, Levin, Michal
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10612477/
https://www.ncbi.nlm.nih.gov/pubmed/37831426
http://dx.doi.org/10.1093/gbe/evad187
_version_ 1785128709507776512
author Ceron-Noriega, Alejandro
Schoonenberg, Vivien A C
Butter, Falk
Levin, Michal
author_facet Ceron-Noriega, Alejandro
Schoonenberg, Vivien A C
Butter, Falk
Levin, Michal
author_sort Ceron-Noriega, Alejandro
collection PubMed
description The detection of adaptive selection in a system approach considering all protein-coding genes allows for the identification of mechanisms and pathways that enabled adaptation to different environments. Currently, available programs for the estimation of positive selection signals can be divided into two groups. They are either easy to apply but can analyze only one gene family at a time, restricting system analysis; or they can handle larger cohorts of gene families, but require considerable prerequisite data such as orthology associations, codon alignments, phylogenetic trees, and proper configuration files. All these steps require extensive computational expertise, restricting this endeavor to specialists. Here, we introduce AlexandrusPS, a high-throughput pipeline that overcomes technical challenges when conducting transcriptome-wide positive selection analyses on large sets of nucleotide and protein sequences. The pipeline streamlines 1) the execution of an accurate orthology prediction as a precondition for positive selection analysis, 2) preparing and organizing configuration files for CodeML, 3) performing positive selection analysis using CodeML, and 4) generating an output that is easy to interpret, including all maximum likelihood and log-likelihood test results. The only input needed from the user is the CDS and peptide FASTA files of proteins of interest. The pipeline is provided in a Docker image, requiring no program or module installation, enabling the application of the pipeline in any computing environment. AlexandrusPS and its documentation are available via GitHub (https://github.com/alejocn5/AlexandrusPS).
format Online
Article
Text
id pubmed-10612477
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-106124772023-10-29 AlexandrusPS: A User-Friendly Pipeline for the Automated Detection of Orthologous Gene Clusters and Subsequent Positive Selection Analysis Ceron-Noriega, Alejandro Schoonenberg, Vivien A C Butter, Falk Levin, Michal Genome Biol Evol Letter The detection of adaptive selection in a system approach considering all protein-coding genes allows for the identification of mechanisms and pathways that enabled adaptation to different environments. Currently, available programs for the estimation of positive selection signals can be divided into two groups. They are either easy to apply but can analyze only one gene family at a time, restricting system analysis; or they can handle larger cohorts of gene families, but require considerable prerequisite data such as orthology associations, codon alignments, phylogenetic trees, and proper configuration files. All these steps require extensive computational expertise, restricting this endeavor to specialists. Here, we introduce AlexandrusPS, a high-throughput pipeline that overcomes technical challenges when conducting transcriptome-wide positive selection analyses on large sets of nucleotide and protein sequences. The pipeline streamlines 1) the execution of an accurate orthology prediction as a precondition for positive selection analysis, 2) preparing and organizing configuration files for CodeML, 3) performing positive selection analysis using CodeML, and 4) generating an output that is easy to interpret, including all maximum likelihood and log-likelihood test results. The only input needed from the user is the CDS and peptide FASTA files of proteins of interest. The pipeline is provided in a Docker image, requiring no program or module installation, enabling the application of the pipeline in any computing environment. AlexandrusPS and its documentation are available via GitHub (https://github.com/alejocn5/AlexandrusPS). Oxford University Press 2023-10-13 /pmc/articles/PMC10612477/ /pubmed/37831426 http://dx.doi.org/10.1093/gbe/evad187 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Letter
Ceron-Noriega, Alejandro
Schoonenberg, Vivien A C
Butter, Falk
Levin, Michal
AlexandrusPS: A User-Friendly Pipeline for the Automated Detection of Orthologous Gene Clusters and Subsequent Positive Selection Analysis
title AlexandrusPS: A User-Friendly Pipeline for the Automated Detection of Orthologous Gene Clusters and Subsequent Positive Selection Analysis
title_full AlexandrusPS: A User-Friendly Pipeline for the Automated Detection of Orthologous Gene Clusters and Subsequent Positive Selection Analysis
title_fullStr AlexandrusPS: A User-Friendly Pipeline for the Automated Detection of Orthologous Gene Clusters and Subsequent Positive Selection Analysis
title_full_unstemmed AlexandrusPS: A User-Friendly Pipeline for the Automated Detection of Orthologous Gene Clusters and Subsequent Positive Selection Analysis
title_short AlexandrusPS: A User-Friendly Pipeline for the Automated Detection of Orthologous Gene Clusters and Subsequent Positive Selection Analysis
title_sort alexandrusps: a user-friendly pipeline for the automated detection of orthologous gene clusters and subsequent positive selection analysis
topic Letter
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10612477/
https://www.ncbi.nlm.nih.gov/pubmed/37831426
http://dx.doi.org/10.1093/gbe/evad187
work_keys_str_mv AT ceronnoriegaalejandro alexandruspsauserfriendlypipelinefortheautomateddetectionoforthologousgeneclustersandsubsequentpositiveselectionanalysis
AT schoonenbergvivienac alexandruspsauserfriendlypipelinefortheautomateddetectionoforthologousgeneclustersandsubsequentpositiveselectionanalysis
AT butterfalk alexandruspsauserfriendlypipelinefortheautomateddetectionoforthologousgeneclustersandsubsequentpositiveselectionanalysis
AT levinmichal alexandruspsauserfriendlypipelinefortheautomateddetectionoforthologousgeneclustersandsubsequentpositiveselectionanalysis