Cargando…

BASE: A novel workflow to integrate nonubiquitous genes in comparative genomics analyses for selection

Inferring the selective forces that orthologous genes underwent across different lineages can help us understand the evolutionary processes that have shaped their extant diversity and the phenotypes they underlie. The most widespread metric to estimate the selection regimes of coding genes—across si...

Descripción completa

Detalles Bibliográficos
Autores principales: Forni, Giobbe, Ruggieri, Angelo Alberto, Piccinini, Giovanni, Luchetti, Andrea
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8495783/
https://www.ncbi.nlm.nih.gov/pubmed/34646450
http://dx.doi.org/10.1002/ece3.7959
_version_ 1784579617183498240
author Forni, Giobbe
Ruggieri, Angelo Alberto
Piccinini, Giovanni
Luchetti, Andrea
author_facet Forni, Giobbe
Ruggieri, Angelo Alberto
Piccinini, Giovanni
Luchetti, Andrea
author_sort Forni, Giobbe
collection PubMed
description Inferring the selective forces that orthologous genes underwent across different lineages can help us understand the evolutionary processes that have shaped their extant diversity and the phenotypes they underlie. The most widespread metric to estimate the selection regimes of coding genes—across sites and phylogenies—is the ratio of nonsynonymous to synonymous substitutions (dN/dS, also known as ω). Nowadays, modern sequencing technologies and the large amount of already available sequence data allow the retrieval of thousands of orthologous genes across large numbers of species. Nonetheless, the tools available to explore selection regimes are not designed to automatically process all genes, and their practical usage is often restricted to the single‐copy ones which are found across all species considered (i.e., ubiquitous genes). This approach limits the scale of the analysis to a fraction of single‐copy genes, which can be as low as an order of magnitude in respect to those which are not consistently found in all species considered (i.e., nonubiquitous genes). Here, we present a workflow named BASE that—leveraging the CodeML framework—eases the inference and interpretation of gene selection regimes in the context of comparative genomics. Although a number of bioinformatics tools have already been developed to facilitate this kind of analyses, BASE is the first to be specifically designed to allow the integration of nonubiquitous genes in a straightforward and reproducible manner. The workflow—along with all relevant documentation—is available at github.com/for‐giobbe/BASE.
format Online
Article
Text
id pubmed-8495783
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-84957832021-10-12 BASE: A novel workflow to integrate nonubiquitous genes in comparative genomics analyses for selection Forni, Giobbe Ruggieri, Angelo Alberto Piccinini, Giovanni Luchetti, Andrea Ecol Evol Original Research Inferring the selective forces that orthologous genes underwent across different lineages can help us understand the evolutionary processes that have shaped their extant diversity and the phenotypes they underlie. The most widespread metric to estimate the selection regimes of coding genes—across sites and phylogenies—is the ratio of nonsynonymous to synonymous substitutions (dN/dS, also known as ω). Nowadays, modern sequencing technologies and the large amount of already available sequence data allow the retrieval of thousands of orthologous genes across large numbers of species. Nonetheless, the tools available to explore selection regimes are not designed to automatically process all genes, and their practical usage is often restricted to the single‐copy ones which are found across all species considered (i.e., ubiquitous genes). This approach limits the scale of the analysis to a fraction of single‐copy genes, which can be as low as an order of magnitude in respect to those which are not consistently found in all species considered (i.e., nonubiquitous genes). Here, we present a workflow named BASE that—leveraging the CodeML framework—eases the inference and interpretation of gene selection regimes in the context of comparative genomics. Although a number of bioinformatics tools have already been developed to facilitate this kind of analyses, BASE is the first to be specifically designed to allow the integration of nonubiquitous genes in a straightforward and reproducible manner. The workflow—along with all relevant documentation—is available at github.com/for‐giobbe/BASE. John Wiley and Sons Inc. 2021-09-03 /pmc/articles/PMC8495783/ /pubmed/34646450 http://dx.doi.org/10.1002/ece3.7959 Text en © 2021 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Research
Forni, Giobbe
Ruggieri, Angelo Alberto
Piccinini, Giovanni
Luchetti, Andrea
BASE: A novel workflow to integrate nonubiquitous genes in comparative genomics analyses for selection
title BASE: A novel workflow to integrate nonubiquitous genes in comparative genomics analyses for selection
title_full BASE: A novel workflow to integrate nonubiquitous genes in comparative genomics analyses for selection
title_fullStr BASE: A novel workflow to integrate nonubiquitous genes in comparative genomics analyses for selection
title_full_unstemmed BASE: A novel workflow to integrate nonubiquitous genes in comparative genomics analyses for selection
title_short BASE: A novel workflow to integrate nonubiquitous genes in comparative genomics analyses for selection
title_sort base: a novel workflow to integrate nonubiquitous genes in comparative genomics analyses for selection
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8495783/
https://www.ncbi.nlm.nih.gov/pubmed/34646450
http://dx.doi.org/10.1002/ece3.7959
work_keys_str_mv AT fornigiobbe baseanovelworkflowtointegratenonubiquitousgenesincomparativegenomicsanalysesforselection
AT ruggieriangeloalberto baseanovelworkflowtointegratenonubiquitousgenesincomparativegenomicsanalysesforselection
AT piccininigiovanni baseanovelworkflowtointegratenonubiquitousgenesincomparativegenomicsanalysesforselection
AT luchettiandrea baseanovelworkflowtointegratenonubiquitousgenesincomparativegenomicsanalysesforselection