Cargando…
BASE: A novel workflow to integrate nonubiquitous genes in comparative genomics analyses for selection
Inferring the selective forces that orthologous genes underwent across different lineages can help us understand the evolutionary processes that have shaped their extant diversity and the phenotypes they underlie. The most widespread metric to estimate the selection regimes of coding genes—across si...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8495783/ https://www.ncbi.nlm.nih.gov/pubmed/34646450 http://dx.doi.org/10.1002/ece3.7959 |
_version_ | 1784579617183498240 |
---|---|
author | Forni, Giobbe Ruggieri, Angelo Alberto Piccinini, Giovanni Luchetti, Andrea |
author_facet | Forni, Giobbe Ruggieri, Angelo Alberto Piccinini, Giovanni Luchetti, Andrea |
author_sort | Forni, Giobbe |
collection | PubMed |
description | Inferring the selective forces that orthologous genes underwent across different lineages can help us understand the evolutionary processes that have shaped their extant diversity and the phenotypes they underlie. The most widespread metric to estimate the selection regimes of coding genes—across sites and phylogenies—is the ratio of nonsynonymous to synonymous substitutions (dN/dS, also known as ω). Nowadays, modern sequencing technologies and the large amount of already available sequence data allow the retrieval of thousands of orthologous genes across large numbers of species. Nonetheless, the tools available to explore selection regimes are not designed to automatically process all genes, and their practical usage is often restricted to the single‐copy ones which are found across all species considered (i.e., ubiquitous genes). This approach limits the scale of the analysis to a fraction of single‐copy genes, which can be as low as an order of magnitude in respect to those which are not consistently found in all species considered (i.e., nonubiquitous genes). Here, we present a workflow named BASE that—leveraging the CodeML framework—eases the inference and interpretation of gene selection regimes in the context of comparative genomics. Although a number of bioinformatics tools have already been developed to facilitate this kind of analyses, BASE is the first to be specifically designed to allow the integration of nonubiquitous genes in a straightforward and reproducible manner. The workflow—along with all relevant documentation—is available at github.com/for‐giobbe/BASE. |
format | Online Article Text |
id | pubmed-8495783 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-84957832021-10-12 BASE: A novel workflow to integrate nonubiquitous genes in comparative genomics analyses for selection Forni, Giobbe Ruggieri, Angelo Alberto Piccinini, Giovanni Luchetti, Andrea Ecol Evol Original Research Inferring the selective forces that orthologous genes underwent across different lineages can help us understand the evolutionary processes that have shaped their extant diversity and the phenotypes they underlie. The most widespread metric to estimate the selection regimes of coding genes—across sites and phylogenies—is the ratio of nonsynonymous to synonymous substitutions (dN/dS, also known as ω). Nowadays, modern sequencing technologies and the large amount of already available sequence data allow the retrieval of thousands of orthologous genes across large numbers of species. Nonetheless, the tools available to explore selection regimes are not designed to automatically process all genes, and their practical usage is often restricted to the single‐copy ones which are found across all species considered (i.e., ubiquitous genes). This approach limits the scale of the analysis to a fraction of single‐copy genes, which can be as low as an order of magnitude in respect to those which are not consistently found in all species considered (i.e., nonubiquitous genes). Here, we present a workflow named BASE that—leveraging the CodeML framework—eases the inference and interpretation of gene selection regimes in the context of comparative genomics. Although a number of bioinformatics tools have already been developed to facilitate this kind of analyses, BASE is the first to be specifically designed to allow the integration of nonubiquitous genes in a straightforward and reproducible manner. The workflow—along with all relevant documentation—is available at github.com/for‐giobbe/BASE. John Wiley and Sons Inc. 2021-09-03 /pmc/articles/PMC8495783/ /pubmed/34646450 http://dx.doi.org/10.1002/ece3.7959 Text en © 2021 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Research Forni, Giobbe Ruggieri, Angelo Alberto Piccinini, Giovanni Luchetti, Andrea BASE: A novel workflow to integrate nonubiquitous genes in comparative genomics analyses for selection |
title | BASE: A novel workflow to integrate nonubiquitous genes in comparative genomics analyses for selection |
title_full | BASE: A novel workflow to integrate nonubiquitous genes in comparative genomics analyses for selection |
title_fullStr | BASE: A novel workflow to integrate nonubiquitous genes in comparative genomics analyses for selection |
title_full_unstemmed | BASE: A novel workflow to integrate nonubiquitous genes in comparative genomics analyses for selection |
title_short | BASE: A novel workflow to integrate nonubiquitous genes in comparative genomics analyses for selection |
title_sort | base: a novel workflow to integrate nonubiquitous genes in comparative genomics analyses for selection |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8495783/ https://www.ncbi.nlm.nih.gov/pubmed/34646450 http://dx.doi.org/10.1002/ece3.7959 |
work_keys_str_mv | AT fornigiobbe baseanovelworkflowtointegratenonubiquitousgenesincomparativegenomicsanalysesforselection AT ruggieriangeloalberto baseanovelworkflowtointegratenonubiquitousgenesincomparativegenomicsanalysesforselection AT piccininigiovanni baseanovelworkflowtointegratenonubiquitousgenesincomparativegenomicsanalysesforselection AT luchettiandrea baseanovelworkflowtointegratenonubiquitousgenesincomparativegenomicsanalysesforselection |