Cargando…
Ancestral state reconstruction of metabolic pathways across pangenome ensembles
As genome sequencing efforts are unveiling the genetic diversity of the biosphere with an unprecedented speed, there is a need to accurately describe the structural and functional properties of groups of extant species whose genomes have been sequenced, as well as their inferred ancestors, at any gi...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Microbiology Society
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7725326/ https://www.ncbi.nlm.nih.gov/pubmed/32924924 http://dx.doi.org/10.1099/mgen.0.000429 |
_version_ | 1783620682842636288 |
---|---|
author | Psomopoulos, Fotis E. van Helden, Jacques Médigue, Claudine Chasapi, Anastasia Ouzounis, Christos A. |
author_facet | Psomopoulos, Fotis E. van Helden, Jacques Médigue, Claudine Chasapi, Anastasia Ouzounis, Christos A. |
author_sort | Psomopoulos, Fotis E. |
collection | PubMed |
description | As genome sequencing efforts are unveiling the genetic diversity of the biosphere with an unprecedented speed, there is a need to accurately describe the structural and functional properties of groups of extant species whose genomes have been sequenced, as well as their inferred ancestors, at any given taxonomic level of their phylogeny. Elaborate approaches for the reconstruction of ancestral states at the sequence level have been developed, subsequently augmented by methods based on gene content. While these approaches of sequence or gene-content reconstruction have been successfully deployed, there has been less progress on the explicit inference of functional properties of ancestral genomes, in terms of metabolic pathways and other cellular processes. Herein, we describe PathTrace, an efficient algorithm for parsimony-based reconstructions of the evolutionary history of individual metabolic pathways, pivotal representations of key functional modules of cellular function. The algorithm is implemented as a five-step process through which pathways are represented as fuzzy vectors, where each enzyme is associated with a taxonomic conservation value derived from the phylogenetic profile of its protein sequence. The method is evaluated with a selected benchmark set of pathways against collections of genome sequences from key data resources. By deploying a pangenome-driven approach for pathway sets, we demonstrate that the inferred patterns are largely insensitive to noise, as opposed to gene-content reconstruction methods. In addition, the resulting reconstructions are closely correlated with the evolutionary distance of the taxa under study, suggesting that a diligent selection of target pangenomes is essential for maintaining cohesiveness of the method and consistency of the inference, serving as an internal control for an arbitrary selection of queries. The PathTrace method is a first step towards the large-scale analysis of metabolic pathway evolution and our deeper understanding of functional relationships reflected in emerging pangenome collections. |
format | Online Article Text |
id | pubmed-7725326 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Microbiology Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-77253262020-12-14 Ancestral state reconstruction of metabolic pathways across pangenome ensembles Psomopoulos, Fotis E. van Helden, Jacques Médigue, Claudine Chasapi, Anastasia Ouzounis, Christos A. Microb Genom Research Article As genome sequencing efforts are unveiling the genetic diversity of the biosphere with an unprecedented speed, there is a need to accurately describe the structural and functional properties of groups of extant species whose genomes have been sequenced, as well as their inferred ancestors, at any given taxonomic level of their phylogeny. Elaborate approaches for the reconstruction of ancestral states at the sequence level have been developed, subsequently augmented by methods based on gene content. While these approaches of sequence or gene-content reconstruction have been successfully deployed, there has been less progress on the explicit inference of functional properties of ancestral genomes, in terms of metabolic pathways and other cellular processes. Herein, we describe PathTrace, an efficient algorithm for parsimony-based reconstructions of the evolutionary history of individual metabolic pathways, pivotal representations of key functional modules of cellular function. The algorithm is implemented as a five-step process through which pathways are represented as fuzzy vectors, where each enzyme is associated with a taxonomic conservation value derived from the phylogenetic profile of its protein sequence. The method is evaluated with a selected benchmark set of pathways against collections of genome sequences from key data resources. By deploying a pangenome-driven approach for pathway sets, we demonstrate that the inferred patterns are largely insensitive to noise, as opposed to gene-content reconstruction methods. In addition, the resulting reconstructions are closely correlated with the evolutionary distance of the taxa under study, suggesting that a diligent selection of target pangenomes is essential for maintaining cohesiveness of the method and consistency of the inference, serving as an internal control for an arbitrary selection of queries. The PathTrace method is a first step towards the large-scale analysis of metabolic pathway evolution and our deeper understanding of functional relationships reflected in emerging pangenome collections. Microbiology Society 2020-09-14 /pmc/articles/PMC7725326/ /pubmed/32924924 http://dx.doi.org/10.1099/mgen.0.000429 Text en © 2020 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License. |
spellingShingle | Research Article Psomopoulos, Fotis E. van Helden, Jacques Médigue, Claudine Chasapi, Anastasia Ouzounis, Christos A. Ancestral state reconstruction of metabolic pathways across pangenome ensembles |
title | Ancestral state reconstruction of metabolic pathways across pangenome ensembles |
title_full | Ancestral state reconstruction of metabolic pathways across pangenome ensembles |
title_fullStr | Ancestral state reconstruction of metabolic pathways across pangenome ensembles |
title_full_unstemmed | Ancestral state reconstruction of metabolic pathways across pangenome ensembles |
title_short | Ancestral state reconstruction of metabolic pathways across pangenome ensembles |
title_sort | ancestral state reconstruction of metabolic pathways across pangenome ensembles |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7725326/ https://www.ncbi.nlm.nih.gov/pubmed/32924924 http://dx.doi.org/10.1099/mgen.0.000429 |
work_keys_str_mv | AT psomopoulosfotise ancestralstatereconstructionofmetabolicpathwaysacrosspangenomeensembles AT vanheldenjacques ancestralstatereconstructionofmetabolicpathwaysacrosspangenomeensembles AT medigueclaudine ancestralstatereconstructionofmetabolicpathwaysacrosspangenomeensembles AT chasapianastasia ancestralstatereconstructionofmetabolicpathwaysacrosspangenomeensembles AT ouzounischristosa ancestralstatereconstructionofmetabolicpathwaysacrosspangenomeensembles |