Cargando…

Identification of hidden associations among eukaryotic genes through statistical analysis of coevolutionary transitions

Coevolution at the gene level, as reflected by correlated events of gene loss or gain, can be revealed by phylogenetic profile analysis. The optimal method and metric for comparing phylogenetic profiles, especially in eukaryotic genomes, are not yet established. Here, we describe a procedure suitabl...

Descripción completa

Detalles Bibliográficos
Autores principales: Dembech, Elena, Malatesta, Marco, De Rito, Carlo, Mori, Giulia, Cavazzini, Davide, Secchi, Andrea, Morandin, Francesco, Percudani, Riccardo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10120013/
https://www.ncbi.nlm.nih.gov/pubmed/37043529
http://dx.doi.org/10.1073/pnas.2218329120
_version_ 1785029107049824256
author Dembech, Elena
Malatesta, Marco
De Rito, Carlo
Mori, Giulia
Cavazzini, Davide
Secchi, Andrea
Morandin, Francesco
Percudani, Riccardo
author_facet Dembech, Elena
Malatesta, Marco
De Rito, Carlo
Mori, Giulia
Cavazzini, Davide
Secchi, Andrea
Morandin, Francesco
Percudani, Riccardo
author_sort Dembech, Elena
collection PubMed
description Coevolution at the gene level, as reflected by correlated events of gene loss or gain, can be revealed by phylogenetic profile analysis. The optimal method and metric for comparing phylogenetic profiles, especially in eukaryotic genomes, are not yet established. Here, we describe a procedure suitable for large-scale analysis, which can reveal coevolution based on the assessment of the statistical significance of correlated presence/absence transitions between gene pairs. This metric can identify coevolution in profiles with low overall similarities and is not affected by similarities lacking coevolutionary information. We applied the procedure to a large collection of 60,912 orthologous gene groups (orthogroups) in 1,264 eukaryotic genomes extracted from OrthoDB. We found significant cotransition scores for 7,825 orthogroups associated in 2,401 coevolving modules linking known and unknown genes in protein complexes and biological pathways. To demonstrate the ability of the method to predict hidden gene associations, we validated through experiments the involvement of vertebrate malate synthase-like genes in the conversion of (S)-ureidoglycolate into glyoxylate and urea, the last step of purine catabolism. This identification explains the presence of glyoxylate cycle genes in metazoa and suggests an anaplerotic role of purine degradation in early eukaryotes.
format Online
Article
Text
id pubmed-10120013
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-101200132023-10-12 Identification of hidden associations among eukaryotic genes through statistical analysis of coevolutionary transitions Dembech, Elena Malatesta, Marco De Rito, Carlo Mori, Giulia Cavazzini, Davide Secchi, Andrea Morandin, Francesco Percudani, Riccardo Proc Natl Acad Sci U S A Biological Sciences Coevolution at the gene level, as reflected by correlated events of gene loss or gain, can be revealed by phylogenetic profile analysis. The optimal method and metric for comparing phylogenetic profiles, especially in eukaryotic genomes, are not yet established. Here, we describe a procedure suitable for large-scale analysis, which can reveal coevolution based on the assessment of the statistical significance of correlated presence/absence transitions between gene pairs. This metric can identify coevolution in profiles with low overall similarities and is not affected by similarities lacking coevolutionary information. We applied the procedure to a large collection of 60,912 orthologous gene groups (orthogroups) in 1,264 eukaryotic genomes extracted from OrthoDB. We found significant cotransition scores for 7,825 orthogroups associated in 2,401 coevolving modules linking known and unknown genes in protein complexes and biological pathways. To demonstrate the ability of the method to predict hidden gene associations, we validated through experiments the involvement of vertebrate malate synthase-like genes in the conversion of (S)-ureidoglycolate into glyoxylate and urea, the last step of purine catabolism. This identification explains the presence of glyoxylate cycle genes in metazoa and suggests an anaplerotic role of purine degradation in early eukaryotes. National Academy of Sciences 2023-04-12 2023-04-18 /pmc/articles/PMC10120013/ /pubmed/37043529 http://dx.doi.org/10.1073/pnas.2218329120 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Biological Sciences
Dembech, Elena
Malatesta, Marco
De Rito, Carlo
Mori, Giulia
Cavazzini, Davide
Secchi, Andrea
Morandin, Francesco
Percudani, Riccardo
Identification of hidden associations among eukaryotic genes through statistical analysis of coevolutionary transitions
title Identification of hidden associations among eukaryotic genes through statistical analysis of coevolutionary transitions
title_full Identification of hidden associations among eukaryotic genes through statistical analysis of coevolutionary transitions
title_fullStr Identification of hidden associations among eukaryotic genes through statistical analysis of coevolutionary transitions
title_full_unstemmed Identification of hidden associations among eukaryotic genes through statistical analysis of coevolutionary transitions
title_short Identification of hidden associations among eukaryotic genes through statistical analysis of coevolutionary transitions
title_sort identification of hidden associations among eukaryotic genes through statistical analysis of coevolutionary transitions
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10120013/
https://www.ncbi.nlm.nih.gov/pubmed/37043529
http://dx.doi.org/10.1073/pnas.2218329120
work_keys_str_mv AT dembechelena identificationofhiddenassociationsamongeukaryoticgenesthroughstatisticalanalysisofcoevolutionarytransitions
AT malatestamarco identificationofhiddenassociationsamongeukaryoticgenesthroughstatisticalanalysisofcoevolutionarytransitions
AT deritocarlo identificationofhiddenassociationsamongeukaryoticgenesthroughstatisticalanalysisofcoevolutionarytransitions
AT morigiulia identificationofhiddenassociationsamongeukaryoticgenesthroughstatisticalanalysisofcoevolutionarytransitions
AT cavazzinidavide identificationofhiddenassociationsamongeukaryoticgenesthroughstatisticalanalysisofcoevolutionarytransitions
AT secchiandrea identificationofhiddenassociationsamongeukaryoticgenesthroughstatisticalanalysisofcoevolutionarytransitions
AT morandinfrancesco identificationofhiddenassociationsamongeukaryoticgenesthroughstatisticalanalysisofcoevolutionarytransitions
AT percudaniriccardo identificationofhiddenassociationsamongeukaryoticgenesthroughstatisticalanalysisofcoevolutionarytransitions