Cargando…
Identification of hidden associations among eukaryotic genes through statistical analysis of coevolutionary transitions
Coevolution at the gene level, as reflected by correlated events of gene loss or gain, can be revealed by phylogenetic profile analysis. The optimal method and metric for comparing phylogenetic profiles, especially in eukaryotic genomes, are not yet established. Here, we describe a procedure suitabl...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
National Academy of Sciences
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10120013/ https://www.ncbi.nlm.nih.gov/pubmed/37043529 http://dx.doi.org/10.1073/pnas.2218329120 |
_version_ | 1785029107049824256 |
---|---|
author | Dembech, Elena Malatesta, Marco De Rito, Carlo Mori, Giulia Cavazzini, Davide Secchi, Andrea Morandin, Francesco Percudani, Riccardo |
author_facet | Dembech, Elena Malatesta, Marco De Rito, Carlo Mori, Giulia Cavazzini, Davide Secchi, Andrea Morandin, Francesco Percudani, Riccardo |
author_sort | Dembech, Elena |
collection | PubMed |
description | Coevolution at the gene level, as reflected by correlated events of gene loss or gain, can be revealed by phylogenetic profile analysis. The optimal method and metric for comparing phylogenetic profiles, especially in eukaryotic genomes, are not yet established. Here, we describe a procedure suitable for large-scale analysis, which can reveal coevolution based on the assessment of the statistical significance of correlated presence/absence transitions between gene pairs. This metric can identify coevolution in profiles with low overall similarities and is not affected by similarities lacking coevolutionary information. We applied the procedure to a large collection of 60,912 orthologous gene groups (orthogroups) in 1,264 eukaryotic genomes extracted from OrthoDB. We found significant cotransition scores for 7,825 orthogroups associated in 2,401 coevolving modules linking known and unknown genes in protein complexes and biological pathways. To demonstrate the ability of the method to predict hidden gene associations, we validated through experiments the involvement of vertebrate malate synthase-like genes in the conversion of (S)-ureidoglycolate into glyoxylate and urea, the last step of purine catabolism. This identification explains the presence of glyoxylate cycle genes in metazoa and suggests an anaplerotic role of purine degradation in early eukaryotes. |
format | Online Article Text |
id | pubmed-10120013 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | National Academy of Sciences |
record_format | MEDLINE/PubMed |
spelling | pubmed-101200132023-10-12 Identification of hidden associations among eukaryotic genes through statistical analysis of coevolutionary transitions Dembech, Elena Malatesta, Marco De Rito, Carlo Mori, Giulia Cavazzini, Davide Secchi, Andrea Morandin, Francesco Percudani, Riccardo Proc Natl Acad Sci U S A Biological Sciences Coevolution at the gene level, as reflected by correlated events of gene loss or gain, can be revealed by phylogenetic profile analysis. The optimal method and metric for comparing phylogenetic profiles, especially in eukaryotic genomes, are not yet established. Here, we describe a procedure suitable for large-scale analysis, which can reveal coevolution based on the assessment of the statistical significance of correlated presence/absence transitions between gene pairs. This metric can identify coevolution in profiles with low overall similarities and is not affected by similarities lacking coevolutionary information. We applied the procedure to a large collection of 60,912 orthologous gene groups (orthogroups) in 1,264 eukaryotic genomes extracted from OrthoDB. We found significant cotransition scores for 7,825 orthogroups associated in 2,401 coevolving modules linking known and unknown genes in protein complexes and biological pathways. To demonstrate the ability of the method to predict hidden gene associations, we validated through experiments the involvement of vertebrate malate synthase-like genes in the conversion of (S)-ureidoglycolate into glyoxylate and urea, the last step of purine catabolism. This identification explains the presence of glyoxylate cycle genes in metazoa and suggests an anaplerotic role of purine degradation in early eukaryotes. National Academy of Sciences 2023-04-12 2023-04-18 /pmc/articles/PMC10120013/ /pubmed/37043529 http://dx.doi.org/10.1073/pnas.2218329120 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Biological Sciences Dembech, Elena Malatesta, Marco De Rito, Carlo Mori, Giulia Cavazzini, Davide Secchi, Andrea Morandin, Francesco Percudani, Riccardo Identification of hidden associations among eukaryotic genes through statistical analysis of coevolutionary transitions |
title | Identification of hidden associations among eukaryotic genes through statistical analysis of coevolutionary transitions |
title_full | Identification of hidden associations among eukaryotic genes through statistical analysis of coevolutionary transitions |
title_fullStr | Identification of hidden associations among eukaryotic genes through statistical analysis of coevolutionary transitions |
title_full_unstemmed | Identification of hidden associations among eukaryotic genes through statistical analysis of coevolutionary transitions |
title_short | Identification of hidden associations among eukaryotic genes through statistical analysis of coevolutionary transitions |
title_sort | identification of hidden associations among eukaryotic genes through statistical analysis of coevolutionary transitions |
topic | Biological Sciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10120013/ https://www.ncbi.nlm.nih.gov/pubmed/37043529 http://dx.doi.org/10.1073/pnas.2218329120 |
work_keys_str_mv | AT dembechelena identificationofhiddenassociationsamongeukaryoticgenesthroughstatisticalanalysisofcoevolutionarytransitions AT malatestamarco identificationofhiddenassociationsamongeukaryoticgenesthroughstatisticalanalysisofcoevolutionarytransitions AT deritocarlo identificationofhiddenassociationsamongeukaryoticgenesthroughstatisticalanalysisofcoevolutionarytransitions AT morigiulia identificationofhiddenassociationsamongeukaryoticgenesthroughstatisticalanalysisofcoevolutionarytransitions AT cavazzinidavide identificationofhiddenassociationsamongeukaryoticgenesthroughstatisticalanalysisofcoevolutionarytransitions AT secchiandrea identificationofhiddenassociationsamongeukaryoticgenesthroughstatisticalanalysisofcoevolutionarytransitions AT morandinfrancesco identificationofhiddenassociationsamongeukaryoticgenesthroughstatisticalanalysisofcoevolutionarytransitions AT percudaniriccardo identificationofhiddenassociationsamongeukaryoticgenesthroughstatisticalanalysisofcoevolutionarytransitions |