Cargando…

Gene Family Level Comparative Analysis of Gene Expression in Mammals Validates the Ortholog Conjecture

The ortholog conjecture (OC), which is central to functional annotation of genomes, posits that orthologous genes are functionally more similar than paralogous genes at the same level of sequence divergence. However, a recent study challenged the OC by reporting a greater functional similarity, in t...

Descripción completa

Detalles Bibliográficos
Autores principales: Rogozin, Igor B., Managadze, David, Shabalina, Svetlana A., Koonin, Eugene V.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4007545/
https://www.ncbi.nlm.nih.gov/pubmed/24610837
http://dx.doi.org/10.1093/gbe/evu051
_version_ 1782314346269900800
author Rogozin, Igor B.
Managadze, David
Shabalina, Svetlana A.
Koonin, Eugene V.
author_facet Rogozin, Igor B.
Managadze, David
Shabalina, Svetlana A.
Koonin, Eugene V.
author_sort Rogozin, Igor B.
collection PubMed
description The ortholog conjecture (OC), which is central to functional annotation of genomes, posits that orthologous genes are functionally more similar than paralogous genes at the same level of sequence divergence. However, a recent study challenged the OC by reporting a greater functional similarity, in terms of Gene Ontology (GO) annotations and expression profiles, among within-species paralogs compared with orthologs. These findings were taken to indicate that functional similarity of homologous genes is primarily determined by the cellular context of the genes, rather than evolutionary history. However, several subsequent studies suggest that GO annotations and microarray data could artificially inflate functional similarity between paralogs from the same organism. We sought to test the OC using approaches distinct from those used in previous studies. Analysis of a large RNAseq data set from multiple human and mouse tissues shows that expression similarity (correlations coefficients, rank‘s, or Z-scores) between orthologs is substantially greater than that for between-species paralogs with the same sequence divergence, in agreement with the OC and the results of recent detailed analyses. These findings are further corroborated by a fine-grain analysis in which expression profiles of orthologs and paralogs were compared separately for individual gene families. Expression profiles of within-species paralogs are more strongly correlated than profiles of orthologs but it is shown that this is caused by high background noise, that is, correlation between profiles of unrelated genes in the same organism. Z-scores and rank scores show a nonmonotonic dependence of expression profile similarity on sequence divergence. This complexity of gene expression evolution after duplication might be at least partially caused by selection for protein dosage rebalancing following gene duplication.
format Online
Article
Text
id pubmed-4007545
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-40075452015-03-07 Gene Family Level Comparative Analysis of Gene Expression in Mammals Validates the Ortholog Conjecture Rogozin, Igor B. Managadze, David Shabalina, Svetlana A. Koonin, Eugene V. Genome Biol Evol Research Article The ortholog conjecture (OC), which is central to functional annotation of genomes, posits that orthologous genes are functionally more similar than paralogous genes at the same level of sequence divergence. However, a recent study challenged the OC by reporting a greater functional similarity, in terms of Gene Ontology (GO) annotations and expression profiles, among within-species paralogs compared with orthologs. These findings were taken to indicate that functional similarity of homologous genes is primarily determined by the cellular context of the genes, rather than evolutionary history. However, several subsequent studies suggest that GO annotations and microarray data could artificially inflate functional similarity between paralogs from the same organism. We sought to test the OC using approaches distinct from those used in previous studies. Analysis of a large RNAseq data set from multiple human and mouse tissues shows that expression similarity (correlations coefficients, rank‘s, or Z-scores) between orthologs is substantially greater than that for between-species paralogs with the same sequence divergence, in agreement with the OC and the results of recent detailed analyses. These findings are further corroborated by a fine-grain analysis in which expression profiles of orthologs and paralogs were compared separately for individual gene families. Expression profiles of within-species paralogs are more strongly correlated than profiles of orthologs but it is shown that this is caused by high background noise, that is, correlation between profiles of unrelated genes in the same organism. Z-scores and rank scores show a nonmonotonic dependence of expression profile similarity on sequence divergence. This complexity of gene expression evolution after duplication might be at least partially caused by selection for protein dosage rebalancing following gene duplication. Oxford University Press 2014-03-07 /pmc/articles/PMC4007545/ /pubmed/24610837 http://dx.doi.org/10.1093/gbe/evu051 Text en Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution 2014. This work is written by US Government employees and is in the public domain in the US.
spellingShingle Research Article
Rogozin, Igor B.
Managadze, David
Shabalina, Svetlana A.
Koonin, Eugene V.
Gene Family Level Comparative Analysis of Gene Expression in Mammals Validates the Ortholog Conjecture
title Gene Family Level Comparative Analysis of Gene Expression in Mammals Validates the Ortholog Conjecture
title_full Gene Family Level Comparative Analysis of Gene Expression in Mammals Validates the Ortholog Conjecture
title_fullStr Gene Family Level Comparative Analysis of Gene Expression in Mammals Validates the Ortholog Conjecture
title_full_unstemmed Gene Family Level Comparative Analysis of Gene Expression in Mammals Validates the Ortholog Conjecture
title_short Gene Family Level Comparative Analysis of Gene Expression in Mammals Validates the Ortholog Conjecture
title_sort gene family level comparative analysis of gene expression in mammals validates the ortholog conjecture
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4007545/
https://www.ncbi.nlm.nih.gov/pubmed/24610837
http://dx.doi.org/10.1093/gbe/evu051
work_keys_str_mv AT rogozinigorb genefamilylevelcomparativeanalysisofgeneexpressioninmammalsvalidatestheorthologconjecture
AT managadzedavid genefamilylevelcomparativeanalysisofgeneexpressioninmammalsvalidatestheorthologconjecture
AT shabalinasvetlanaa genefamilylevelcomparativeanalysisofgeneexpressioninmammalsvalidatestheorthologconjecture
AT koonineugenev genefamilylevelcomparativeanalysisofgeneexpressioninmammalsvalidatestheorthologconjecture