Cargando…

Genome-Wide Survey for Biologically Functional Pseudogenes

According to current estimates there exist about 20,000 pseudogenes in a mammalian genome. The vast majority of these are disabled and nonfunctional copies of protein-coding genes which, therefore, evolve neutrally. Recent findings that a Makorin1 pseudogene, residing on mouse Chromosome 5, is, inde...

Descripción completa

Detalles Bibliográficos
Autores principales: Svensson, Örjan, Arvestad, Lars, Lagergren, Jens
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1456316/
https://www.ncbi.nlm.nih.gov/pubmed/16680195
http://dx.doi.org/10.1371/journal.pcbi.0020046
_version_ 1782127403823267840
author Svensson, Örjan
Arvestad, Lars
Lagergren, Jens
author_facet Svensson, Örjan
Arvestad, Lars
Lagergren, Jens
author_sort Svensson, Örjan
collection PubMed
description According to current estimates there exist about 20,000 pseudogenes in a mammalian genome. The vast majority of these are disabled and nonfunctional copies of protein-coding genes which, therefore, evolve neutrally. Recent findings that a Makorin1 pseudogene, residing on mouse Chromosome 5, is, indeed, in vivo vital and also evolutionarily preserved, encouraged us to conduct a genome-wide survey for other functional pseudogenes in human, mouse, and chimpanzee. We identify to our knowledge the first examples of conserved pseudogenes common to human and mouse, originating from one duplication predating the human–mouse species split and having evolved as pseudogenes since the species split. Functionality is one possible way to explain the apparently contradictory properties of such pseudogene pairs, i.e., high conservation and ancient origin. The hypothesis of functionality is tested by comparing expression evidence and synteny of the candidates with proper test sets. The tests suggest potential biological function. Our candidate set includes a small set of long-lived pseudogenes whose unknown potential function is retained since before the human–mouse species split, and also a larger group of primate-specific ones found from human–chimpanzee searches. Two processed sequences are notable, their conservation since the human–mouse split being as high as most protein-coding genes; one is derived from the protein Ataxin 7-like 3 (ATX7NL3), and one from the Spinocerebellar ataxia type 1 protein (ATX1). Our approach is comparative and can be applied to any pair of species. It is implemented by a semi-automated pipeline based on cross-species BLAST comparisons and maximum-likelihood phylogeny estimations. To separate pseudogenes from protein-coding genes, we use standard methods, utilizing in-frame disablements, as well as a probabilistic filter based on Ka/Ks ratios.
format Text
id pubmed-1456316
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-14563162006-05-26 Genome-Wide Survey for Biologically Functional Pseudogenes Svensson, Örjan Arvestad, Lars Lagergren, Jens PLoS Comput Biol Research Article According to current estimates there exist about 20,000 pseudogenes in a mammalian genome. The vast majority of these are disabled and nonfunctional copies of protein-coding genes which, therefore, evolve neutrally. Recent findings that a Makorin1 pseudogene, residing on mouse Chromosome 5, is, indeed, in vivo vital and also evolutionarily preserved, encouraged us to conduct a genome-wide survey for other functional pseudogenes in human, mouse, and chimpanzee. We identify to our knowledge the first examples of conserved pseudogenes common to human and mouse, originating from one duplication predating the human–mouse species split and having evolved as pseudogenes since the species split. Functionality is one possible way to explain the apparently contradictory properties of such pseudogene pairs, i.e., high conservation and ancient origin. The hypothesis of functionality is tested by comparing expression evidence and synteny of the candidates with proper test sets. The tests suggest potential biological function. Our candidate set includes a small set of long-lived pseudogenes whose unknown potential function is retained since before the human–mouse species split, and also a larger group of primate-specific ones found from human–chimpanzee searches. Two processed sequences are notable, their conservation since the human–mouse split being as high as most protein-coding genes; one is derived from the protein Ataxin 7-like 3 (ATX7NL3), and one from the Spinocerebellar ataxia type 1 protein (ATX1). Our approach is comparative and can be applied to any pair of species. It is implemented by a semi-automated pipeline based on cross-species BLAST comparisons and maximum-likelihood phylogeny estimations. To separate pseudogenes from protein-coding genes, we use standard methods, utilizing in-frame disablements, as well as a probabilistic filter based on Ka/Ks ratios. Public Library of Science 2006-05 2006-05-05 /pmc/articles/PMC1456316/ /pubmed/16680195 http://dx.doi.org/10.1371/journal.pcbi.0020046 Text en © 2006 Svensson et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Svensson, Örjan
Arvestad, Lars
Lagergren, Jens
Genome-Wide Survey for Biologically Functional Pseudogenes
title Genome-Wide Survey for Biologically Functional Pseudogenes
title_full Genome-Wide Survey for Biologically Functional Pseudogenes
title_fullStr Genome-Wide Survey for Biologically Functional Pseudogenes
title_full_unstemmed Genome-Wide Survey for Biologically Functional Pseudogenes
title_short Genome-Wide Survey for Biologically Functional Pseudogenes
title_sort genome-wide survey for biologically functional pseudogenes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1456316/
https://www.ncbi.nlm.nih.gov/pubmed/16680195
http://dx.doi.org/10.1371/journal.pcbi.0020046
work_keys_str_mv AT svenssonorjan genomewidesurveyforbiologicallyfunctionalpseudogenes
AT arvestadlars genomewidesurveyforbiologicallyfunctionalpseudogenes
AT lagergrenjens genomewidesurveyforbiologicallyfunctionalpseudogenes