Cargando…

Comparative analysis of protein coding sequences from human, mouse and the domesticated pig

BACKGROUND: The availability of abundant sequence data from key model organisms has made large scale studies of molecular evolution an exciting possibility. Here we use full length cDNA alignments comprising more than 700,000 nucleotides from human, mouse, pig and the Japanese pufferfish Fugu rubric...

Descripción completa

Detalles Bibliográficos
Autores principales: Jørgensen, Frank Grønlund, Hobolth, Asger, Hornshøj, Henrik, Bendixen, Christian, Fredholm, Merete, Schierup, Mikkel Heide
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC549206/
https://www.ncbi.nlm.nih.gov/pubmed/15679890
http://dx.doi.org/10.1186/1741-7007-3-2
_version_ 1782122403768303616
author Jørgensen, Frank Grønlund
Hobolth, Asger
Hornshøj, Henrik
Bendixen, Christian
Fredholm, Merete
Schierup, Mikkel Heide
author_facet Jørgensen, Frank Grønlund
Hobolth, Asger
Hornshøj, Henrik
Bendixen, Christian
Fredholm, Merete
Schierup, Mikkel Heide
author_sort Jørgensen, Frank Grønlund
collection PubMed
description BACKGROUND: The availability of abundant sequence data from key model organisms has made large scale studies of molecular evolution an exciting possibility. Here we use full length cDNA alignments comprising more than 700,000 nucleotides from human, mouse, pig and the Japanese pufferfish Fugu rubrices in order to investigate 1) the relationships between three major lineages of mammals: rodents, artiodactyls and primates, and 2) the rate of evolution and the occurrence of positive Darwinian selection using codon based models of sequence evolution. RESULTS: We provide evidence that the evolutionary splits among primates, rodents and artiodactyls happened shortly after each other, with most gene trees favouring a topology with rodents as outgroup to primates and artiodactyls. Using an unrooted topology of the three mammalian species we show that since their diversification, the pig and mouse lineages have on average experienced 1.44 and 2.86 times as many synonymous substitutions as humans, respectively, whereas the rates of non-synonymous substitutions are more similar. The analysis shows the highest average dN/dS ratio in the human lineage, followed by the pig and then the mouse lineages. Using codon based models we detect signals of positive Darwinian selection in approximately 5.3%, 4.9% and 6.0% of the genes on the human, pig and mouse lineages respectively. Approximately 16.8% of all the genes studied here are not currently annotated as functional genes in humans. Our analyses indicate that a large fraction of these genes may have lost their function quite recently or may still be functional genes in some or all of the three mammalian species. CONCLUSIONS: We present a comparative analysis of protein coding genes from three major mammalian lineages. Our study demonstrates the usefulness of codon-based likelihood models in detecting selection and it illustrates the value of sequencing organisms at different phylogenetic distances for comparative studies.
format Text
id pubmed-549206
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-5492062005-02-23 Comparative analysis of protein coding sequences from human, mouse and the domesticated pig Jørgensen, Frank Grønlund Hobolth, Asger Hornshøj, Henrik Bendixen, Christian Fredholm, Merete Schierup, Mikkel Heide BMC Biol Research Article BACKGROUND: The availability of abundant sequence data from key model organisms has made large scale studies of molecular evolution an exciting possibility. Here we use full length cDNA alignments comprising more than 700,000 nucleotides from human, mouse, pig and the Japanese pufferfish Fugu rubrices in order to investigate 1) the relationships between three major lineages of mammals: rodents, artiodactyls and primates, and 2) the rate of evolution and the occurrence of positive Darwinian selection using codon based models of sequence evolution. RESULTS: We provide evidence that the evolutionary splits among primates, rodents and artiodactyls happened shortly after each other, with most gene trees favouring a topology with rodents as outgroup to primates and artiodactyls. Using an unrooted topology of the three mammalian species we show that since their diversification, the pig and mouse lineages have on average experienced 1.44 and 2.86 times as many synonymous substitutions as humans, respectively, whereas the rates of non-synonymous substitutions are more similar. The analysis shows the highest average dN/dS ratio in the human lineage, followed by the pig and then the mouse lineages. Using codon based models we detect signals of positive Darwinian selection in approximately 5.3%, 4.9% and 6.0% of the genes on the human, pig and mouse lineages respectively. Approximately 16.8% of all the genes studied here are not currently annotated as functional genes in humans. Our analyses indicate that a large fraction of these genes may have lost their function quite recently or may still be functional genes in some or all of the three mammalian species. CONCLUSIONS: We present a comparative analysis of protein coding genes from three major mammalian lineages. Our study demonstrates the usefulness of codon-based likelihood models in detecting selection and it illustrates the value of sequencing organisms at different phylogenetic distances for comparative studies. BioMed Central 2005-01-28 /pmc/articles/PMC549206/ /pubmed/15679890 http://dx.doi.org/10.1186/1741-7007-3-2 Text en Copyright © 2005 Jørgensen et al; licensee BioMed Central Ltd.
spellingShingle Research Article
Jørgensen, Frank Grønlund
Hobolth, Asger
Hornshøj, Henrik
Bendixen, Christian
Fredholm, Merete
Schierup, Mikkel Heide
Comparative analysis of protein coding sequences from human, mouse and the domesticated pig
title Comparative analysis of protein coding sequences from human, mouse and the domesticated pig
title_full Comparative analysis of protein coding sequences from human, mouse and the domesticated pig
title_fullStr Comparative analysis of protein coding sequences from human, mouse and the domesticated pig
title_full_unstemmed Comparative analysis of protein coding sequences from human, mouse and the domesticated pig
title_short Comparative analysis of protein coding sequences from human, mouse and the domesticated pig
title_sort comparative analysis of protein coding sequences from human, mouse and the domesticated pig
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC549206/
https://www.ncbi.nlm.nih.gov/pubmed/15679890
http://dx.doi.org/10.1186/1741-7007-3-2
work_keys_str_mv AT jørgensenfrankgrønlund comparativeanalysisofproteincodingsequencesfromhumanmouseandthedomesticatedpig
AT hobolthasger comparativeanalysisofproteincodingsequencesfromhumanmouseandthedomesticatedpig
AT hornshøjhenrik comparativeanalysisofproteincodingsequencesfromhumanmouseandthedomesticatedpig
AT bendixenchristian comparativeanalysisofproteincodingsequencesfromhumanmouseandthedomesticatedpig
AT fredholmmerete comparativeanalysisofproteincodingsequencesfromhumanmouseandthedomesticatedpig
AT schierupmikkelheide comparativeanalysisofproteincodingsequencesfromhumanmouseandthedomesticatedpig