Cargando…

Evolutionary patterns of amino acid substitutions in 12 Drosophila genomes

BACKGROUND: Harnessing vast amounts of genomic data in phylogenetic context stemming from massive sequencing of multiple closely related genomes requires new tools and approaches. We present a tool for the genome-wide analysis of frequencies and patterns of amino acid substitutions in multiple align...

Descripción completa

Detalles Bibliográficos
Autores principales: Yampolsky, Lev Y, Bouzinier, Michael A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3005911/
https://www.ncbi.nlm.nih.gov/pubmed/21143793
http://dx.doi.org/10.1186/1471-2164-11-S4-S10
_version_ 1782194144878264320
author Yampolsky, Lev Y
Bouzinier, Michael A
author_facet Yampolsky, Lev Y
Bouzinier, Michael A
author_sort Yampolsky, Lev Y
collection PubMed
description BACKGROUND: Harnessing vast amounts of genomic data in phylogenetic context stemming from massive sequencing of multiple closely related genomes requires new tools and approaches. We present a tool for the genome-wide analysis of frequencies and patterns of amino acid substitutions in multiple alignments of genes’ coding regions, and a database of amino acid substitutions in the phylogeny of 12 Drosophila genomes. We illustrate the use of these resources to address three types of evolutionary genomics questions: about fluxes in amino acid composition in proteins, about asymmetries in amino acid substitutions and about patterns of molecular evolution in duplicated genes. RESULTS: We demonstrate that amino acid composition of Drosophila proteins underwent a significant shift over the last 70 million years encompassed by the studied phylogeny, with less common amino acids (Cys, Met, His) increasing in frequency and more common ones (Ala, Leu, Glu) becoming less frequent. These fluxes are strongly correlated with polarity of source and destination amino acids, resulting in overall systematic decrease of mean polarity of amino acids found in Drosophila proteins. Frequency and radicality of amino acid substitutions are higher in paralogs than in orthologous single-copy genes and are higher in gene families with paralogs than in gene families without surviving duplications. Rate and radicality of substitutions, as expected, are negatively correlated with overall level and uniformity of gene expression. However, these correlations are not observed for substitutions occurring in duplicated genes, indicating a different selective constraint on the evolution of paralogous sequences. Clades resulting from duplications show a marked asymmetry in rate and radicality of amino acid substitutions, possibly a signal of widespread neofunctionalization. These patterns differ among protein families of different functionality, with genes coding for RNA-binding proteins differing from most other functional groups in terms of amino acid substitution patterns in duplicated and single-copy genes. CONCLUSIONS: We demonstrate that deep phylogenetic analysis of amino acid substitutions can reveal interesting genome-wide patterns. Amino acid composition of drosophilid proteins is shaped by fluxes similar to those previously observed in prokaryotic, yeast and mammalian genomes, indicating globally present patterns. Increased frequency and radicality of amino acid substitutions in duplicated genes and the presence of asymmetry of these parameters between paralogous clades indicate widespread neofunctionalization among paralogs as the mechanism of duplication retention.
format Text
id pubmed-3005911
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30059112010-12-22 Evolutionary patterns of amino acid substitutions in 12 Drosophila genomes Yampolsky, Lev Y Bouzinier, Michael A BMC Genomics Proceedings BACKGROUND: Harnessing vast amounts of genomic data in phylogenetic context stemming from massive sequencing of multiple closely related genomes requires new tools and approaches. We present a tool for the genome-wide analysis of frequencies and patterns of amino acid substitutions in multiple alignments of genes’ coding regions, and a database of amino acid substitutions in the phylogeny of 12 Drosophila genomes. We illustrate the use of these resources to address three types of evolutionary genomics questions: about fluxes in amino acid composition in proteins, about asymmetries in amino acid substitutions and about patterns of molecular evolution in duplicated genes. RESULTS: We demonstrate that amino acid composition of Drosophila proteins underwent a significant shift over the last 70 million years encompassed by the studied phylogeny, with less common amino acids (Cys, Met, His) increasing in frequency and more common ones (Ala, Leu, Glu) becoming less frequent. These fluxes are strongly correlated with polarity of source and destination amino acids, resulting in overall systematic decrease of mean polarity of amino acids found in Drosophila proteins. Frequency and radicality of amino acid substitutions are higher in paralogs than in orthologous single-copy genes and are higher in gene families with paralogs than in gene families without surviving duplications. Rate and radicality of substitutions, as expected, are negatively correlated with overall level and uniformity of gene expression. However, these correlations are not observed for substitutions occurring in duplicated genes, indicating a different selective constraint on the evolution of paralogous sequences. Clades resulting from duplications show a marked asymmetry in rate and radicality of amino acid substitutions, possibly a signal of widespread neofunctionalization. These patterns differ among protein families of different functionality, with genes coding for RNA-binding proteins differing from most other functional groups in terms of amino acid substitution patterns in duplicated and single-copy genes. CONCLUSIONS: We demonstrate that deep phylogenetic analysis of amino acid substitutions can reveal interesting genome-wide patterns. Amino acid composition of drosophilid proteins is shaped by fluxes similar to those previously observed in prokaryotic, yeast and mammalian genomes, indicating globally present patterns. Increased frequency and radicality of amino acid substitutions in duplicated genes and the presence of asymmetry of these parameters between paralogous clades indicate widespread neofunctionalization among paralogs as the mechanism of duplication retention. BioMed Central 2010-12-02 /pmc/articles/PMC3005911/ /pubmed/21143793 http://dx.doi.org/10.1186/1471-2164-11-S4-S10 Text en Copyright ©2010 Yampolsky and Bouzinier; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Yampolsky, Lev Y
Bouzinier, Michael A
Evolutionary patterns of amino acid substitutions in 12 Drosophila genomes
title Evolutionary patterns of amino acid substitutions in 12 Drosophila genomes
title_full Evolutionary patterns of amino acid substitutions in 12 Drosophila genomes
title_fullStr Evolutionary patterns of amino acid substitutions in 12 Drosophila genomes
title_full_unstemmed Evolutionary patterns of amino acid substitutions in 12 Drosophila genomes
title_short Evolutionary patterns of amino acid substitutions in 12 Drosophila genomes
title_sort evolutionary patterns of amino acid substitutions in 12 drosophila genomes
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3005911/
https://www.ncbi.nlm.nih.gov/pubmed/21143793
http://dx.doi.org/10.1186/1471-2164-11-S4-S10
work_keys_str_mv AT yampolskylevy evolutionarypatternsofaminoacidsubstitutionsin12drosophilagenomes
AT bouziniermichaela evolutionarypatternsofaminoacidsubstitutionsin12drosophilagenomes