Cargando…
Evolutionary patterns of amino acid substitutions in 12 Drosophila genomes
BACKGROUND: Harnessing vast amounts of genomic data in phylogenetic context stemming from massive sequencing of multiple closely related genomes requires new tools and approaches. We present a tool for the genome-wide analysis of frequencies and patterns of amino acid substitutions in multiple align...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3005911/ https://www.ncbi.nlm.nih.gov/pubmed/21143793 http://dx.doi.org/10.1186/1471-2164-11-S4-S10 |
_version_ | 1782194144878264320 |
---|---|
author | Yampolsky, Lev Y Bouzinier, Michael A |
author_facet | Yampolsky, Lev Y Bouzinier, Michael A |
author_sort | Yampolsky, Lev Y |
collection | PubMed |
description | BACKGROUND: Harnessing vast amounts of genomic data in phylogenetic context stemming from massive sequencing of multiple closely related genomes requires new tools and approaches. We present a tool for the genome-wide analysis of frequencies and patterns of amino acid substitutions in multiple alignments of genes’ coding regions, and a database of amino acid substitutions in the phylogeny of 12 Drosophila genomes. We illustrate the use of these resources to address three types of evolutionary genomics questions: about fluxes in amino acid composition in proteins, about asymmetries in amino acid substitutions and about patterns of molecular evolution in duplicated genes. RESULTS: We demonstrate that amino acid composition of Drosophila proteins underwent a significant shift over the last 70 million years encompassed by the studied phylogeny, with less common amino acids (Cys, Met, His) increasing in frequency and more common ones (Ala, Leu, Glu) becoming less frequent. These fluxes are strongly correlated with polarity of source and destination amino acids, resulting in overall systematic decrease of mean polarity of amino acids found in Drosophila proteins. Frequency and radicality of amino acid substitutions are higher in paralogs than in orthologous single-copy genes and are higher in gene families with paralogs than in gene families without surviving duplications. Rate and radicality of substitutions, as expected, are negatively correlated with overall level and uniformity of gene expression. However, these correlations are not observed for substitutions occurring in duplicated genes, indicating a different selective constraint on the evolution of paralogous sequences. Clades resulting from duplications show a marked asymmetry in rate and radicality of amino acid substitutions, possibly a signal of widespread neofunctionalization. These patterns differ among protein families of different functionality, with genes coding for RNA-binding proteins differing from most other functional groups in terms of amino acid substitution patterns in duplicated and single-copy genes. CONCLUSIONS: We demonstrate that deep phylogenetic analysis of amino acid substitutions can reveal interesting genome-wide patterns. Amino acid composition of drosophilid proteins is shaped by fluxes similar to those previously observed in prokaryotic, yeast and mammalian genomes, indicating globally present patterns. Increased frequency and radicality of amino acid substitutions in duplicated genes and the presence of asymmetry of these parameters between paralogous clades indicate widespread neofunctionalization among paralogs as the mechanism of duplication retention. |
format | Text |
id | pubmed-3005911 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-30059112010-12-22 Evolutionary patterns of amino acid substitutions in 12 Drosophila genomes Yampolsky, Lev Y Bouzinier, Michael A BMC Genomics Proceedings BACKGROUND: Harnessing vast amounts of genomic data in phylogenetic context stemming from massive sequencing of multiple closely related genomes requires new tools and approaches. We present a tool for the genome-wide analysis of frequencies and patterns of amino acid substitutions in multiple alignments of genes’ coding regions, and a database of amino acid substitutions in the phylogeny of 12 Drosophila genomes. We illustrate the use of these resources to address three types of evolutionary genomics questions: about fluxes in amino acid composition in proteins, about asymmetries in amino acid substitutions and about patterns of molecular evolution in duplicated genes. RESULTS: We demonstrate that amino acid composition of Drosophila proteins underwent a significant shift over the last 70 million years encompassed by the studied phylogeny, with less common amino acids (Cys, Met, His) increasing in frequency and more common ones (Ala, Leu, Glu) becoming less frequent. These fluxes are strongly correlated with polarity of source and destination amino acids, resulting in overall systematic decrease of mean polarity of amino acids found in Drosophila proteins. Frequency and radicality of amino acid substitutions are higher in paralogs than in orthologous single-copy genes and are higher in gene families with paralogs than in gene families without surviving duplications. Rate and radicality of substitutions, as expected, are negatively correlated with overall level and uniformity of gene expression. However, these correlations are not observed for substitutions occurring in duplicated genes, indicating a different selective constraint on the evolution of paralogous sequences. Clades resulting from duplications show a marked asymmetry in rate and radicality of amino acid substitutions, possibly a signal of widespread neofunctionalization. These patterns differ among protein families of different functionality, with genes coding for RNA-binding proteins differing from most other functional groups in terms of amino acid substitution patterns in duplicated and single-copy genes. CONCLUSIONS: We demonstrate that deep phylogenetic analysis of amino acid substitutions can reveal interesting genome-wide patterns. Amino acid composition of drosophilid proteins is shaped by fluxes similar to those previously observed in prokaryotic, yeast and mammalian genomes, indicating globally present patterns. Increased frequency and radicality of amino acid substitutions in duplicated genes and the presence of asymmetry of these parameters between paralogous clades indicate widespread neofunctionalization among paralogs as the mechanism of duplication retention. BioMed Central 2010-12-02 /pmc/articles/PMC3005911/ /pubmed/21143793 http://dx.doi.org/10.1186/1471-2164-11-S4-S10 Text en Copyright ©2010 Yampolsky and Bouzinier; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Yampolsky, Lev Y Bouzinier, Michael A Evolutionary patterns of amino acid substitutions in 12 Drosophila genomes |
title | Evolutionary patterns of amino acid substitutions in 12 Drosophila genomes |
title_full | Evolutionary patterns of amino acid substitutions in 12 Drosophila genomes |
title_fullStr | Evolutionary patterns of amino acid substitutions in 12 Drosophila genomes |
title_full_unstemmed | Evolutionary patterns of amino acid substitutions in 12 Drosophila genomes |
title_short | Evolutionary patterns of amino acid substitutions in 12 Drosophila genomes |
title_sort | evolutionary patterns of amino acid substitutions in 12 drosophila genomes |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3005911/ https://www.ncbi.nlm.nih.gov/pubmed/21143793 http://dx.doi.org/10.1186/1471-2164-11-S4-S10 |
work_keys_str_mv | AT yampolskylevy evolutionarypatternsofaminoacidsubstitutionsin12drosophilagenomes AT bouziniermichaela evolutionarypatternsofaminoacidsubstitutionsin12drosophilagenomes |