Cargando…

The evolution of the tape measure protein: units, duplications and losses

BACKGROUND: A large family of viruses that infect bacteria, called phages, is characterized by long tails used to inject DNA into their victims' cells. The tape measure protein got its name because the length of the corresponding gene is proportional to the length of the phage's tail: a fa...

Descripción completa

Detalles Bibliográficos
Autores principales: Belcaid, Mahdi, Bergeron, Anne, Poisson, Guylaine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3271669/
https://www.ncbi.nlm.nih.gov/pubmed/22151602
http://dx.doi.org/10.1186/1471-2105-12-S9-S10
_version_ 1782222744781324288
author Belcaid, Mahdi
Bergeron, Anne
Poisson, Guylaine
author_facet Belcaid, Mahdi
Bergeron, Anne
Poisson, Guylaine
author_sort Belcaid, Mahdi
collection PubMed
description BACKGROUND: A large family of viruses that infect bacteria, called phages, is characterized by long tails used to inject DNA into their victims' cells. The tape measure protein got its name because the length of the corresponding gene is proportional to the length of the phage's tail: a fact shown by actually copying or splicing out parts of DNA in exemplar species. A natural question is whether there exist units for these tape measures, and if different tape measures have different units and lengths. Such units would allow us to retrace the evolution of tape measure proteins using their duplication/loss history. The vast number of sequenced phages genomes allows us to attack this problem with a comparative genomics approach. RESULTS: Here we describe a subset of phages whose tape measure proteins contain variable numbers of an 11 amino acids sequence repeat, aligned with sequence similarity, structural properties, and simple arithmetics. This subset provides a unique opportunity for the combinatorial study of phage evolution, without the added uncertainties of multiple alignments, which are trivial in this case, or of protein functions, that are well established. We give a heuristic that reconstructs the duplication history of these sequences, using divergent strains to discriminate between mutations that occurred before and after speciation, or lineage divergence. The heuristic is based on an efficient algorithm that gives an exhaustive enumeration of all possible parsimonious reconstructions of the duplication/speciation history of a single nucleotide. Finally, we present a method that allows, when possible, to discriminate between duplication and loss events. CONCLUSIONS: Establishing the evolutionary history of viruses is difficult, in part due to extensive recombinations and gene transfers, and high mutation rates that often erase detectable similarity between homologous genes. In this paper, we introduce new tools to address this problem.
format Online
Article
Text
id pubmed-3271669
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32716692012-02-04 The evolution of the tape measure protein: units, duplications and losses Belcaid, Mahdi Bergeron, Anne Poisson, Guylaine BMC Bioinformatics Proceedings BACKGROUND: A large family of viruses that infect bacteria, called phages, is characterized by long tails used to inject DNA into their victims' cells. The tape measure protein got its name because the length of the corresponding gene is proportional to the length of the phage's tail: a fact shown by actually copying or splicing out parts of DNA in exemplar species. A natural question is whether there exist units for these tape measures, and if different tape measures have different units and lengths. Such units would allow us to retrace the evolution of tape measure proteins using their duplication/loss history. The vast number of sequenced phages genomes allows us to attack this problem with a comparative genomics approach. RESULTS: Here we describe a subset of phages whose tape measure proteins contain variable numbers of an 11 amino acids sequence repeat, aligned with sequence similarity, structural properties, and simple arithmetics. This subset provides a unique opportunity for the combinatorial study of phage evolution, without the added uncertainties of multiple alignments, which are trivial in this case, or of protein functions, that are well established. We give a heuristic that reconstructs the duplication history of these sequences, using divergent strains to discriminate between mutations that occurred before and after speciation, or lineage divergence. The heuristic is based on an efficient algorithm that gives an exhaustive enumeration of all possible parsimonious reconstructions of the duplication/speciation history of a single nucleotide. Finally, we present a method that allows, when possible, to discriminate between duplication and loss events. CONCLUSIONS: Establishing the evolutionary history of viruses is difficult, in part due to extensive recombinations and gene transfers, and high mutation rates that often erase detectable similarity between homologous genes. In this paper, we introduce new tools to address this problem. BioMed Central 2011-10-05 /pmc/articles/PMC3271669/ /pubmed/22151602 http://dx.doi.org/10.1186/1471-2105-12-S9-S10 Text en Copyright ©2011 Belcaid et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Belcaid, Mahdi
Bergeron, Anne
Poisson, Guylaine
The evolution of the tape measure protein: units, duplications and losses
title The evolution of the tape measure protein: units, duplications and losses
title_full The evolution of the tape measure protein: units, duplications and losses
title_fullStr The evolution of the tape measure protein: units, duplications and losses
title_full_unstemmed The evolution of the tape measure protein: units, duplications and losses
title_short The evolution of the tape measure protein: units, duplications and losses
title_sort evolution of the tape measure protein: units, duplications and losses
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3271669/
https://www.ncbi.nlm.nih.gov/pubmed/22151602
http://dx.doi.org/10.1186/1471-2105-12-S9-S10
work_keys_str_mv AT belcaidmahdi theevolutionofthetapemeasureproteinunitsduplicationsandlosses
AT bergeronanne theevolutionofthetapemeasureproteinunitsduplicationsandlosses
AT poissonguylaine theevolutionofthetapemeasureproteinunitsduplicationsandlosses
AT belcaidmahdi evolutionofthetapemeasureproteinunitsduplicationsandlosses
AT bergeronanne evolutionofthetapemeasureproteinunitsduplicationsandlosses
AT poissonguylaine evolutionofthetapemeasureproteinunitsduplicationsandlosses