Cargando…

Conservation analysis of the CydX protein yields insights into small protein identification and evolution

BACKGROUND: The reliable identification of proteins containing 50 or fewer amino acids is difficult due to the limited information content in short sequences. The 37 amino acid CydX protein in Escherichia coli is a member of the cytochrome bd oxidase complex, an enzyme found throughout Eubacteria. T...

Descripción completa

Detalles Bibliográficos
Autores principales: Allen, Rondine J, Brenner, Evan P, VanOrsdel, Caitlin E, Hobson, Jessica J, Hearn, David J, Hemm, Matthew R
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4325964/
https://www.ncbi.nlm.nih.gov/pubmed/25475368
http://dx.doi.org/10.1186/1471-2164-15-946
_version_ 1782356876525043712
author Allen, Rondine J
Brenner, Evan P
VanOrsdel, Caitlin E
Hobson, Jessica J
Hearn, David J
Hemm, Matthew R
author_facet Allen, Rondine J
Brenner, Evan P
VanOrsdel, Caitlin E
Hobson, Jessica J
Hearn, David J
Hemm, Matthew R
author_sort Allen, Rondine J
collection PubMed
description BACKGROUND: The reliable identification of proteins containing 50 or fewer amino acids is difficult due to the limited information content in short sequences. The 37 amino acid CydX protein in Escherichia coli is a member of the cytochrome bd oxidase complex, an enzyme found throughout Eubacteria. To investigate the extent of CydX conservation and prevalence and evaluate different methods of small protein homologue identification, we surveyed 1095 Eubacteria species for the presence of the small protein. RESULTS: Over 300 homologues were identified, including 80 unannotated genes. The ability of both closely-related and divergent homologues to complement the E. coli ΔcydX mutant supports our identification techniques, and suggests that CydX homologues retain similar function among divergent species. However, sequence analysis of these proteins shows a great degree of variability, with only a few highly-conserved residues. An analysis of the co-variation between CydX homologues and their corresponding cydA and cydB genes shows a close synteny of the small protein with the CydA long Q-loop. Phylogenetic analysis suggests that the cydABX operon has undergone horizontal gene transfer, although the cydX gene likely evolved in a progenitor of the Alpha, Beta, and Gammaproteobacteria. Further investigation of cydAB operons identified two additional conserved hypothetical small proteins: CydY encoded in CydA(Qlong) operons that lack cydX, and CydZ encoded in more than 150 CydA(Qshort) operons. CONCLUSIONS: This study provides a systematic analysis of bioinformatics techniques required for the unique challenges present in small protein identification and phylogenetic analyses. These results elucidate the prevalence of CydX throughout the Proteobacteria, provide insight into the selection pressure and sequence requirements for CydX function, and suggest a potential functional interaction between the small protein and the CydA Q-loop, an enigmatic domain of the cytochrome bd oxidase complex. Finally, these results identify other conserved small proteins encoded in cytochrome bd oxidase operons, suggesting that small protein subunits may be a more common component of these enzymes than previously thought. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-946) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4325964
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43259642015-02-13 Conservation analysis of the CydX protein yields insights into small protein identification and evolution Allen, Rondine J Brenner, Evan P VanOrsdel, Caitlin E Hobson, Jessica J Hearn, David J Hemm, Matthew R BMC Genomics Research Article BACKGROUND: The reliable identification of proteins containing 50 or fewer amino acids is difficult due to the limited information content in short sequences. The 37 amino acid CydX protein in Escherichia coli is a member of the cytochrome bd oxidase complex, an enzyme found throughout Eubacteria. To investigate the extent of CydX conservation and prevalence and evaluate different methods of small protein homologue identification, we surveyed 1095 Eubacteria species for the presence of the small protein. RESULTS: Over 300 homologues were identified, including 80 unannotated genes. The ability of both closely-related and divergent homologues to complement the E. coli ΔcydX mutant supports our identification techniques, and suggests that CydX homologues retain similar function among divergent species. However, sequence analysis of these proteins shows a great degree of variability, with only a few highly-conserved residues. An analysis of the co-variation between CydX homologues and their corresponding cydA and cydB genes shows a close synteny of the small protein with the CydA long Q-loop. Phylogenetic analysis suggests that the cydABX operon has undergone horizontal gene transfer, although the cydX gene likely evolved in a progenitor of the Alpha, Beta, and Gammaproteobacteria. Further investigation of cydAB operons identified two additional conserved hypothetical small proteins: CydY encoded in CydA(Qlong) operons that lack cydX, and CydZ encoded in more than 150 CydA(Qshort) operons. CONCLUSIONS: This study provides a systematic analysis of bioinformatics techniques required for the unique challenges present in small protein identification and phylogenetic analyses. These results elucidate the prevalence of CydX throughout the Proteobacteria, provide insight into the selection pressure and sequence requirements for CydX function, and suggest a potential functional interaction between the small protein and the CydA Q-loop, an enigmatic domain of the cytochrome bd oxidase complex. Finally, these results identify other conserved small proteins encoded in cytochrome bd oxidase operons, suggesting that small protein subunits may be a more common component of these enzymes than previously thought. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-946) contains supplementary material, which is available to authorized users. BioMed Central 2014-12-05 /pmc/articles/PMC4325964/ /pubmed/25475368 http://dx.doi.org/10.1186/1471-2164-15-946 Text en © Allen et al.; licensee BioMed Central. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Allen, Rondine J
Brenner, Evan P
VanOrsdel, Caitlin E
Hobson, Jessica J
Hearn, David J
Hemm, Matthew R
Conservation analysis of the CydX protein yields insights into small protein identification and evolution
title Conservation analysis of the CydX protein yields insights into small protein identification and evolution
title_full Conservation analysis of the CydX protein yields insights into small protein identification and evolution
title_fullStr Conservation analysis of the CydX protein yields insights into small protein identification and evolution
title_full_unstemmed Conservation analysis of the CydX protein yields insights into small protein identification and evolution
title_short Conservation analysis of the CydX protein yields insights into small protein identification and evolution
title_sort conservation analysis of the cydx protein yields insights into small protein identification and evolution
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4325964/
https://www.ncbi.nlm.nih.gov/pubmed/25475368
http://dx.doi.org/10.1186/1471-2164-15-946
work_keys_str_mv AT allenrondinej conservationanalysisofthecydxproteinyieldsinsightsintosmallproteinidentificationandevolution
AT brennerevanp conservationanalysisofthecydxproteinyieldsinsightsintosmallproteinidentificationandevolution
AT vanorsdelcaitline conservationanalysisofthecydxproteinyieldsinsightsintosmallproteinidentificationandevolution
AT hobsonjessicaj conservationanalysisofthecydxproteinyieldsinsightsintosmallproteinidentificationandevolution
AT hearndavidj conservationanalysisofthecydxproteinyieldsinsightsintosmallproteinidentificationandevolution
AT hemmmatthewr conservationanalysisofthecydxproteinyieldsinsightsintosmallproteinidentificationandevolution