Cargando…

Translatomics combined with transcriptomics and proteomics reveals novel functional, recently evolved orphan genes in Escherichia coli O157:H7 (EHEC)

BACKGROUND: Genomes of E. coli, including that of the human pathogen Escherichia coli O157:H7 (EHEC) EDL933, still harbor undetected protein-coding genes which, apparently, have escaped annotation due to their small size and non-essential function. To find such genes, global gene expression of EHEC...

Descripción completa

Detalles Bibliográficos
Autores principales: Neuhaus, Klaus, Landstorfer, Richard, Fellner, Lea, Simon, Svenja, Schafferhans, Andrea, Goldberg, Tatyana, Marx, Harald, Ozoline, Olga N., Rost, Burkhard, Kuster, Bernhard, Keim, Daniel A., Scherer, Siegfried
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4765031/
https://www.ncbi.nlm.nih.gov/pubmed/26911138
http://dx.doi.org/10.1186/s12864-016-2456-1
_version_ 1782417486631665664
author Neuhaus, Klaus
Landstorfer, Richard
Fellner, Lea
Simon, Svenja
Schafferhans, Andrea
Goldberg, Tatyana
Marx, Harald
Ozoline, Olga N.
Rost, Burkhard
Kuster, Bernhard
Keim, Daniel A.
Scherer, Siegfried
author_facet Neuhaus, Klaus
Landstorfer, Richard
Fellner, Lea
Simon, Svenja
Schafferhans, Andrea
Goldberg, Tatyana
Marx, Harald
Ozoline, Olga N.
Rost, Burkhard
Kuster, Bernhard
Keim, Daniel A.
Scherer, Siegfried
author_sort Neuhaus, Klaus
collection PubMed
description BACKGROUND: Genomes of E. coli, including that of the human pathogen Escherichia coli O157:H7 (EHEC) EDL933, still harbor undetected protein-coding genes which, apparently, have escaped annotation due to their small size and non-essential function. To find such genes, global gene expression of EHEC EDL933 was examined, using strand-specific RNAseq (transcriptome), ribosomal footprinting (translatome) and mass spectrometry (proteome). RESULTS: Using the above methods, 72 short, non-annotated protein-coding genes were detected. All of these showed signals in the ribosomal footprinting assay indicating mRNA translation. Seven were verified by mass spectrometry. Fifty-seven genes are annotated in other enterobacteriaceae, mainly as hypothetical genes; the remaining 15 genes constitute novel discoveries. In addition, protein structure and function were predicted computationally and compared between EHEC-encoded proteins and 100-times randomly shuffled proteins. Based on this comparison, 61 of the 72 novel proteins exhibit predicted structural and functional features similar to those of annotated proteins. Many of the novel genes show differential transcription when grown under eleven diverse growth conditions suggesting environmental regulation. Three genes were found to confer a phenotype in previous studies, e.g., decreased cattle colonization. CONCLUSIONS: These findings demonstrate that ribosomal footprinting can be used to detect novel protein coding genes, contributing to the growing body of evidence that hypothetical genes are not annotation artifacts and opening an additional way to study their functionality. All 72 genes are taxonomically restricted and, therefore, appear to have evolved relatively recently de novo. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2456-1) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4765031
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-47650312016-02-25 Translatomics combined with transcriptomics and proteomics reveals novel functional, recently evolved orphan genes in Escherichia coli O157:H7 (EHEC) Neuhaus, Klaus Landstorfer, Richard Fellner, Lea Simon, Svenja Schafferhans, Andrea Goldberg, Tatyana Marx, Harald Ozoline, Olga N. Rost, Burkhard Kuster, Bernhard Keim, Daniel A. Scherer, Siegfried BMC Genomics Research Article BACKGROUND: Genomes of E. coli, including that of the human pathogen Escherichia coli O157:H7 (EHEC) EDL933, still harbor undetected protein-coding genes which, apparently, have escaped annotation due to their small size and non-essential function. To find such genes, global gene expression of EHEC EDL933 was examined, using strand-specific RNAseq (transcriptome), ribosomal footprinting (translatome) and mass spectrometry (proteome). RESULTS: Using the above methods, 72 short, non-annotated protein-coding genes were detected. All of these showed signals in the ribosomal footprinting assay indicating mRNA translation. Seven were verified by mass spectrometry. Fifty-seven genes are annotated in other enterobacteriaceae, mainly as hypothetical genes; the remaining 15 genes constitute novel discoveries. In addition, protein structure and function were predicted computationally and compared between EHEC-encoded proteins and 100-times randomly shuffled proteins. Based on this comparison, 61 of the 72 novel proteins exhibit predicted structural and functional features similar to those of annotated proteins. Many of the novel genes show differential transcription when grown under eleven diverse growth conditions suggesting environmental regulation. Three genes were found to confer a phenotype in previous studies, e.g., decreased cattle colonization. CONCLUSIONS: These findings demonstrate that ribosomal footprinting can be used to detect novel protein coding genes, contributing to the growing body of evidence that hypothetical genes are not annotation artifacts and opening an additional way to study their functionality. All 72 genes are taxonomically restricted and, therefore, appear to have evolved relatively recently de novo. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2456-1) contains supplementary material, which is available to authorized users. BioMed Central 2016-02-24 /pmc/articles/PMC4765031/ /pubmed/26911138 http://dx.doi.org/10.1186/s12864-016-2456-1 Text en © Neuhaus et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Neuhaus, Klaus
Landstorfer, Richard
Fellner, Lea
Simon, Svenja
Schafferhans, Andrea
Goldberg, Tatyana
Marx, Harald
Ozoline, Olga N.
Rost, Burkhard
Kuster, Bernhard
Keim, Daniel A.
Scherer, Siegfried
Translatomics combined with transcriptomics and proteomics reveals novel functional, recently evolved orphan genes in Escherichia coli O157:H7 (EHEC)
title Translatomics combined with transcriptomics and proteomics reveals novel functional, recently evolved orphan genes in Escherichia coli O157:H7 (EHEC)
title_full Translatomics combined with transcriptomics and proteomics reveals novel functional, recently evolved orphan genes in Escherichia coli O157:H7 (EHEC)
title_fullStr Translatomics combined with transcriptomics and proteomics reveals novel functional, recently evolved orphan genes in Escherichia coli O157:H7 (EHEC)
title_full_unstemmed Translatomics combined with transcriptomics and proteomics reveals novel functional, recently evolved orphan genes in Escherichia coli O157:H7 (EHEC)
title_short Translatomics combined with transcriptomics and proteomics reveals novel functional, recently evolved orphan genes in Escherichia coli O157:H7 (EHEC)
title_sort translatomics combined with transcriptomics and proteomics reveals novel functional, recently evolved orphan genes in escherichia coli o157:h7 (ehec)
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4765031/
https://www.ncbi.nlm.nih.gov/pubmed/26911138
http://dx.doi.org/10.1186/s12864-016-2456-1
work_keys_str_mv AT neuhausklaus translatomicscombinedwithtranscriptomicsandproteomicsrevealsnovelfunctionalrecentlyevolvedorphangenesinescherichiacolio157h7ehec
AT landstorferrichard translatomicscombinedwithtranscriptomicsandproteomicsrevealsnovelfunctionalrecentlyevolvedorphangenesinescherichiacolio157h7ehec
AT fellnerlea translatomicscombinedwithtranscriptomicsandproteomicsrevealsnovelfunctionalrecentlyevolvedorphangenesinescherichiacolio157h7ehec
AT simonsvenja translatomicscombinedwithtranscriptomicsandproteomicsrevealsnovelfunctionalrecentlyevolvedorphangenesinescherichiacolio157h7ehec
AT schafferhansandrea translatomicscombinedwithtranscriptomicsandproteomicsrevealsnovelfunctionalrecentlyevolvedorphangenesinescherichiacolio157h7ehec
AT goldbergtatyana translatomicscombinedwithtranscriptomicsandproteomicsrevealsnovelfunctionalrecentlyevolvedorphangenesinescherichiacolio157h7ehec
AT marxharald translatomicscombinedwithtranscriptomicsandproteomicsrevealsnovelfunctionalrecentlyevolvedorphangenesinescherichiacolio157h7ehec
AT ozolineolgan translatomicscombinedwithtranscriptomicsandproteomicsrevealsnovelfunctionalrecentlyevolvedorphangenesinescherichiacolio157h7ehec
AT rostburkhard translatomicscombinedwithtranscriptomicsandproteomicsrevealsnovelfunctionalrecentlyevolvedorphangenesinescherichiacolio157h7ehec
AT kusterbernhard translatomicscombinedwithtranscriptomicsandproteomicsrevealsnovelfunctionalrecentlyevolvedorphangenesinescherichiacolio157h7ehec
AT keimdaniela translatomicscombinedwithtranscriptomicsandproteomicsrevealsnovelfunctionalrecentlyevolvedorphangenesinescherichiacolio157h7ehec
AT scherersiegfried translatomicscombinedwithtranscriptomicsandproteomicsrevealsnovelfunctionalrecentlyevolvedorphangenesinescherichiacolio157h7ehec