Cargando…

Unraveling the hidden universe of small proteins in bacterial genomes

Identification of small open reading frames (smORFs) encoding small proteins (≤ 100 amino acids; SEPs) is a challenge in the fields of genome annotation and protein discovery. Here, by combining a novel bioinformatics tool (RanSEPs) with “‐omics” approaches, we were able to describe 109 bacterial sm...

Descripción completa

Detalles Bibliográficos
Autores principales: Miravet‐Verde, Samuel, Ferrar, Tony, Espadas‐García, Guadalupe, Mazzolini, Rocco, Gharrab, Anas, Sabido, Eduard, Serrano, Luis, Lluch‐Senar, Maria
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6385055/
https://www.ncbi.nlm.nih.gov/pubmed/30796087
http://dx.doi.org/10.15252/msb.20188290
_version_ 1783397120765591552
author Miravet‐Verde, Samuel
Ferrar, Tony
Espadas‐García, Guadalupe
Mazzolini, Rocco
Gharrab, Anas
Sabido, Eduard
Serrano, Luis
Lluch‐Senar, Maria
author_facet Miravet‐Verde, Samuel
Ferrar, Tony
Espadas‐García, Guadalupe
Mazzolini, Rocco
Gharrab, Anas
Sabido, Eduard
Serrano, Luis
Lluch‐Senar, Maria
author_sort Miravet‐Verde, Samuel
collection PubMed
description Identification of small open reading frames (smORFs) encoding small proteins (≤ 100 amino acids; SEPs) is a challenge in the fields of genome annotation and protein discovery. Here, by combining a novel bioinformatics tool (RanSEPs) with “‐omics” approaches, we were able to describe 109 bacterial small ORFomes. Predictions were first validated by performing an exhaustive search of SEPs present in Mycoplasma pneumoniae proteome via mass spectrometry, which illustrated the limitations of shotgun approaches. Then, RanSEPs predictions were validated and compared with other tools using proteomic datasets from different bacterial species and SEPs from the literature. We found that up to 16 ± 9% of proteins in an organism could be classified as SEPs. Integration of RanSEPs predictions with transcriptomics data showed that some annotated non‐coding RNAs could in fact encode for SEPs. A functional study of SEPs highlighted an enrichment in the membrane, translation, metabolism, and nucleotide‐binding categories. Additionally, 9.7% of the SEPs included a N‐terminus predicted signal peptide. We envision RanSEPs as a tool to unmask the hidden universe of small bacterial proteins.
format Online
Article
Text
id pubmed-6385055
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-63850552019-03-01 Unraveling the hidden universe of small proteins in bacterial genomes Miravet‐Verde, Samuel Ferrar, Tony Espadas‐García, Guadalupe Mazzolini, Rocco Gharrab, Anas Sabido, Eduard Serrano, Luis Lluch‐Senar, Maria Mol Syst Biol Articles Identification of small open reading frames (smORFs) encoding small proteins (≤ 100 amino acids; SEPs) is a challenge in the fields of genome annotation and protein discovery. Here, by combining a novel bioinformatics tool (RanSEPs) with “‐omics” approaches, we were able to describe 109 bacterial small ORFomes. Predictions were first validated by performing an exhaustive search of SEPs present in Mycoplasma pneumoniae proteome via mass spectrometry, which illustrated the limitations of shotgun approaches. Then, RanSEPs predictions were validated and compared with other tools using proteomic datasets from different bacterial species and SEPs from the literature. We found that up to 16 ± 9% of proteins in an organism could be classified as SEPs. Integration of RanSEPs predictions with transcriptomics data showed that some annotated non‐coding RNAs could in fact encode for SEPs. A functional study of SEPs highlighted an enrichment in the membrane, translation, metabolism, and nucleotide‐binding categories. Additionally, 9.7% of the SEPs included a N‐terminus predicted signal peptide. We envision RanSEPs as a tool to unmask the hidden universe of small bacterial proteins. John Wiley and Sons Inc. 2019-02-22 /pmc/articles/PMC6385055/ /pubmed/30796087 http://dx.doi.org/10.15252/msb.20188290 Text en © 2019 The Authors. Published under the terms of the CC BY 4.0 license This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Miravet‐Verde, Samuel
Ferrar, Tony
Espadas‐García, Guadalupe
Mazzolini, Rocco
Gharrab, Anas
Sabido, Eduard
Serrano, Luis
Lluch‐Senar, Maria
Unraveling the hidden universe of small proteins in bacterial genomes
title Unraveling the hidden universe of small proteins in bacterial genomes
title_full Unraveling the hidden universe of small proteins in bacterial genomes
title_fullStr Unraveling the hidden universe of small proteins in bacterial genomes
title_full_unstemmed Unraveling the hidden universe of small proteins in bacterial genomes
title_short Unraveling the hidden universe of small proteins in bacterial genomes
title_sort unraveling the hidden universe of small proteins in bacterial genomes
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6385055/
https://www.ncbi.nlm.nih.gov/pubmed/30796087
http://dx.doi.org/10.15252/msb.20188290
work_keys_str_mv AT miravetverdesamuel unravelingthehiddenuniverseofsmallproteinsinbacterialgenomes
AT ferrartony unravelingthehiddenuniverseofsmallproteinsinbacterialgenomes
AT espadasgarciaguadalupe unravelingthehiddenuniverseofsmallproteinsinbacterialgenomes
AT mazzolinirocco unravelingthehiddenuniverseofsmallproteinsinbacterialgenomes
AT gharrabanas unravelingthehiddenuniverseofsmallproteinsinbacterialgenomes
AT sabidoeduard unravelingthehiddenuniverseofsmallproteinsinbacterialgenomes
AT serranoluis unravelingthehiddenuniverseofsmallproteinsinbacterialgenomes
AT lluchsenarmaria unravelingthehiddenuniverseofsmallproteinsinbacterialgenomes