Cargando…

Exploring the “dark matter” of a mammalian proteome by protein structure and function modeling

BACKGROUND: A growing body of evidence shows that gene products encoded by short open reading frames play key roles in numerous cellular processes. Yet, they are generally overlooked in genome assembly, escaping annotation because small protein-coding genes are difficult to predict computationally....

Descripción completa

Detalles Bibliográficos
Autor principal: Brylinski, Michal
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3866606/
https://www.ncbi.nlm.nih.gov/pubmed/24321360
http://dx.doi.org/10.1186/1477-5956-11-47
_version_ 1782296189406806016
author Brylinski, Michal
author_facet Brylinski, Michal
author_sort Brylinski, Michal
collection PubMed
description BACKGROUND: A growing body of evidence shows that gene products encoded by short open reading frames play key roles in numerous cellular processes. Yet, they are generally overlooked in genome assembly, escaping annotation because small protein-coding genes are difficult to predict computationally. Consequently, there are still a considerable number of small proteins whose functions are yet to be characterized. RESULTS: To address this issue, we apply a collection of structural bioinformatics algorithms to infer molecular function of putative small proteins from the mouse proteome. Specifically, we construct 1,743 confident structure models of small proteins, which reveal a significant structural diversity with a noticeably high helical content. A subsequent structure-based function annotation of small protein models exposes 178,745 putative protein-protein interactions with the remaining gene products in the mouse proteome, 1,100 potential binding sites for small organic molecules and 987 metal-binding signatures. CONCLUSIONS: These results strongly indicate that many small proteins adopt three-dimensional structures and are fully functional, playing important roles in transcriptional regulation, cell signaling and metabolism. Data collected through this work is freely available to the academic community at http://www.brylinski.org/content/databases to support future studies oriented on elucidating the functions of hypothetical small proteins.
format Online
Article
Text
id pubmed-3866606
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38666062013-12-19 Exploring the “dark matter” of a mammalian proteome by protein structure and function modeling Brylinski, Michal Proteome Sci Research BACKGROUND: A growing body of evidence shows that gene products encoded by short open reading frames play key roles in numerous cellular processes. Yet, they are generally overlooked in genome assembly, escaping annotation because small protein-coding genes are difficult to predict computationally. Consequently, there are still a considerable number of small proteins whose functions are yet to be characterized. RESULTS: To address this issue, we apply a collection of structural bioinformatics algorithms to infer molecular function of putative small proteins from the mouse proteome. Specifically, we construct 1,743 confident structure models of small proteins, which reveal a significant structural diversity with a noticeably high helical content. A subsequent structure-based function annotation of small protein models exposes 178,745 putative protein-protein interactions with the remaining gene products in the mouse proteome, 1,100 potential binding sites for small organic molecules and 987 metal-binding signatures. CONCLUSIONS: These results strongly indicate that many small proteins adopt three-dimensional structures and are fully functional, playing important roles in transcriptional regulation, cell signaling and metabolism. Data collected through this work is freely available to the academic community at http://www.brylinski.org/content/databases to support future studies oriented on elucidating the functions of hypothetical small proteins. BioMed Central 2013-12-09 /pmc/articles/PMC3866606/ /pubmed/24321360 http://dx.doi.org/10.1186/1477-5956-11-47 Text en Copyright © 2013 Brylinski; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Brylinski, Michal
Exploring the “dark matter” of a mammalian proteome by protein structure and function modeling
title Exploring the “dark matter” of a mammalian proteome by protein structure and function modeling
title_full Exploring the “dark matter” of a mammalian proteome by protein structure and function modeling
title_fullStr Exploring the “dark matter” of a mammalian proteome by protein structure and function modeling
title_full_unstemmed Exploring the “dark matter” of a mammalian proteome by protein structure and function modeling
title_short Exploring the “dark matter” of a mammalian proteome by protein structure and function modeling
title_sort exploring the “dark matter” of a mammalian proteome by protein structure and function modeling
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3866606/
https://www.ncbi.nlm.nih.gov/pubmed/24321360
http://dx.doi.org/10.1186/1477-5956-11-47
work_keys_str_mv AT brylinskimichal exploringthedarkmatterofamammalianproteomebyproteinstructureandfunctionmodeling