Cargando…

In Silico screening for functional candidates amongst hypothetical proteins

BACKGROUND: The definition of a hypothetical protein is a protein that is predicted to be expressed from an open reading frame, but for which there is no experimental evidence of translation. Hypothetical proteins constitute a substantial fraction of proteomes of human as well as of other eukaryotes...

Descripción completa

Detalles Bibliográficos
Autores principales: Desler, Claus, Suravajhala, Prashanth, Sanderhoff, May, Rasmussen, Merete, Rasmussen, Lene Juel
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2758874/
https://www.ncbi.nlm.nih.gov/pubmed/19754976
http://dx.doi.org/10.1186/1471-2105-10-289
_version_ 1782172621604913152
author Desler, Claus
Suravajhala, Prashanth
Sanderhoff, May
Rasmussen, Merete
Rasmussen, Lene Juel
author_facet Desler, Claus
Suravajhala, Prashanth
Sanderhoff, May
Rasmussen, Merete
Rasmussen, Lene Juel
author_sort Desler, Claus
collection PubMed
description BACKGROUND: The definition of a hypothetical protein is a protein that is predicted to be expressed from an open reading frame, but for which there is no experimental evidence of translation. Hypothetical proteins constitute a substantial fraction of proteomes of human as well as of other eukaryotes. With the general belief that the majority of hypothetical proteins are the product of pseudogenes, it is essential to have a tool with the ability of pinpointing the minority of hypothetical proteins with a high probability of being expressed. RESULTS: Here, we present an in silico selection strategy where eukaryotic hypothetical proteins are sorted according to two criteria that can be reliably identified in silico: the presence of subcellular targeting signals and presence of characterized protein domains. To validate the selection strategy we applied it on a database of human hypothetical proteins dating to 2006 and compared the proteins predicted to be expressed by our selecting strategy, with their status in 2008. For the comparison we focused on mitochondrial proteins, since considerable amounts of research have focused on this field in between 2006 and 2008. Therefore, many proteins, defined as hypothetical in 2006, have later been characterized as mitochondrial. CONCLUSION: Among the total amount of human proteins hypothetical in 2006, 21% have later been experimentally characterized and 6% of those have been shown to have a role in a mitochondrial context. In contrast, among the selected hypothetical proteins from the 2006 dataset, predicted by our strategy to have a mitochondrial role, 53-62% have later been experimentally characterized, and 85% of these have actually been assigned a role in mitochondria by 2008. Therefore our in silico selection strategy can be used to select the most promising candidates for subsequent in vitro and in vivo analyses.
format Text
id pubmed-2758874
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27588742009-10-08 In Silico screening for functional candidates amongst hypothetical proteins Desler, Claus Suravajhala, Prashanth Sanderhoff, May Rasmussen, Merete Rasmussen, Lene Juel BMC Bioinformatics Methodology Article BACKGROUND: The definition of a hypothetical protein is a protein that is predicted to be expressed from an open reading frame, but for which there is no experimental evidence of translation. Hypothetical proteins constitute a substantial fraction of proteomes of human as well as of other eukaryotes. With the general belief that the majority of hypothetical proteins are the product of pseudogenes, it is essential to have a tool with the ability of pinpointing the minority of hypothetical proteins with a high probability of being expressed. RESULTS: Here, we present an in silico selection strategy where eukaryotic hypothetical proteins are sorted according to two criteria that can be reliably identified in silico: the presence of subcellular targeting signals and presence of characterized protein domains. To validate the selection strategy we applied it on a database of human hypothetical proteins dating to 2006 and compared the proteins predicted to be expressed by our selecting strategy, with their status in 2008. For the comparison we focused on mitochondrial proteins, since considerable amounts of research have focused on this field in between 2006 and 2008. Therefore, many proteins, defined as hypothetical in 2006, have later been characterized as mitochondrial. CONCLUSION: Among the total amount of human proteins hypothetical in 2006, 21% have later been experimentally characterized and 6% of those have been shown to have a role in a mitochondrial context. In contrast, among the selected hypothetical proteins from the 2006 dataset, predicted by our strategy to have a mitochondrial role, 53-62% have later been experimentally characterized, and 85% of these have actually been assigned a role in mitochondria by 2008. Therefore our in silico selection strategy can be used to select the most promising candidates for subsequent in vitro and in vivo analyses. BioMed Central 2009-09-16 /pmc/articles/PMC2758874/ /pubmed/19754976 http://dx.doi.org/10.1186/1471-2105-10-289 Text en Copyright © 2009 Desler et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Desler, Claus
Suravajhala, Prashanth
Sanderhoff, May
Rasmussen, Merete
Rasmussen, Lene Juel
In Silico screening for functional candidates amongst hypothetical proteins
title In Silico screening for functional candidates amongst hypothetical proteins
title_full In Silico screening for functional candidates amongst hypothetical proteins
title_fullStr In Silico screening for functional candidates amongst hypothetical proteins
title_full_unstemmed In Silico screening for functional candidates amongst hypothetical proteins
title_short In Silico screening for functional candidates amongst hypothetical proteins
title_sort in silico screening for functional candidates amongst hypothetical proteins
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2758874/
https://www.ncbi.nlm.nih.gov/pubmed/19754976
http://dx.doi.org/10.1186/1471-2105-10-289
work_keys_str_mv AT deslerclaus insilicoscreeningforfunctionalcandidatesamongsthypotheticalproteins
AT suravajhalaprashanth insilicoscreeningforfunctionalcandidatesamongsthypotheticalproteins
AT sanderhoffmay insilicoscreeningforfunctionalcandidatesamongsthypotheticalproteins
AT rasmussenmerete insilicoscreeningforfunctionalcandidatesamongsthypotheticalproteins
AT rasmussenlenejuel insilicoscreeningforfunctionalcandidatesamongsthypotheticalproteins