Cargando…

Annotation of the M. tuberculosis Hypothetical Orfeome: Adding Functional Information to More than Half of the Uncharacterized Proteins

The genome of Mycobacterium tuberculosis (H37Rv) contains 4,019 protein coding genes, of which more than thousand have been categorized as ‘hypothetical’ implying that for these not even weak functional associations could be identified so far. We here predict reliable functional indications for half...

Descripción completa

Detalles Bibliográficos
Autores principales: Doerks, Tobias, van Noort, Vera, Minguez, Pablo, Bork, Peer
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3317503/
https://www.ncbi.nlm.nih.gov/pubmed/22485162
http://dx.doi.org/10.1371/journal.pone.0034302
_version_ 1782228566746857472
author Doerks, Tobias
van Noort, Vera
Minguez, Pablo
Bork, Peer
author_facet Doerks, Tobias
van Noort, Vera
Minguez, Pablo
Bork, Peer
author_sort Doerks, Tobias
collection PubMed
description The genome of Mycobacterium tuberculosis (H37Rv) contains 4,019 protein coding genes, of which more than thousand have been categorized as ‘hypothetical’ implying that for these not even weak functional associations could be identified so far. We here predict reliable functional indications for half of this large hypothetical orfeome: 497 genes can be annotated based on orthology, and another 125 can be linked to interacting proteins via integrated genomic context analysis and literature mining. The assignments include newly identified clusters of interacting proteins, hypothetical genes that are associated to well known pathways and putative disease-relevant targets. All together, we have raised the fraction of the proteome with at least some functional annotation to 88% which should considerably enhance the interpretation of large-scale experiments targeting this medically important organism.
format Online
Article
Text
id pubmed-3317503
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-33175032012-04-06 Annotation of the M. tuberculosis Hypothetical Orfeome: Adding Functional Information to More than Half of the Uncharacterized Proteins Doerks, Tobias van Noort, Vera Minguez, Pablo Bork, Peer PLoS One Research Article The genome of Mycobacterium tuberculosis (H37Rv) contains 4,019 protein coding genes, of which more than thousand have been categorized as ‘hypothetical’ implying that for these not even weak functional associations could be identified so far. We here predict reliable functional indications for half of this large hypothetical orfeome: 497 genes can be annotated based on orthology, and another 125 can be linked to interacting proteins via integrated genomic context analysis and literature mining. The assignments include newly identified clusters of interacting proteins, hypothetical genes that are associated to well known pathways and putative disease-relevant targets. All together, we have raised the fraction of the proteome with at least some functional annotation to 88% which should considerably enhance the interpretation of large-scale experiments targeting this medically important organism. Public Library of Science 2012-04-02 /pmc/articles/PMC3317503/ /pubmed/22485162 http://dx.doi.org/10.1371/journal.pone.0034302 Text en Doerks et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Doerks, Tobias
van Noort, Vera
Minguez, Pablo
Bork, Peer
Annotation of the M. tuberculosis Hypothetical Orfeome: Adding Functional Information to More than Half of the Uncharacterized Proteins
title Annotation of the M. tuberculosis Hypothetical Orfeome: Adding Functional Information to More than Half of the Uncharacterized Proteins
title_full Annotation of the M. tuberculosis Hypothetical Orfeome: Adding Functional Information to More than Half of the Uncharacterized Proteins
title_fullStr Annotation of the M. tuberculosis Hypothetical Orfeome: Adding Functional Information to More than Half of the Uncharacterized Proteins
title_full_unstemmed Annotation of the M. tuberculosis Hypothetical Orfeome: Adding Functional Information to More than Half of the Uncharacterized Proteins
title_short Annotation of the M. tuberculosis Hypothetical Orfeome: Adding Functional Information to More than Half of the Uncharacterized Proteins
title_sort annotation of the m. tuberculosis hypothetical orfeome: adding functional information to more than half of the uncharacterized proteins
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3317503/
https://www.ncbi.nlm.nih.gov/pubmed/22485162
http://dx.doi.org/10.1371/journal.pone.0034302
work_keys_str_mv AT doerkstobias annotationofthemtuberculosishypotheticalorfeomeaddingfunctionalinformationtomorethanhalfoftheuncharacterizedproteins
AT vannoortvera annotationofthemtuberculosishypotheticalorfeomeaddingfunctionalinformationtomorethanhalfoftheuncharacterizedproteins
AT minguezpablo annotationofthemtuberculosishypotheticalorfeomeaddingfunctionalinformationtomorethanhalfoftheuncharacterizedproteins
AT borkpeer annotationofthemtuberculosishypotheticalorfeomeaddingfunctionalinformationtomorethanhalfoftheuncharacterizedproteins