Cargando…

Investigating function roles of hypothetical proteins encoded by the Mycobacterium tuberculosis H37Rv genome

BACKGROUND: Mycobacterium tuberculosis (MTB) is a common bacterium causing tuberculosis and remains a major pathogen for mortality. Although the MTB genome has been extensively explored for two decades, the functions of 27% (1051/3906) of encoded proteins have yet to be determined and these proteins...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Zhiyuan, Zeng, Xi, Tsui, Stephen Kwok-Wing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6528289/
https://www.ncbi.nlm.nih.gov/pubmed/31113361
http://dx.doi.org/10.1186/s12864-019-5746-6
_version_ 1783420184580587520
author Yang, Zhiyuan
Zeng, Xi
Tsui, Stephen Kwok-Wing
author_facet Yang, Zhiyuan
Zeng, Xi
Tsui, Stephen Kwok-Wing
author_sort Yang, Zhiyuan
collection PubMed
description BACKGROUND: Mycobacterium tuberculosis (MTB) is a common bacterium causing tuberculosis and remains a major pathogen for mortality. Although the MTB genome has been extensively explored for two decades, the functions of 27% (1051/3906) of encoded proteins have yet to be determined and these proteins are annotated as hypothetical proteins. METHODS: We assigned functions to these hypothetical proteins using SSEalign, a newly designed algorithm utilizing structural information. A set of rigorous criteria was applied to these annotations in order to examine whether they were supported by each parameter. Virulence factors and potential drug targets were also screened among the annotated proteins. RESULTS: For 78% (823/1051) of the hypothetical proteins, we could identify homologs in Escherichia coli and Salmonella typhimurium by using SSEalign. Functional classification analysis indicated that 62.2% (512/823) of these annotated proteins were enzymes with catalytic activities and most of these annotations were supported by at least two other independent parameters. A relatively high proportion of transporter was identified in MTB genome, indicating the potential frequent transportation of frequent absorbing essential metabolites and excreting toxic materials in MTB. Twelve virulence factors and ten vaccine candidates were identified within these MTB hypothetical proteins, including two genes (rpoS and pspA) related to stress response to the host immune system. Furthermore, we have identified six novel drug target candidates among our annotated proteins, including Rv0817 and Rv2927c, which could be used for treating MTB infection. CONCLUSIONS: Our annotation of the MTB hypothetical proteins will probably serve as a useful dataset for future MTB studies. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5746-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6528289
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-65282892019-05-28 Investigating function roles of hypothetical proteins encoded by the Mycobacterium tuberculosis H37Rv genome Yang, Zhiyuan Zeng, Xi Tsui, Stephen Kwok-Wing BMC Genomics Research Article BACKGROUND: Mycobacterium tuberculosis (MTB) is a common bacterium causing tuberculosis and remains a major pathogen for mortality. Although the MTB genome has been extensively explored for two decades, the functions of 27% (1051/3906) of encoded proteins have yet to be determined and these proteins are annotated as hypothetical proteins. METHODS: We assigned functions to these hypothetical proteins using SSEalign, a newly designed algorithm utilizing structural information. A set of rigorous criteria was applied to these annotations in order to examine whether they were supported by each parameter. Virulence factors and potential drug targets were also screened among the annotated proteins. RESULTS: For 78% (823/1051) of the hypothetical proteins, we could identify homologs in Escherichia coli and Salmonella typhimurium by using SSEalign. Functional classification analysis indicated that 62.2% (512/823) of these annotated proteins were enzymes with catalytic activities and most of these annotations were supported by at least two other independent parameters. A relatively high proportion of transporter was identified in MTB genome, indicating the potential frequent transportation of frequent absorbing essential metabolites and excreting toxic materials in MTB. Twelve virulence factors and ten vaccine candidates were identified within these MTB hypothetical proteins, including two genes (rpoS and pspA) related to stress response to the host immune system. Furthermore, we have identified six novel drug target candidates among our annotated proteins, including Rv0817 and Rv2927c, which could be used for treating MTB infection. CONCLUSIONS: Our annotation of the MTB hypothetical proteins will probably serve as a useful dataset for future MTB studies. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5746-6) contains supplementary material, which is available to authorized users. BioMed Central 2019-05-21 /pmc/articles/PMC6528289/ /pubmed/31113361 http://dx.doi.org/10.1186/s12864-019-5746-6 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Yang, Zhiyuan
Zeng, Xi
Tsui, Stephen Kwok-Wing
Investigating function roles of hypothetical proteins encoded by the Mycobacterium tuberculosis H37Rv genome
title Investigating function roles of hypothetical proteins encoded by the Mycobacterium tuberculosis H37Rv genome
title_full Investigating function roles of hypothetical proteins encoded by the Mycobacterium tuberculosis H37Rv genome
title_fullStr Investigating function roles of hypothetical proteins encoded by the Mycobacterium tuberculosis H37Rv genome
title_full_unstemmed Investigating function roles of hypothetical proteins encoded by the Mycobacterium tuberculosis H37Rv genome
title_short Investigating function roles of hypothetical proteins encoded by the Mycobacterium tuberculosis H37Rv genome
title_sort investigating function roles of hypothetical proteins encoded by the mycobacterium tuberculosis h37rv genome
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6528289/
https://www.ncbi.nlm.nih.gov/pubmed/31113361
http://dx.doi.org/10.1186/s12864-019-5746-6
work_keys_str_mv AT yangzhiyuan investigatingfunctionrolesofhypotheticalproteinsencodedbythemycobacteriumtuberculosish37rvgenome
AT zengxi investigatingfunctionrolesofhypotheticalproteinsencodedbythemycobacteriumtuberculosish37rvgenome
AT tsuistephenkwokwing investigatingfunctionrolesofhypotheticalproteinsencodedbythemycobacteriumtuberculosish37rvgenome