Cargando…
Investigating function roles of hypothetical proteins encoded by the Mycobacterium tuberculosis H37Rv genome
BACKGROUND: Mycobacterium tuberculosis (MTB) is a common bacterium causing tuberculosis and remains a major pathogen for mortality. Although the MTB genome has been extensively explored for two decades, the functions of 27% (1051/3906) of encoded proteins have yet to be determined and these proteins...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6528289/ https://www.ncbi.nlm.nih.gov/pubmed/31113361 http://dx.doi.org/10.1186/s12864-019-5746-6 |
_version_ | 1783420184580587520 |
---|---|
author | Yang, Zhiyuan Zeng, Xi Tsui, Stephen Kwok-Wing |
author_facet | Yang, Zhiyuan Zeng, Xi Tsui, Stephen Kwok-Wing |
author_sort | Yang, Zhiyuan |
collection | PubMed |
description | BACKGROUND: Mycobacterium tuberculosis (MTB) is a common bacterium causing tuberculosis and remains a major pathogen for mortality. Although the MTB genome has been extensively explored for two decades, the functions of 27% (1051/3906) of encoded proteins have yet to be determined and these proteins are annotated as hypothetical proteins. METHODS: We assigned functions to these hypothetical proteins using SSEalign, a newly designed algorithm utilizing structural information. A set of rigorous criteria was applied to these annotations in order to examine whether they were supported by each parameter. Virulence factors and potential drug targets were also screened among the annotated proteins. RESULTS: For 78% (823/1051) of the hypothetical proteins, we could identify homologs in Escherichia coli and Salmonella typhimurium by using SSEalign. Functional classification analysis indicated that 62.2% (512/823) of these annotated proteins were enzymes with catalytic activities and most of these annotations were supported by at least two other independent parameters. A relatively high proportion of transporter was identified in MTB genome, indicating the potential frequent transportation of frequent absorbing essential metabolites and excreting toxic materials in MTB. Twelve virulence factors and ten vaccine candidates were identified within these MTB hypothetical proteins, including two genes (rpoS and pspA) related to stress response to the host immune system. Furthermore, we have identified six novel drug target candidates among our annotated proteins, including Rv0817 and Rv2927c, which could be used for treating MTB infection. CONCLUSIONS: Our annotation of the MTB hypothetical proteins will probably serve as a useful dataset for future MTB studies. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5746-6) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6528289 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-65282892019-05-28 Investigating function roles of hypothetical proteins encoded by the Mycobacterium tuberculosis H37Rv genome Yang, Zhiyuan Zeng, Xi Tsui, Stephen Kwok-Wing BMC Genomics Research Article BACKGROUND: Mycobacterium tuberculosis (MTB) is a common bacterium causing tuberculosis and remains a major pathogen for mortality. Although the MTB genome has been extensively explored for two decades, the functions of 27% (1051/3906) of encoded proteins have yet to be determined and these proteins are annotated as hypothetical proteins. METHODS: We assigned functions to these hypothetical proteins using SSEalign, a newly designed algorithm utilizing structural information. A set of rigorous criteria was applied to these annotations in order to examine whether they were supported by each parameter. Virulence factors and potential drug targets were also screened among the annotated proteins. RESULTS: For 78% (823/1051) of the hypothetical proteins, we could identify homologs in Escherichia coli and Salmonella typhimurium by using SSEalign. Functional classification analysis indicated that 62.2% (512/823) of these annotated proteins were enzymes with catalytic activities and most of these annotations were supported by at least two other independent parameters. A relatively high proportion of transporter was identified in MTB genome, indicating the potential frequent transportation of frequent absorbing essential metabolites and excreting toxic materials in MTB. Twelve virulence factors and ten vaccine candidates were identified within these MTB hypothetical proteins, including two genes (rpoS and pspA) related to stress response to the host immune system. Furthermore, we have identified six novel drug target candidates among our annotated proteins, including Rv0817 and Rv2927c, which could be used for treating MTB infection. CONCLUSIONS: Our annotation of the MTB hypothetical proteins will probably serve as a useful dataset for future MTB studies. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5746-6) contains supplementary material, which is available to authorized users. BioMed Central 2019-05-21 /pmc/articles/PMC6528289/ /pubmed/31113361 http://dx.doi.org/10.1186/s12864-019-5746-6 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Yang, Zhiyuan Zeng, Xi Tsui, Stephen Kwok-Wing Investigating function roles of hypothetical proteins encoded by the Mycobacterium tuberculosis H37Rv genome |
title | Investigating function roles of hypothetical proteins encoded by the Mycobacterium tuberculosis H37Rv genome |
title_full | Investigating function roles of hypothetical proteins encoded by the Mycobacterium tuberculosis H37Rv genome |
title_fullStr | Investigating function roles of hypothetical proteins encoded by the Mycobacterium tuberculosis H37Rv genome |
title_full_unstemmed | Investigating function roles of hypothetical proteins encoded by the Mycobacterium tuberculosis H37Rv genome |
title_short | Investigating function roles of hypothetical proteins encoded by the Mycobacterium tuberculosis H37Rv genome |
title_sort | investigating function roles of hypothetical proteins encoded by the mycobacterium tuberculosis h37rv genome |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6528289/ https://www.ncbi.nlm.nih.gov/pubmed/31113361 http://dx.doi.org/10.1186/s12864-019-5746-6 |
work_keys_str_mv | AT yangzhiyuan investigatingfunctionrolesofhypotheticalproteinsencodedbythemycobacteriumtuberculosish37rvgenome AT zengxi investigatingfunctionrolesofhypotheticalproteinsencodedbythemycobacteriumtuberculosish37rvgenome AT tsuistephenkwokwing investigatingfunctionrolesofhypotheticalproteinsencodedbythemycobacteriumtuberculosish37rvgenome |