Cargando…

Structure-Aware Mycobacterium tuberculosis Functional Annotation Uncloaks Resistance, Metabolic, and Virulence Genes

Accurate and timely functional genome annotation is essential for translating basic pathogen research into clinically impactful advances. Here, through literature curation and structure-function inference, we systematically update the functional genome annotation of Mycobacterium tuberculosis virule...

Descripción completa

Detalles Bibliográficos
Autores principales: Modlin, Samuel J., Elghraoui, Afif, Gunasekaran, Deepika, Zlotnicki, Alyssa M., Dillon, Nicholas A., Dhillon, Nermeeta, Kuo, Norman, Robinhold, Cassidy, Chan, Carmela K., Baughn, Anthony D., Valafar, Faramarz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8562490/
https://www.ncbi.nlm.nih.gov/pubmed/34726489
http://dx.doi.org/10.1128/mSystems.00673-21
_version_ 1784593270387507200
author Modlin, Samuel J.
Elghraoui, Afif
Gunasekaran, Deepika
Zlotnicki, Alyssa M.
Dillon, Nicholas A.
Dhillon, Nermeeta
Kuo, Norman
Robinhold, Cassidy
Chan, Carmela K.
Baughn, Anthony D.
Valafar, Faramarz
author_facet Modlin, Samuel J.
Elghraoui, Afif
Gunasekaran, Deepika
Zlotnicki, Alyssa M.
Dillon, Nicholas A.
Dhillon, Nermeeta
Kuo, Norman
Robinhold, Cassidy
Chan, Carmela K.
Baughn, Anthony D.
Valafar, Faramarz
author_sort Modlin, Samuel J.
collection PubMed
description Accurate and timely functional genome annotation is essential for translating basic pathogen research into clinically impactful advances. Here, through literature curation and structure-function inference, we systematically update the functional genome annotation of Mycobacterium tuberculosis virulent type strain H37Rv. First, we systematically curated annotations for 589 genes from 662 publications, including 282 gene products absent from leading databases. Second, we modeled 1,711 underannotated proteins and developed a semiautomated pipeline that captured shared function between 400 protein models and structural matches of known function on Protein Data Bank, including drug efflux proteins, metabolic enzymes, and virulence factors. In aggregate, these structure- and literature-derived annotations update 940/1,725 underannotated H37Rv genes and generate hundreds of functional hypotheses. Retrospectively applying the annotation to a recent whole-genome transposon mutant screen provided missing function for 48% (13/27) of underannotated genes altering antibiotic efficacy and 33% (23/69) required for persistence during mouse tuberculosis (TB) infection. Prospective application of the protein models enabled us to functionally interpret novel laboratory generated pyrazinamide (PZA)-resistant mutants of unknown function, which implicated the emerging coenzyme A depletion model of PZA action in the mutants’ PZA resistance. Our findings demonstrate the functional insight gained by integrating structural modeling and systematic literature curation, even for widely studied microorganisms. Functional annotations and protein structure models are available at https://tuberculosis.sdsu.edu/H37Rv in human- and machine-readable formats. IMPORTANCE Mycobacterium tuberculosis, the primary causative agent of tuberculosis, kills more humans than any other infectious bacterium. Yet 40% of its genome is functionally uncharacterized, leaving much about the genetic basis of its resistance to antibiotics, capacity to withstand host immunity, and basic metabolism yet undiscovered. Irregular literature curation for functional annotation contributes to this gap. We systematically curated functions from literature and structural similarity for over half of poorly characterized genes, expanding the functionally annotated Mycobacterium tuberculosis proteome. Applying this updated annotation to recent in vivo functional screens added functional information to dozens of clinically pertinent proteins described as having unknown function. Integrating the annotations with a prospective functional screen identified new mutants resistant to a first-line TB drug, supporting an emerging hypothesis for its mode of action. These improvements in functional interpretation of clinically informative studies underscore the translational value of this functional knowledge. Structure-derived annotations identify hundreds of high-confidence candidates for mechanisms of antibiotic resistance, virulence factors, and basic metabolism and other functions key in clinical and basic tuberculosis research. More broadly, they provide a systematic framework for improving prokaryotic reference annotations.
format Online
Article
Text
id pubmed-8562490
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-85624902021-11-04 Structure-Aware Mycobacterium tuberculosis Functional Annotation Uncloaks Resistance, Metabolic, and Virulence Genes Modlin, Samuel J. Elghraoui, Afif Gunasekaran, Deepika Zlotnicki, Alyssa M. Dillon, Nicholas A. Dhillon, Nermeeta Kuo, Norman Robinhold, Cassidy Chan, Carmela K. Baughn, Anthony D. Valafar, Faramarz mSystems Research Article Accurate and timely functional genome annotation is essential for translating basic pathogen research into clinically impactful advances. Here, through literature curation and structure-function inference, we systematically update the functional genome annotation of Mycobacterium tuberculosis virulent type strain H37Rv. First, we systematically curated annotations for 589 genes from 662 publications, including 282 gene products absent from leading databases. Second, we modeled 1,711 underannotated proteins and developed a semiautomated pipeline that captured shared function between 400 protein models and structural matches of known function on Protein Data Bank, including drug efflux proteins, metabolic enzymes, and virulence factors. In aggregate, these structure- and literature-derived annotations update 940/1,725 underannotated H37Rv genes and generate hundreds of functional hypotheses. Retrospectively applying the annotation to a recent whole-genome transposon mutant screen provided missing function for 48% (13/27) of underannotated genes altering antibiotic efficacy and 33% (23/69) required for persistence during mouse tuberculosis (TB) infection. Prospective application of the protein models enabled us to functionally interpret novel laboratory generated pyrazinamide (PZA)-resistant mutants of unknown function, which implicated the emerging coenzyme A depletion model of PZA action in the mutants’ PZA resistance. Our findings demonstrate the functional insight gained by integrating structural modeling and systematic literature curation, even for widely studied microorganisms. Functional annotations and protein structure models are available at https://tuberculosis.sdsu.edu/H37Rv in human- and machine-readable formats. IMPORTANCE Mycobacterium tuberculosis, the primary causative agent of tuberculosis, kills more humans than any other infectious bacterium. Yet 40% of its genome is functionally uncharacterized, leaving much about the genetic basis of its resistance to antibiotics, capacity to withstand host immunity, and basic metabolism yet undiscovered. Irregular literature curation for functional annotation contributes to this gap. We systematically curated functions from literature and structural similarity for over half of poorly characterized genes, expanding the functionally annotated Mycobacterium tuberculosis proteome. Applying this updated annotation to recent in vivo functional screens added functional information to dozens of clinically pertinent proteins described as having unknown function. Integrating the annotations with a prospective functional screen identified new mutants resistant to a first-line TB drug, supporting an emerging hypothesis for its mode of action. These improvements in functional interpretation of clinically informative studies underscore the translational value of this functional knowledge. Structure-derived annotations identify hundreds of high-confidence candidates for mechanisms of antibiotic resistance, virulence factors, and basic metabolism and other functions key in clinical and basic tuberculosis research. More broadly, they provide a systematic framework for improving prokaryotic reference annotations. American Society for Microbiology 2021-11-02 /pmc/articles/PMC8562490/ /pubmed/34726489 http://dx.doi.org/10.1128/mSystems.00673-21 Text en Copyright © 2021 Modlin et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research Article
Modlin, Samuel J.
Elghraoui, Afif
Gunasekaran, Deepika
Zlotnicki, Alyssa M.
Dillon, Nicholas A.
Dhillon, Nermeeta
Kuo, Norman
Robinhold, Cassidy
Chan, Carmela K.
Baughn, Anthony D.
Valafar, Faramarz
Structure-Aware Mycobacterium tuberculosis Functional Annotation Uncloaks Resistance, Metabolic, and Virulence Genes
title Structure-Aware Mycobacterium tuberculosis Functional Annotation Uncloaks Resistance, Metabolic, and Virulence Genes
title_full Structure-Aware Mycobacterium tuberculosis Functional Annotation Uncloaks Resistance, Metabolic, and Virulence Genes
title_fullStr Structure-Aware Mycobacterium tuberculosis Functional Annotation Uncloaks Resistance, Metabolic, and Virulence Genes
title_full_unstemmed Structure-Aware Mycobacterium tuberculosis Functional Annotation Uncloaks Resistance, Metabolic, and Virulence Genes
title_short Structure-Aware Mycobacterium tuberculosis Functional Annotation Uncloaks Resistance, Metabolic, and Virulence Genes
title_sort structure-aware mycobacterium tuberculosis functional annotation uncloaks resistance, metabolic, and virulence genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8562490/
https://www.ncbi.nlm.nih.gov/pubmed/34726489
http://dx.doi.org/10.1128/mSystems.00673-21
work_keys_str_mv AT modlinsamuelj structureawaremycobacteriumtuberculosisfunctionalannotationuncloaksresistancemetabolicandvirulencegenes
AT elghraouiafif structureawaremycobacteriumtuberculosisfunctionalannotationuncloaksresistancemetabolicandvirulencegenes
AT gunasekarandeepika structureawaremycobacteriumtuberculosisfunctionalannotationuncloaksresistancemetabolicandvirulencegenes
AT zlotnickialyssam structureawaremycobacteriumtuberculosisfunctionalannotationuncloaksresistancemetabolicandvirulencegenes
AT dillonnicholasa structureawaremycobacteriumtuberculosisfunctionalannotationuncloaksresistancemetabolicandvirulencegenes
AT dhillonnermeeta structureawaremycobacteriumtuberculosisfunctionalannotationuncloaksresistancemetabolicandvirulencegenes
AT kuonorman structureawaremycobacteriumtuberculosisfunctionalannotationuncloaksresistancemetabolicandvirulencegenes
AT robinholdcassidy structureawaremycobacteriumtuberculosisfunctionalannotationuncloaksresistancemetabolicandvirulencegenes
AT chancarmelak structureawaremycobacteriumtuberculosisfunctionalannotationuncloaksresistancemetabolicandvirulencegenes
AT baughnanthonyd structureawaremycobacteriumtuberculosisfunctionalannotationuncloaksresistancemetabolicandvirulencegenes
AT valafarfaramarz structureawaremycobacteriumtuberculosisfunctionalannotationuncloaksresistancemetabolicandvirulencegenes