Cargando…

Prediction of bacterial E3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting

BACKGROUND: Although pathogenic Gram-negative bacteria lack their own ubiquitination machinery, they have evolved or acquired virulence effectors that can manipulate the host ubiquitination process through structural and/or functional mimicry of host machinery. Many such effectors have been identifi...

Descripción completa

Detalles Bibliográficos
Autores principales: McDermott, Jason E., Cort, John R., Nakayasu, Ernesto S., Pruneda, Jonathan N., Overall, Christopher, Adkins, Joshua N.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6557245/
https://www.ncbi.nlm.nih.gov/pubmed/31211016
http://dx.doi.org/10.7717/peerj.7055
_version_ 1783425442671230976
author McDermott, Jason E.
Cort, John R.
Nakayasu, Ernesto S.
Pruneda, Jonathan N.
Overall, Christopher
Adkins, Joshua N.
author_facet McDermott, Jason E.
Cort, John R.
Nakayasu, Ernesto S.
Pruneda, Jonathan N.
Overall, Christopher
Adkins, Joshua N.
author_sort McDermott, Jason E.
collection PubMed
description BACKGROUND: Although pathogenic Gram-negative bacteria lack their own ubiquitination machinery, they have evolved or acquired virulence effectors that can manipulate the host ubiquitination process through structural and/or functional mimicry of host machinery. Many such effectors have been identified in a wide variety of bacterial pathogens that share little sequence similarity amongst themselves or with eukaryotic ubiquitin E3 ligases. METHODS: To allow identification of novel bacterial E3 ubiquitin ligase effectors from protein sequences we have developed a machine learning approach, the SVM-based Identification and Evaluation of Virulence Effector Ubiquitin ligases (SIEVE-Ub). We extend the string kernel approach used previously to sequence classification by introducing reduced amino acid (RED) alphabet encoding for protein sequences. RESULTS: We found that 14mer peptides with amino acids represented as simply either hydrophobic or hydrophilic provided the best models for discrimination of E3 ligases from other effector proteins with a receiver-operator characteristic area under the curve (AUC) of 0.90. When considering a subset of E3 ubiquitin ligase effectors that do not fall into known sequence based families we found that the AUC was 0.82, demonstrating the effectiveness of our method at identifying novel functional family members. Feature selection was used to identify a parsimonious set of 10 RED peptides that provided good discrimination, and these peptides were found to be located in functionally important regions of the proteins involved in E2 and host target protein binding. Our general approach enables construction of models based on other effector functions. We used SIEVE-Ub to predict nine potential novel E3 ligases from a large set of bacterial genomes. SIEVE-Ub is available for download at https://doi.org/10.6084/m9.figshare.7766984.v1 or https://github.com/biodataganache/SIEVE-Ub for the most current version.
format Online
Article
Text
id pubmed-6557245
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-65572452019-06-17 Prediction of bacterial E3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting McDermott, Jason E. Cort, John R. Nakayasu, Ernesto S. Pruneda, Jonathan N. Overall, Christopher Adkins, Joshua N. PeerJ Bioinformatics BACKGROUND: Although pathogenic Gram-negative bacteria lack their own ubiquitination machinery, they have evolved or acquired virulence effectors that can manipulate the host ubiquitination process through structural and/or functional mimicry of host machinery. Many such effectors have been identified in a wide variety of bacterial pathogens that share little sequence similarity amongst themselves or with eukaryotic ubiquitin E3 ligases. METHODS: To allow identification of novel bacterial E3 ubiquitin ligase effectors from protein sequences we have developed a machine learning approach, the SVM-based Identification and Evaluation of Virulence Effector Ubiquitin ligases (SIEVE-Ub). We extend the string kernel approach used previously to sequence classification by introducing reduced amino acid (RED) alphabet encoding for protein sequences. RESULTS: We found that 14mer peptides with amino acids represented as simply either hydrophobic or hydrophilic provided the best models for discrimination of E3 ligases from other effector proteins with a receiver-operator characteristic area under the curve (AUC) of 0.90. When considering a subset of E3 ubiquitin ligase effectors that do not fall into known sequence based families we found that the AUC was 0.82, demonstrating the effectiveness of our method at identifying novel functional family members. Feature selection was used to identify a parsimonious set of 10 RED peptides that provided good discrimination, and these peptides were found to be located in functionally important regions of the proteins involved in E2 and host target protein binding. Our general approach enables construction of models based on other effector functions. We used SIEVE-Ub to predict nine potential novel E3 ligases from a large set of bacterial genomes. SIEVE-Ub is available for download at https://doi.org/10.6084/m9.figshare.7766984.v1 or https://github.com/biodataganache/SIEVE-Ub for the most current version. PeerJ Inc. 2019-06-07 /pmc/articles/PMC6557245/ /pubmed/31211016 http://dx.doi.org/10.7717/peerj.7055 Text en ©2019 McDermott et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
McDermott, Jason E.
Cort, John R.
Nakayasu, Ernesto S.
Pruneda, Jonathan N.
Overall, Christopher
Adkins, Joshua N.
Prediction of bacterial E3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting
title Prediction of bacterial E3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting
title_full Prediction of bacterial E3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting
title_fullStr Prediction of bacterial E3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting
title_full_unstemmed Prediction of bacterial E3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting
title_short Prediction of bacterial E3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting
title_sort prediction of bacterial e3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6557245/
https://www.ncbi.nlm.nih.gov/pubmed/31211016
http://dx.doi.org/10.7717/peerj.7055
work_keys_str_mv AT mcdermottjasone predictionofbacteriale3ubiquitinligaseeffectorsusingreducedaminoacidpeptidefingerprinting
AT cortjohnr predictionofbacteriale3ubiquitinligaseeffectorsusingreducedaminoacidpeptidefingerprinting
AT nakayasuernestos predictionofbacteriale3ubiquitinligaseeffectorsusingreducedaminoacidpeptidefingerprinting
AT prunedajonathann predictionofbacteriale3ubiquitinligaseeffectorsusingreducedaminoacidpeptidefingerprinting
AT overallchristopher predictionofbacteriale3ubiquitinligaseeffectorsusingreducedaminoacidpeptidefingerprinting
AT adkinsjoshuan predictionofbacteriale3ubiquitinligaseeffectorsusingreducedaminoacidpeptidefingerprinting