Cargando…
Computational approach to predict species-specific type III secretion system (T3SS) effectors using single and multiple genomes
BACKGROUND: Many gram-negative bacteria use type III secretion systems (T3SSs) to translocate effector proteins into host cells. T3SS effectors can give some bacteria a competitive edge over others within the same environment and can help bacteria to invade the host cells and allow them to multiply...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5168842/ https://www.ncbi.nlm.nih.gov/pubmed/27993130 http://dx.doi.org/10.1186/s12864-016-3363-1 |
_version_ | 1782483423375392768 |
---|---|
author | Hobbs, Christopher K. Porter, Vanessa L. Stow, Maxwell L. S. Siame, Bupe A. Tsang, Herbert H. Leung, Ka Yin |
author_facet | Hobbs, Christopher K. Porter, Vanessa L. Stow, Maxwell L. S. Siame, Bupe A. Tsang, Herbert H. Leung, Ka Yin |
author_sort | Hobbs, Christopher K. |
collection | PubMed |
description | BACKGROUND: Many gram-negative bacteria use type III secretion systems (T3SSs) to translocate effector proteins into host cells. T3SS effectors can give some bacteria a competitive edge over others within the same environment and can help bacteria to invade the host cells and allow them to multiply rapidly within the host. Therefore, developing efficient methods to identify effectors scattered in bacterial genomes can lead to a better understanding of host-pathogen interactions and ultimately to important medical and biotechnological applications. RESULTS: We used 21 genomic and proteomic attributes to create a precise and reliable T3SS effector prediction method called Genome Search for Effectors Tool (GenSET). Five machine learning algorithms were trained on effectors selected from different organisms and a trained (voting) algorithm was then applied to identify other effectors present in the genome testing sets from the same (GenSET Phase 1) or different (GenSET Phase 2) organism. Although a select group of attributes that included the codon adaptation index, probability of expression in inclusion bodies, N-terminal disorder, and G + C content (filtered) were better at discriminating between positive and negative sets, algorithm performance was better when all 21 attributes (unfiltered) were used. Performance scores (sensitivity, specificity and area under the curve) from GenSET Phase 1 were better than those reported for six published methods. More importantly, GenSET Phase 1 ranked more known effectors (70.3%) in the top 40 ranked proteins and predicted 10–80% more effectors than three available programs in three of the four organisms tested. GenSET Phase 2 predicted 43.8% effectors in the top 40 ranked proteins when tested on four related or unrelated organisms. The lower prediction rates from GenSET Phase 2 may be due to the presence of different translocation signals in effectors from different T3SS families. CONCLUSIONS: The species-specific GenSET Phase 1 method offers an alternative approach to T3SS effector prediction that can be used with other published programs to improve effector predictions. Additionally, our approach can be applied to predict effectors of other secretion systems as long as these effectors have translocation signals embedded in their sequences. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-3363-1) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5168842 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-51688422016-12-23 Computational approach to predict species-specific type III secretion system (T3SS) effectors using single and multiple genomes Hobbs, Christopher K. Porter, Vanessa L. Stow, Maxwell L. S. Siame, Bupe A. Tsang, Herbert H. Leung, Ka Yin BMC Genomics Research Article BACKGROUND: Many gram-negative bacteria use type III secretion systems (T3SSs) to translocate effector proteins into host cells. T3SS effectors can give some bacteria a competitive edge over others within the same environment and can help bacteria to invade the host cells and allow them to multiply rapidly within the host. Therefore, developing efficient methods to identify effectors scattered in bacterial genomes can lead to a better understanding of host-pathogen interactions and ultimately to important medical and biotechnological applications. RESULTS: We used 21 genomic and proteomic attributes to create a precise and reliable T3SS effector prediction method called Genome Search for Effectors Tool (GenSET). Five machine learning algorithms were trained on effectors selected from different organisms and a trained (voting) algorithm was then applied to identify other effectors present in the genome testing sets from the same (GenSET Phase 1) or different (GenSET Phase 2) organism. Although a select group of attributes that included the codon adaptation index, probability of expression in inclusion bodies, N-terminal disorder, and G + C content (filtered) were better at discriminating between positive and negative sets, algorithm performance was better when all 21 attributes (unfiltered) were used. Performance scores (sensitivity, specificity and area under the curve) from GenSET Phase 1 were better than those reported for six published methods. More importantly, GenSET Phase 1 ranked more known effectors (70.3%) in the top 40 ranked proteins and predicted 10–80% more effectors than three available programs in three of the four organisms tested. GenSET Phase 2 predicted 43.8% effectors in the top 40 ranked proteins when tested on four related or unrelated organisms. The lower prediction rates from GenSET Phase 2 may be due to the presence of different translocation signals in effectors from different T3SS families. CONCLUSIONS: The species-specific GenSET Phase 1 method offers an alternative approach to T3SS effector prediction that can be used with other published programs to improve effector predictions. Additionally, our approach can be applied to predict effectors of other secretion systems as long as these effectors have translocation signals embedded in their sequences. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-3363-1) contains supplementary material, which is available to authorized users. BioMed Central 2016-12-19 /pmc/articles/PMC5168842/ /pubmed/27993130 http://dx.doi.org/10.1186/s12864-016-3363-1 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Hobbs, Christopher K. Porter, Vanessa L. Stow, Maxwell L. S. Siame, Bupe A. Tsang, Herbert H. Leung, Ka Yin Computational approach to predict species-specific type III secretion system (T3SS) effectors using single and multiple genomes |
title | Computational approach to predict species-specific type III secretion system (T3SS) effectors using single and multiple genomes |
title_full | Computational approach to predict species-specific type III secretion system (T3SS) effectors using single and multiple genomes |
title_fullStr | Computational approach to predict species-specific type III secretion system (T3SS) effectors using single and multiple genomes |
title_full_unstemmed | Computational approach to predict species-specific type III secretion system (T3SS) effectors using single and multiple genomes |
title_short | Computational approach to predict species-specific type III secretion system (T3SS) effectors using single and multiple genomes |
title_sort | computational approach to predict species-specific type iii secretion system (t3ss) effectors using single and multiple genomes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5168842/ https://www.ncbi.nlm.nih.gov/pubmed/27993130 http://dx.doi.org/10.1186/s12864-016-3363-1 |
work_keys_str_mv | AT hobbschristopherk computationalapproachtopredictspeciesspecifictypeiiisecretionsystemt3sseffectorsusingsingleandmultiplegenomes AT portervanessal computationalapproachtopredictspeciesspecifictypeiiisecretionsystemt3sseffectorsusingsingleandmultiplegenomes AT stowmaxwellls computationalapproachtopredictspeciesspecifictypeiiisecretionsystemt3sseffectorsusingsingleandmultiplegenomes AT siamebupea computationalapproachtopredictspeciesspecifictypeiiisecretionsystemt3sseffectorsusingsingleandmultiplegenomes AT tsangherberth computationalapproachtopredictspeciesspecifictypeiiisecretionsystemt3sseffectorsusingsingleandmultiplegenomes AT leungkayin computationalapproachtopredictspeciesspecifictypeiiisecretionsystemt3sseffectorsusingsingleandmultiplegenomes |