Cargando…
Learning virulent proteins from integrated query networks
BACKGROUND: Methods of weakening and attenuating pathogens’ abilities to infect and propagate in a host, thus allowing the natural immune system to more easily decimate invaders, have gained attention as alternatives to broad-spectrum targeting approaches. The following work describes a technique to...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3560104/ https://www.ncbi.nlm.nih.gov/pubmed/23198735 http://dx.doi.org/10.1186/1471-2105-13-321 |
_version_ | 1782257732629299200 |
---|---|
author | Cadag, Eithon Tarczy-Hornoch, Peter Myler, Peter J |
author_facet | Cadag, Eithon Tarczy-Hornoch, Peter Myler, Peter J |
author_sort | Cadag, Eithon |
collection | PubMed |
description | BACKGROUND: Methods of weakening and attenuating pathogens’ abilities to infect and propagate in a host, thus allowing the natural immune system to more easily decimate invaders, have gained attention as alternatives to broad-spectrum targeting approaches. The following work describes a technique to identifying proteins involved in virulence by relying on latent information computationally gathered across biological repositories, applicable to both generic and specific virulence categories. RESULTS: A lightweight method for data integration is used, which links information regarding a protein via a path-based query graph. A method of weighting is then applied to query graphs that can serve as input to various statistical classification methods for discrimination, and the combined usage of both data integration and learning methods are tested against the problem of both generalized and specific virulence function prediction. CONCLUSIONS: This approach improves coverage of functional data over a protein. Moreover, while depending largely on noisy and potentially non-curated data from public sources, we find it outperforms other techniques to identification of general virulence factors and baseline remote homology detection methods for specific virulence categories. |
format | Online Article Text |
id | pubmed-3560104 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-35601042013-02-04 Learning virulent proteins from integrated query networks Cadag, Eithon Tarczy-Hornoch, Peter Myler, Peter J BMC Bioinformatics Methodology Article BACKGROUND: Methods of weakening and attenuating pathogens’ abilities to infect and propagate in a host, thus allowing the natural immune system to more easily decimate invaders, have gained attention as alternatives to broad-spectrum targeting approaches. The following work describes a technique to identifying proteins involved in virulence by relying on latent information computationally gathered across biological repositories, applicable to both generic and specific virulence categories. RESULTS: A lightweight method for data integration is used, which links information regarding a protein via a path-based query graph. A method of weighting is then applied to query graphs that can serve as input to various statistical classification methods for discrimination, and the combined usage of both data integration and learning methods are tested against the problem of both generalized and specific virulence function prediction. CONCLUSIONS: This approach improves coverage of functional data over a protein. Moreover, while depending largely on noisy and potentially non-curated data from public sources, we find it outperforms other techniques to identification of general virulence factors and baseline remote homology detection methods for specific virulence categories. BioMed Central 2012-12-02 /pmc/articles/PMC3560104/ /pubmed/23198735 http://dx.doi.org/10.1186/1471-2105-13-321 Text en Copyright ©2012 Cadag et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Cadag, Eithon Tarczy-Hornoch, Peter Myler, Peter J Learning virulent proteins from integrated query networks |
title | Learning virulent proteins from integrated query networks |
title_full | Learning virulent proteins from integrated query networks |
title_fullStr | Learning virulent proteins from integrated query networks |
title_full_unstemmed | Learning virulent proteins from integrated query networks |
title_short | Learning virulent proteins from integrated query networks |
title_sort | learning virulent proteins from integrated query networks |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3560104/ https://www.ncbi.nlm.nih.gov/pubmed/23198735 http://dx.doi.org/10.1186/1471-2105-13-321 |
work_keys_str_mv | AT cadageithon learningvirulentproteinsfromintegratedquerynetworks AT tarczyhornochpeter learningvirulentproteinsfromintegratedquerynetworks AT mylerpeterj learningvirulentproteinsfromintegratedquerynetworks |