Cargando…

Learning virulent proteins from integrated query networks

BACKGROUND: Methods of weakening and attenuating pathogens’ abilities to infect and propagate in a host, thus allowing the natural immune system to more easily decimate invaders, have gained attention as alternatives to broad-spectrum targeting approaches. The following work describes a technique to...

Descripción completa

Detalles Bibliográficos
Autores principales: Cadag, Eithon, Tarczy-Hornoch, Peter, Myler, Peter J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3560104/
https://www.ncbi.nlm.nih.gov/pubmed/23198735
http://dx.doi.org/10.1186/1471-2105-13-321
_version_ 1782257732629299200
author Cadag, Eithon
Tarczy-Hornoch, Peter
Myler, Peter J
author_facet Cadag, Eithon
Tarczy-Hornoch, Peter
Myler, Peter J
author_sort Cadag, Eithon
collection PubMed
description BACKGROUND: Methods of weakening and attenuating pathogens’ abilities to infect and propagate in a host, thus allowing the natural immune system to more easily decimate invaders, have gained attention as alternatives to broad-spectrum targeting approaches. The following work describes a technique to identifying proteins involved in virulence by relying on latent information computationally gathered across biological repositories, applicable to both generic and specific virulence categories. RESULTS: A lightweight method for data integration is used, which links information regarding a protein via a path-based query graph. A method of weighting is then applied to query graphs that can serve as input to various statistical classification methods for discrimination, and the combined usage of both data integration and learning methods are tested against the problem of both generalized and specific virulence function prediction. CONCLUSIONS: This approach improves coverage of functional data over a protein. Moreover, while depending largely on noisy and potentially non-curated data from public sources, we find it outperforms other techniques to identification of general virulence factors and baseline remote homology detection methods for specific virulence categories.
format Online
Article
Text
id pubmed-3560104
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35601042013-02-04 Learning virulent proteins from integrated query networks Cadag, Eithon Tarczy-Hornoch, Peter Myler, Peter J BMC Bioinformatics Methodology Article BACKGROUND: Methods of weakening and attenuating pathogens’ abilities to infect and propagate in a host, thus allowing the natural immune system to more easily decimate invaders, have gained attention as alternatives to broad-spectrum targeting approaches. The following work describes a technique to identifying proteins involved in virulence by relying on latent information computationally gathered across biological repositories, applicable to both generic and specific virulence categories. RESULTS: A lightweight method for data integration is used, which links information regarding a protein via a path-based query graph. A method of weighting is then applied to query graphs that can serve as input to various statistical classification methods for discrimination, and the combined usage of both data integration and learning methods are tested against the problem of both generalized and specific virulence function prediction. CONCLUSIONS: This approach improves coverage of functional data over a protein. Moreover, while depending largely on noisy and potentially non-curated data from public sources, we find it outperforms other techniques to identification of general virulence factors and baseline remote homology detection methods for specific virulence categories. BioMed Central 2012-12-02 /pmc/articles/PMC3560104/ /pubmed/23198735 http://dx.doi.org/10.1186/1471-2105-13-321 Text en Copyright ©2012 Cadag et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Cadag, Eithon
Tarczy-Hornoch, Peter
Myler, Peter J
Learning virulent proteins from integrated query networks
title Learning virulent proteins from integrated query networks
title_full Learning virulent proteins from integrated query networks
title_fullStr Learning virulent proteins from integrated query networks
title_full_unstemmed Learning virulent proteins from integrated query networks
title_short Learning virulent proteins from integrated query networks
title_sort learning virulent proteins from integrated query networks
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3560104/
https://www.ncbi.nlm.nih.gov/pubmed/23198735
http://dx.doi.org/10.1186/1471-2105-13-321
work_keys_str_mv AT cadageithon learningvirulentproteinsfromintegratedquerynetworks
AT tarczyhornochpeter learningvirulentproteinsfromintegratedquerynetworks
AT mylerpeterj learningvirulentproteinsfromintegratedquerynetworks