Cargando…

A novel strategy for classifying the output from an in silico vaccine discovery pipeline for eukaryotic pathogens using machine learning algorithms

BACKGROUND: An in silico vaccine discovery pipeline for eukaryotic pathogens typically consists of several computational tools to predict protein characteristics. The aim of the in silico approach to discovering subunit vaccines is to use predicted characteristics to identify proteins which are wort...

Descripción completa

Detalles Bibliográficos
Autores principales:	Goodswen, Stephen J, Kennedy, Paul J, Ellis, John T
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2013
Materias:	Methodology Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3826511/ https://www.ncbi.nlm.nih.gov/pubmed/24180526 http://dx.doi.org/10.1186/1471-2105-14-315

_version_	1782290918047481856
author	Goodswen, Stephen J Kennedy, Paul J Ellis, John T
author_facet	Goodswen, Stephen J Kennedy, Paul J Ellis, John T
author_sort	Goodswen, Stephen J
collection	PubMed
description	BACKGROUND: An in silico vaccine discovery pipeline for eukaryotic pathogens typically consists of several computational tools to predict protein characteristics. The aim of the in silico approach to discovering subunit vaccines is to use predicted characteristics to identify proteins which are worthy of laboratory investigation. A major challenge is that these predictions are inherent with hidden inaccuracies and contradictions. This study focuses on how to reduce the number of false candidates using machine learning algorithms rather than relying on expensive laboratory validation. Proteins from Toxoplasma gondii, Plasmodium sp., and Caenorhabditis elegans were used as training and test datasets. RESULTS: The results show that machine learning algorithms can effectively distinguish expected true from expected false vaccine candidates (with an average sensitivity and specificity of 0.97 and 0.98 respectively), for proteins observed to induce immune responses experimentally. CONCLUSIONS: Vaccine candidates from an in silico approach can only be truly validated in a laboratory. Given any in silico output and appropriate training data, the number of false candidates allocated for validation can be dramatically reduced using a pool of machine learning algorithms. This will ultimately save time and money in the laboratory.
format	Online Article Text
id	pubmed-3826511
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-38265112013-11-18 A novel strategy for classifying the output from an in silico vaccine discovery pipeline for eukaryotic pathogens using machine learning algorithms Goodswen, Stephen J Kennedy, Paul J Ellis, John T BMC Bioinformatics Methodology Article BACKGROUND: An in silico vaccine discovery pipeline for eukaryotic pathogens typically consists of several computational tools to predict protein characteristics. The aim of the in silico approach to discovering subunit vaccines is to use predicted characteristics to identify proteins which are worthy of laboratory investigation. A major challenge is that these predictions are inherent with hidden inaccuracies and contradictions. This study focuses on how to reduce the number of false candidates using machine learning algorithms rather than relying on expensive laboratory validation. Proteins from Toxoplasma gondii, Plasmodium sp., and Caenorhabditis elegans were used as training and test datasets. RESULTS: The results show that machine learning algorithms can effectively distinguish expected true from expected false vaccine candidates (with an average sensitivity and specificity of 0.97 and 0.98 respectively), for proteins observed to induce immune responses experimentally. CONCLUSIONS: Vaccine candidates from an in silico approach can only be truly validated in a laboratory. Given any in silico output and appropriate training data, the number of false candidates allocated for validation can be dramatically reduced using a pool of machine learning algorithms. This will ultimately save time and money in the laboratory. BioMed Central 2013-11-02 /pmc/articles/PMC3826511/ /pubmed/24180526 http://dx.doi.org/10.1186/1471-2105-14-315 Text en Copyright © 2013 Goodswen et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Methodology Article Goodswen, Stephen J Kennedy, Paul J Ellis, John T A novel strategy for classifying the output from an in silico vaccine discovery pipeline for eukaryotic pathogens using machine learning algorithms
title	A novel strategy for classifying the output from an in silico vaccine discovery pipeline for eukaryotic pathogens using machine learning algorithms
title_full	A novel strategy for classifying the output from an in silico vaccine discovery pipeline for eukaryotic pathogens using machine learning algorithms
title_fullStr	A novel strategy for classifying the output from an in silico vaccine discovery pipeline for eukaryotic pathogens using machine learning algorithms
title_full_unstemmed	A novel strategy for classifying the output from an in silico vaccine discovery pipeline for eukaryotic pathogens using machine learning algorithms
title_short	A novel strategy for classifying the output from an in silico vaccine discovery pipeline for eukaryotic pathogens using machine learning algorithms
title_sort	novel strategy for classifying the output from an in silico vaccine discovery pipeline for eukaryotic pathogens using machine learning algorithms
topic	Methodology Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3826511/ https://www.ncbi.nlm.nih.gov/pubmed/24180526 http://dx.doi.org/10.1186/1471-2105-14-315
work_keys_str_mv	AT goodswenstephenj anovelstrategyforclassifyingtheoutputfromaninsilicovaccinediscoverypipelineforeukaryoticpathogensusingmachinelearningalgorithms AT kennedypaulj anovelstrategyforclassifyingtheoutputfromaninsilicovaccinediscoverypipelineforeukaryoticpathogensusingmachinelearningalgorithms AT ellisjohnt anovelstrategyforclassifyingtheoutputfromaninsilicovaccinediscoverypipelineforeukaryoticpathogensusingmachinelearningalgorithms AT goodswenstephenj novelstrategyforclassifyingtheoutputfromaninsilicovaccinediscoverypipelineforeukaryoticpathogensusingmachinelearningalgorithms AT kennedypaulj novelstrategyforclassifyingtheoutputfromaninsilicovaccinediscoverypipelineforeukaryoticpathogensusingmachinelearningalgorithms AT ellisjohnt novelstrategyforclassifyingtheoutputfromaninsilicovaccinediscoverypipelineforeukaryoticpathogensusingmachinelearningalgorithms

A novel strategy for classifying the output from an in silico vaccine discovery pipeline for eukaryotic pathogens using machine learning algorithms

Ejemplares similares