Cargando…

In-depth performance evaluation of PFP and ESG sequence-based function prediction methods in CAFA 2011 experiment

BACKGROUND: Many Automatic Function Prediction (AFP) methods were developed to cope with an increasing growth of the number of gene sequences that are available from high throughput sequencing experiments. To support the development of AFP methods, it is essential to have community wide experiments...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chitale, Meghana, Khan, Ishita K, Kihara, Daisuke
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2013
Materias:	Proceedings
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3584938/ https://www.ncbi.nlm.nih.gov/pubmed/23514353 http://dx.doi.org/10.1186/1471-2105-14-S3-S2

_version_	1782261081967689728
author	Chitale, Meghana Khan, Ishita K Kihara, Daisuke
author_facet	Chitale, Meghana Khan, Ishita K Kihara, Daisuke
author_sort	Chitale, Meghana
collection	PubMed
description	BACKGROUND: Many Automatic Function Prediction (AFP) methods were developed to cope with an increasing growth of the number of gene sequences that are available from high throughput sequencing experiments. To support the development of AFP methods, it is essential to have community wide experiments for evaluating performance of existing AFP methods. Critical Assessment of Function Annotation (CAFA) is one such community experiment. The meeting of CAFA was held as a Special Interest Group (SIG) meeting at the Intelligent Systems in Molecular Biology (ISMB) conference in 2011. Here, we perform a detailed analysis of two sequence-based function prediction methods, PFP and ESG, which were developed in our lab, using the predictions submitted to CAFA. RESULTS: We evaluate PFP and ESG using four different measures in comparison with BLAST, Prior, and GOtcha. In addition to the predictions submitted to CAFA, we further investigate performance of a different scoring function to rank order predictions by PFP as well as PFP/ESG predictions enriched with Priors that simply adds frequently occurring Gene Ontology terms as a part of predictions. Prediction accuracies of each method were also evaluated separately for different functional categories. Successful and unsuccessful predictions by PFP and ESG are also discussed in comparison with BLAST. CONCLUSION: The in-depth analysis discussed here will complement the overall assessment by the CAFA organizers. Since PFP and ESG are based on sequence database search results, our analyses are not only useful for PFP and ESG users but will also shed light on the relationship of the sequence similarity space and functions that can be inferred from the sequences.
format	Online Article Text
id	pubmed-3584938
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-35849382013-03-11 In-depth performance evaluation of PFP and ESG sequence-based function prediction methods in CAFA 2011 experiment Chitale, Meghana Khan, Ishita K Kihara, Daisuke BMC Bioinformatics Proceedings BACKGROUND: Many Automatic Function Prediction (AFP) methods were developed to cope with an increasing growth of the number of gene sequences that are available from high throughput sequencing experiments. To support the development of AFP methods, it is essential to have community wide experiments for evaluating performance of existing AFP methods. Critical Assessment of Function Annotation (CAFA) is one such community experiment. The meeting of CAFA was held as a Special Interest Group (SIG) meeting at the Intelligent Systems in Molecular Biology (ISMB) conference in 2011. Here, we perform a detailed analysis of two sequence-based function prediction methods, PFP and ESG, which were developed in our lab, using the predictions submitted to CAFA. RESULTS: We evaluate PFP and ESG using four different measures in comparison with BLAST, Prior, and GOtcha. In addition to the predictions submitted to CAFA, we further investigate performance of a different scoring function to rank order predictions by PFP as well as PFP/ESG predictions enriched with Priors that simply adds frequently occurring Gene Ontology terms as a part of predictions. Prediction accuracies of each method were also evaluated separately for different functional categories. Successful and unsuccessful predictions by PFP and ESG are also discussed in comparison with BLAST. CONCLUSION: The in-depth analysis discussed here will complement the overall assessment by the CAFA organizers. Since PFP and ESG are based on sequence database search results, our analyses are not only useful for PFP and ESG users but will also shed light on the relationship of the sequence similarity space and functions that can be inferred from the sequences. BioMed Central 2013-02-28 /pmc/articles/PMC3584938/ /pubmed/23514353 http://dx.doi.org/10.1186/1471-2105-14-S3-S2 Text en Copyright ©2013 Chitale et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Proceedings Chitale, Meghana Khan, Ishita K Kihara, Daisuke In-depth performance evaluation of PFP and ESG sequence-based function prediction methods in CAFA 2011 experiment
title	In-depth performance evaluation of PFP and ESG sequence-based function prediction methods in CAFA 2011 experiment
title_full	In-depth performance evaluation of PFP and ESG sequence-based function prediction methods in CAFA 2011 experiment
title_fullStr	In-depth performance evaluation of PFP and ESG sequence-based function prediction methods in CAFA 2011 experiment
title_full_unstemmed	In-depth performance evaluation of PFP and ESG sequence-based function prediction methods in CAFA 2011 experiment
title_short	In-depth performance evaluation of PFP and ESG sequence-based function prediction methods in CAFA 2011 experiment
title_sort	in-depth performance evaluation of pfp and esg sequence-based function prediction methods in cafa 2011 experiment
topic	Proceedings
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3584938/ https://www.ncbi.nlm.nih.gov/pubmed/23514353 http://dx.doi.org/10.1186/1471-2105-14-S3-S2
work_keys_str_mv	AT chitalemeghana indepthperformanceevaluationofpfpandesgsequencebasedfunctionpredictionmethodsincafa2011experiment AT khanishitak indepthperformanceevaluationofpfpandesgsequencebasedfunctionpredictionmethodsincafa2011experiment AT kiharadaisuke indepthperformanceevaluationofpfpandesgsequencebasedfunctionpredictionmethodsincafa2011experiment

In-depth performance evaluation of PFP and ESG sequence-based function prediction methods in CAFA 2011 experiment

Ejemplares similares