Cargando…

Detailed estimation of bioinformatics prediction reliability through the Fragmented Prediction Performance Plots

BACKGROUND: An important and yet rather neglected question related to bioinformatics predictions is the estimation of the amount of data that is needed to allow reliable predictions. Bioinformatics predictions are usually validated through a series of figures of merit, like for example sensitivity a...

Descripción completa

Detalles Bibliográficos
Autor principal:	Carugo, Oliviero
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2007
Materias:	Methodology Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2148069/ https://www.ncbi.nlm.nih.gov/pubmed/17931407 http://dx.doi.org/10.1186/1471-2105-8-380

_version_	1782144501391818752
author	Carugo, Oliviero
author_facet	Carugo, Oliviero
author_sort	Carugo, Oliviero
collection	PubMed
description	BACKGROUND: An important and yet rather neglected question related to bioinformatics predictions is the estimation of the amount of data that is needed to allow reliable predictions. Bioinformatics predictions are usually validated through a series of figures of merit, like for example sensitivity and precision, and little attention is paid to the fact that their performance may depend on the amount of data used to make the predictions themselves. RESULTS: Here I describe a tool, named Fragmented Prediction Performance Plot (FPPP), which monitors the relationship between the prediction reliability and the amount of information underling the prediction themselves. Three examples of FPPPs are presented to illustrate their principal features. In one example, the reliability becomes independent, over a certain threshold, of the amount of data used to predict protein features and the intrinsic reliability of the predictor can be estimated. In the other two cases, on the contrary, the reliability strongly depends on the amount of data used to make the predictions and, thus, the intrinsic reliability of the two predictors cannot be determined. Only in the first example it is thus possible to fully quantify the prediction performance. CONCLUSION: It is thus highly advisable to use FPPPs to determine the performance of any new bioinformatics prediction protocol, in order to fully quantify its prediction power and to allow comparisons between two or more predictors based on different types of data.
format	Text
id	pubmed-2148069
institution	National Center for Biotechnology Information
language	English
publishDate	2007
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-21480692007-12-20 Detailed estimation of bioinformatics prediction reliability through the Fragmented Prediction Performance Plots Carugo, Oliviero BMC Bioinformatics Methodology Article BACKGROUND: An important and yet rather neglected question related to bioinformatics predictions is the estimation of the amount of data that is needed to allow reliable predictions. Bioinformatics predictions are usually validated through a series of figures of merit, like for example sensitivity and precision, and little attention is paid to the fact that their performance may depend on the amount of data used to make the predictions themselves. RESULTS: Here I describe a tool, named Fragmented Prediction Performance Plot (FPPP), which monitors the relationship between the prediction reliability and the amount of information underling the prediction themselves. Three examples of FPPPs are presented to illustrate their principal features. In one example, the reliability becomes independent, over a certain threshold, of the amount of data used to predict protein features and the intrinsic reliability of the predictor can be estimated. In the other two cases, on the contrary, the reliability strongly depends on the amount of data used to make the predictions and, thus, the intrinsic reliability of the two predictors cannot be determined. Only in the first example it is thus possible to fully quantify the prediction performance. CONCLUSION: It is thus highly advisable to use FPPPs to determine the performance of any new bioinformatics prediction protocol, in order to fully quantify its prediction power and to allow comparisons between two or more predictors based on different types of data. BioMed Central 2007-10-11 /pmc/articles/PMC2148069/ /pubmed/17931407 http://dx.doi.org/10.1186/1471-2105-8-380 Text en Copyright © 2007 Carugo; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Methodology Article Carugo, Oliviero Detailed estimation of bioinformatics prediction reliability through the Fragmented Prediction Performance Plots
title	Detailed estimation of bioinformatics prediction reliability through the Fragmented Prediction Performance Plots
title_full	Detailed estimation of bioinformatics prediction reliability through the Fragmented Prediction Performance Plots
title_fullStr	Detailed estimation of bioinformatics prediction reliability through the Fragmented Prediction Performance Plots
title_full_unstemmed	Detailed estimation of bioinformatics prediction reliability through the Fragmented Prediction Performance Plots
title_short	Detailed estimation of bioinformatics prediction reliability through the Fragmented Prediction Performance Plots
title_sort	detailed estimation of bioinformatics prediction reliability through the fragmented prediction performance plots
topic	Methodology Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2148069/ https://www.ncbi.nlm.nih.gov/pubmed/17931407 http://dx.doi.org/10.1186/1471-2105-8-380
work_keys_str_mv	AT carugooliviero detailedestimationofbioinformaticspredictionreliabilitythroughthefragmentedpredictionperformanceplots

Detailed estimation of bioinformatics prediction reliability through the Fragmented Prediction Performance Plots

Ejemplares similares