Cargando…

FunFOLDQA: A Quality Assessment Tool for Protein-Ligand Binding Site Residue Predictions

The estimation of prediction quality is important because without quality measures, it is difficult to determine the usefulness of a prediction. Currently, methods for ligand binding site residue predictions are assessed in the function prediction category of the biennial Critical Assessment of Tech...

Descripción completa

Detalles Bibliográficos
Autores principales: Roche, Daniel B., Buenavista, Maria T., McGuffin, Liam J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3364224/
https://www.ncbi.nlm.nih.gov/pubmed/22666491
http://dx.doi.org/10.1371/journal.pone.0038219
_version_ 1782234508919046144
author Roche, Daniel B.
Buenavista, Maria T.
McGuffin, Liam J.
author_facet Roche, Daniel B.
Buenavista, Maria T.
McGuffin, Liam J.
author_sort Roche, Daniel B.
collection PubMed
description The estimation of prediction quality is important because without quality measures, it is difficult to determine the usefulness of a prediction. Currently, methods for ligand binding site residue predictions are assessed in the function prediction category of the biennial Critical Assessment of Techniques for Protein Structure Prediction (CASP) experiment, utilizing the Matthews Correlation Coefficient (MCC) and Binding-site Distance Test (BDT) metrics. However, the assessment of ligand binding site predictions using such metrics requires the availability of solved structures with bound ligands. Thus, we have developed a ligand binding site quality assessment tool, FunFOLDQA, which utilizes protein feature analysis to predict ligand binding site quality prior to the experimental solution of the protein structures and their ligand interactions. The FunFOLDQA feature scores were combined using: simple linear combinations, multiple linear regression and a neural network. The neural network produced significantly better results for correlations to both the MCC and BDT scores, according to Kendall’s τ, Spearman’s ρ and Pearson’s r correlation coefficients, when tested on both the CASP8 and CASP9 datasets. The neural network also produced the largest Area Under the Curve score (AUC) when Receiver Operator Characteristic (ROC) analysis was undertaken for the CASP8 dataset. Furthermore, the FunFOLDQA algorithm incorporating the neural network, is shown to add value to FunFOLD, when both methods are employed in combination. This results in a statistically significant improvement over all of the best server methods, the FunFOLD method (6.43%), and one of the top manual groups (FN293) tested on the CASP8 dataset. The FunFOLDQA method was also found to be competitive with the top server methods when tested on the CASP9 dataset. To the best of our knowledge, FunFOLDQA is the first attempt to develop a method that can be used to assess ligand binding site prediction quality, in the absence of experimental data.
format Online
Article
Text
id pubmed-3364224
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-33642242012-06-04 FunFOLDQA: A Quality Assessment Tool for Protein-Ligand Binding Site Residue Predictions Roche, Daniel B. Buenavista, Maria T. McGuffin, Liam J. PLoS One Research Article The estimation of prediction quality is important because without quality measures, it is difficult to determine the usefulness of a prediction. Currently, methods for ligand binding site residue predictions are assessed in the function prediction category of the biennial Critical Assessment of Techniques for Protein Structure Prediction (CASP) experiment, utilizing the Matthews Correlation Coefficient (MCC) and Binding-site Distance Test (BDT) metrics. However, the assessment of ligand binding site predictions using such metrics requires the availability of solved structures with bound ligands. Thus, we have developed a ligand binding site quality assessment tool, FunFOLDQA, which utilizes protein feature analysis to predict ligand binding site quality prior to the experimental solution of the protein structures and their ligand interactions. The FunFOLDQA feature scores were combined using: simple linear combinations, multiple linear regression and a neural network. The neural network produced significantly better results for correlations to both the MCC and BDT scores, according to Kendall’s τ, Spearman’s ρ and Pearson’s r correlation coefficients, when tested on both the CASP8 and CASP9 datasets. The neural network also produced the largest Area Under the Curve score (AUC) when Receiver Operator Characteristic (ROC) analysis was undertaken for the CASP8 dataset. Furthermore, the FunFOLDQA algorithm incorporating the neural network, is shown to add value to FunFOLD, when both methods are employed in combination. This results in a statistically significant improvement over all of the best server methods, the FunFOLD method (6.43%), and one of the top manual groups (FN293) tested on the CASP8 dataset. The FunFOLDQA method was also found to be competitive with the top server methods when tested on the CASP9 dataset. To the best of our knowledge, FunFOLDQA is the first attempt to develop a method that can be used to assess ligand binding site prediction quality, in the absence of experimental data. Public Library of Science 2012-05-30 /pmc/articles/PMC3364224/ /pubmed/22666491 http://dx.doi.org/10.1371/journal.pone.0038219 Text en Roche et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Roche, Daniel B.
Buenavista, Maria T.
McGuffin, Liam J.
FunFOLDQA: A Quality Assessment Tool for Protein-Ligand Binding Site Residue Predictions
title FunFOLDQA: A Quality Assessment Tool for Protein-Ligand Binding Site Residue Predictions
title_full FunFOLDQA: A Quality Assessment Tool for Protein-Ligand Binding Site Residue Predictions
title_fullStr FunFOLDQA: A Quality Assessment Tool for Protein-Ligand Binding Site Residue Predictions
title_full_unstemmed FunFOLDQA: A Quality Assessment Tool for Protein-Ligand Binding Site Residue Predictions
title_short FunFOLDQA: A Quality Assessment Tool for Protein-Ligand Binding Site Residue Predictions
title_sort funfoldqa: a quality assessment tool for protein-ligand binding site residue predictions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3364224/
https://www.ncbi.nlm.nih.gov/pubmed/22666491
http://dx.doi.org/10.1371/journal.pone.0038219
work_keys_str_mv AT rochedanielb funfoldqaaqualityassessmenttoolforproteinligandbindingsiteresiduepredictions
AT buenavistamariat funfoldqaaqualityassessmenttoolforproteinligandbindingsiteresiduepredictions
AT mcguffinliamj funfoldqaaqualityassessmenttoolforproteinligandbindingsiteresiduepredictions