Cargando…
FunFOLDQA: A Quality Assessment Tool for Protein-Ligand Binding Site Residue Predictions
The estimation of prediction quality is important because without quality measures, it is difficult to determine the usefulness of a prediction. Currently, methods for ligand binding site residue predictions are assessed in the function prediction category of the biennial Critical Assessment of Tech...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3364224/ https://www.ncbi.nlm.nih.gov/pubmed/22666491 http://dx.doi.org/10.1371/journal.pone.0038219 |
_version_ | 1782234508919046144 |
---|---|
author | Roche, Daniel B. Buenavista, Maria T. McGuffin, Liam J. |
author_facet | Roche, Daniel B. Buenavista, Maria T. McGuffin, Liam J. |
author_sort | Roche, Daniel B. |
collection | PubMed |
description | The estimation of prediction quality is important because without quality measures, it is difficult to determine the usefulness of a prediction. Currently, methods for ligand binding site residue predictions are assessed in the function prediction category of the biennial Critical Assessment of Techniques for Protein Structure Prediction (CASP) experiment, utilizing the Matthews Correlation Coefficient (MCC) and Binding-site Distance Test (BDT) metrics. However, the assessment of ligand binding site predictions using such metrics requires the availability of solved structures with bound ligands. Thus, we have developed a ligand binding site quality assessment tool, FunFOLDQA, which utilizes protein feature analysis to predict ligand binding site quality prior to the experimental solution of the protein structures and their ligand interactions. The FunFOLDQA feature scores were combined using: simple linear combinations, multiple linear regression and a neural network. The neural network produced significantly better results for correlations to both the MCC and BDT scores, according to Kendall’s τ, Spearman’s ρ and Pearson’s r correlation coefficients, when tested on both the CASP8 and CASP9 datasets. The neural network also produced the largest Area Under the Curve score (AUC) when Receiver Operator Characteristic (ROC) analysis was undertaken for the CASP8 dataset. Furthermore, the FunFOLDQA algorithm incorporating the neural network, is shown to add value to FunFOLD, when both methods are employed in combination. This results in a statistically significant improvement over all of the best server methods, the FunFOLD method (6.43%), and one of the top manual groups (FN293) tested on the CASP8 dataset. The FunFOLDQA method was also found to be competitive with the top server methods when tested on the CASP9 dataset. To the best of our knowledge, FunFOLDQA is the first attempt to develop a method that can be used to assess ligand binding site prediction quality, in the absence of experimental data. |
format | Online Article Text |
id | pubmed-3364224 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-33642242012-06-04 FunFOLDQA: A Quality Assessment Tool for Protein-Ligand Binding Site Residue Predictions Roche, Daniel B. Buenavista, Maria T. McGuffin, Liam J. PLoS One Research Article The estimation of prediction quality is important because without quality measures, it is difficult to determine the usefulness of a prediction. Currently, methods for ligand binding site residue predictions are assessed in the function prediction category of the biennial Critical Assessment of Techniques for Protein Structure Prediction (CASP) experiment, utilizing the Matthews Correlation Coefficient (MCC) and Binding-site Distance Test (BDT) metrics. However, the assessment of ligand binding site predictions using such metrics requires the availability of solved structures with bound ligands. Thus, we have developed a ligand binding site quality assessment tool, FunFOLDQA, which utilizes protein feature analysis to predict ligand binding site quality prior to the experimental solution of the protein structures and their ligand interactions. The FunFOLDQA feature scores were combined using: simple linear combinations, multiple linear regression and a neural network. The neural network produced significantly better results for correlations to both the MCC and BDT scores, according to Kendall’s τ, Spearman’s ρ and Pearson’s r correlation coefficients, when tested on both the CASP8 and CASP9 datasets. The neural network also produced the largest Area Under the Curve score (AUC) when Receiver Operator Characteristic (ROC) analysis was undertaken for the CASP8 dataset. Furthermore, the FunFOLDQA algorithm incorporating the neural network, is shown to add value to FunFOLD, when both methods are employed in combination. This results in a statistically significant improvement over all of the best server methods, the FunFOLD method (6.43%), and one of the top manual groups (FN293) tested on the CASP8 dataset. The FunFOLDQA method was also found to be competitive with the top server methods when tested on the CASP9 dataset. To the best of our knowledge, FunFOLDQA is the first attempt to develop a method that can be used to assess ligand binding site prediction quality, in the absence of experimental data. Public Library of Science 2012-05-30 /pmc/articles/PMC3364224/ /pubmed/22666491 http://dx.doi.org/10.1371/journal.pone.0038219 Text en Roche et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Roche, Daniel B. Buenavista, Maria T. McGuffin, Liam J. FunFOLDQA: A Quality Assessment Tool for Protein-Ligand Binding Site Residue Predictions |
title | FunFOLDQA: A Quality Assessment Tool for Protein-Ligand Binding Site Residue Predictions |
title_full | FunFOLDQA: A Quality Assessment Tool for Protein-Ligand Binding Site Residue Predictions |
title_fullStr | FunFOLDQA: A Quality Assessment Tool for Protein-Ligand Binding Site Residue Predictions |
title_full_unstemmed | FunFOLDQA: A Quality Assessment Tool for Protein-Ligand Binding Site Residue Predictions |
title_short | FunFOLDQA: A Quality Assessment Tool for Protein-Ligand Binding Site Residue Predictions |
title_sort | funfoldqa: a quality assessment tool for protein-ligand binding site residue predictions |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3364224/ https://www.ncbi.nlm.nih.gov/pubmed/22666491 http://dx.doi.org/10.1371/journal.pone.0038219 |
work_keys_str_mv | AT rochedanielb funfoldqaaqualityassessmenttoolforproteinligandbindingsiteresiduepredictions AT buenavistamariat funfoldqaaqualityassessmenttoolforproteinligandbindingsiteresiduepredictions AT mcguffinliamj funfoldqaaqualityassessmenttoolforproteinligandbindingsiteresiduepredictions |