Cargando…

Introducing a Clustering Step in a Consensus Approach for the Scoring of Protein-Protein Docking Models

Correctly scoring protein-protein docking models to single out native-like ones is an open challenge. It is also an object of assessment in CAPRI (Critical Assessment of PRedicted Interactions), the community-wide blind docking experiment. We introduced in the field the first pure consensus method,...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chermak, Edrisse, De Donato, Renato, Lensink, Marc F., Petta, Andrea, Serra, Luigi, Scarano, Vittorio, Cavallo, Luigi, Oliva, Romina
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2016
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5112798/ https://www.ncbi.nlm.nih.gov/pubmed/27846259 http://dx.doi.org/10.1371/journal.pone.0166460

_version_	1782468076029083648
author	Chermak, Edrisse De Donato, Renato Lensink, Marc F. Petta, Andrea Serra, Luigi Scarano, Vittorio Cavallo, Luigi Oliva, Romina
author_facet	Chermak, Edrisse De Donato, Renato Lensink, Marc F. Petta, Andrea Serra, Luigi Scarano, Vittorio Cavallo, Luigi Oliva, Romina
author_sort	Chermak, Edrisse
collection	PubMed
description	Correctly scoring protein-protein docking models to single out native-like ones is an open challenge. It is also an object of assessment in CAPRI (Critical Assessment of PRedicted Interactions), the community-wide blind docking experiment. We introduced in the field the first pure consensus method, CONSRANK, which ranks models based on their ability to match the most conserved contacts in the ensemble they belong to. In CAPRI, scorers are asked to evaluate a set of available models and select the top ten ones, based on their own scoring approach. Scorers’ performance is ranked based on the number of targets/interfaces for which they could provide at least one correct solution. In such terms, blind testing in CAPRI Round 30 (a joint prediction round with CASP11) has shown that critical cases for CONSRANK are represented by targets showing multiple interfaces or for which only a very small number of correct solutions are available. To address these challenging cases, CONSRANK has now been modified to include a contact-based clustering of the models as a preliminary step of the scoring process. We used an agglomerative hierarchical clustering based on the number of common inter-residue contacts within the models. Two criteria, with different thresholds, were explored in the cluster generation, setting either the number of common contacts or of total clusters. For each clustering approach, after selecting the top (most populated) ten clusters, CONSRANK was run on these clusters and the top-ranked model for each cluster was selected, in the limit of 10 models per target. We have applied our modified scoring approach, Clust-CONSRANK, to SCORE_SET, a set of CAPRI scoring models made recently available by CAPRI assessors, and to the subset of homodimeric targets in CAPRI Round 30 for which CONSRANK failed to include a correct solution within the ten selected models. Results show that, for the challenging cases, the clustering step typically enriches the ten top ranked models in native-like solutions. The best performing clustering approaches we tested indeed lead to more than double the number of cases for which at least one correct solution can be included within the top ten ranked models.
format	Online Article Text
id	pubmed-5112798
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-51127982016-12-08 Introducing a Clustering Step in a Consensus Approach for the Scoring of Protein-Protein Docking Models Chermak, Edrisse De Donato, Renato Lensink, Marc F. Petta, Andrea Serra, Luigi Scarano, Vittorio Cavallo, Luigi Oliva, Romina PLoS One Research Article Correctly scoring protein-protein docking models to single out native-like ones is an open challenge. It is also an object of assessment in CAPRI (Critical Assessment of PRedicted Interactions), the community-wide blind docking experiment. We introduced in the field the first pure consensus method, CONSRANK, which ranks models based on their ability to match the most conserved contacts in the ensemble they belong to. In CAPRI, scorers are asked to evaluate a set of available models and select the top ten ones, based on their own scoring approach. Scorers’ performance is ranked based on the number of targets/interfaces for which they could provide at least one correct solution. In such terms, blind testing in CAPRI Round 30 (a joint prediction round with CASP11) has shown that critical cases for CONSRANK are represented by targets showing multiple interfaces or for which only a very small number of correct solutions are available. To address these challenging cases, CONSRANK has now been modified to include a contact-based clustering of the models as a preliminary step of the scoring process. We used an agglomerative hierarchical clustering based on the number of common inter-residue contacts within the models. Two criteria, with different thresholds, were explored in the cluster generation, setting either the number of common contacts or of total clusters. For each clustering approach, after selecting the top (most populated) ten clusters, CONSRANK was run on these clusters and the top-ranked model for each cluster was selected, in the limit of 10 models per target. We have applied our modified scoring approach, Clust-CONSRANK, to SCORE_SET, a set of CAPRI scoring models made recently available by CAPRI assessors, and to the subset of homodimeric targets in CAPRI Round 30 for which CONSRANK failed to include a correct solution within the ten selected models. Results show that, for the challenging cases, the clustering step typically enriches the ten top ranked models in native-like solutions. The best performing clustering approaches we tested indeed lead to more than double the number of cases for which at least one correct solution can be included within the top ten ranked models. Public Library of Science 2016-11-15 /pmc/articles/PMC5112798/ /pubmed/27846259 http://dx.doi.org/10.1371/journal.pone.0166460 Text en © 2016 Chermak et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Chermak, Edrisse De Donato, Renato Lensink, Marc F. Petta, Andrea Serra, Luigi Scarano, Vittorio Cavallo, Luigi Oliva, Romina Introducing a Clustering Step in a Consensus Approach for the Scoring of Protein-Protein Docking Models
title	Introducing a Clustering Step in a Consensus Approach for the Scoring of Protein-Protein Docking Models
title_full	Introducing a Clustering Step in a Consensus Approach for the Scoring of Protein-Protein Docking Models
title_fullStr	Introducing a Clustering Step in a Consensus Approach for the Scoring of Protein-Protein Docking Models
title_full_unstemmed	Introducing a Clustering Step in a Consensus Approach for the Scoring of Protein-Protein Docking Models
title_short	Introducing a Clustering Step in a Consensus Approach for the Scoring of Protein-Protein Docking Models
title_sort	introducing a clustering step in a consensus approach for the scoring of protein-protein docking models
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5112798/ https://www.ncbi.nlm.nih.gov/pubmed/27846259 http://dx.doi.org/10.1371/journal.pone.0166460
work_keys_str_mv	AT chermakedrisse introducingaclusteringstepinaconsensusapproachforthescoringofproteinproteindockingmodels AT dedonatorenato introducingaclusteringstepinaconsensusapproachforthescoringofproteinproteindockingmodels AT lensinkmarcf introducingaclusteringstepinaconsensusapproachforthescoringofproteinproteindockingmodels AT pettaandrea introducingaclusteringstepinaconsensusapproachforthescoringofproteinproteindockingmodels AT serraluigi introducingaclusteringstepinaconsensusapproachforthescoringofproteinproteindockingmodels AT scaranovittorio introducingaclusteringstepinaconsensusapproachforthescoringofproteinproteindockingmodels AT cavalloluigi introducingaclusteringstepinaconsensusapproachforthescoringofproteinproteindockingmodels AT olivaromina introducingaclusteringstepinaconsensusapproachforthescoringofproteinproteindockingmodels

Introducing a Clustering Step in a Consensus Approach for the Scoring of Protein-Protein Docking Models

Ejemplares similares