Cargando…

Joint Evolutionary Trees: A Large-Scale Method To Predict Protein Interfaces Based on Sequence Sampling

The Joint Evolutionary Trees (JET) method detects protein interfaces, the core residues involved in the folding process, and residues susceptible to site-directed mutagenesis and relevant to molecular recognition. The approach, based on the Evolutionary Trace (ET) method, introduces a novel way to t...

Descripción completa

Detalles Bibliográficos
Autores principales:	Engelen, Stefan, Trojan, Ladislas A., Sacquin-Mora, Sophie, Lavery, Richard, Carbone, Alessandra
Formato:	Texto
Lenguaje:	English
Publicado:	Public Library of Science 2009
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2613531/ https://www.ncbi.nlm.nih.gov/pubmed/19165315 http://dx.doi.org/10.1371/journal.pcbi.1000267

_version_	1782163191464198144
author	Engelen, Stefan Trojan, Ladislas A. Sacquin-Mora, Sophie Lavery, Richard Carbone, Alessandra
author_facet	Engelen, Stefan Trojan, Ladislas A. Sacquin-Mora, Sophie Lavery, Richard Carbone, Alessandra
author_sort	Engelen, Stefan
collection	PubMed
description	The Joint Evolutionary Trees (JET) method detects protein interfaces, the core residues involved in the folding process, and residues susceptible to site-directed mutagenesis and relevant to molecular recognition. The approach, based on the Evolutionary Trace (ET) method, introduces a novel way to treat evolutionary information. Families of homologous sequences are analyzed through a Gibbs-like sampling of distance trees to reduce effects of erroneous multiple alignment and impacts of weakly homologous sequences on distance tree construction. The sampling method makes sequence analysis more sensitive to functional and structural importance of individual residues by avoiding effects of the overrepresentation of highly homologous sequences and improves computational efficiency. A carefully designed clustering method is parametrized on the target structure to detect and extend patches on protein surfaces into predicted interaction sites. Clustering takes into account residues' physical-chemical properties as well as conservation. Large-scale application of JET requires the system to be adjustable for different datasets and to guarantee predictions even if the signal is low. Flexibility was achieved by a careful treatment of the number of retrieved sequences, the amino acid distance between sequences, and the selective thresholds for cluster identification. An iterative version of JET (iJET) that guarantees finding the most likely interface residues is proposed as the appropriate tool for large-scale predictions. Tests are carried out on the Huang database of 62 heterodimer, homodimer, and transient complexes and on 265 interfaces belonging to signal transduction proteins, enzymes, inhibitors, antibodies, antigens, and others. A specific set of proteins chosen for their special functional and structural properties illustrate JET behavior on a large variety of interactions covering proteins, ligands, DNA, and RNA. JET is compared at a large scale to ET and to Consurf, Rate4Site, siteFiNDER\|3D, and SCORECONS on specific structures. A significant improvement in performance and computational efficiency is shown.
format	Text
id	pubmed-2613531
institution	National Center for Biotechnology Information
language	English
publishDate	2009
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-26135312009-01-23 Joint Evolutionary Trees: A Large-Scale Method To Predict Protein Interfaces Based on Sequence Sampling Engelen, Stefan Trojan, Ladislas A. Sacquin-Mora, Sophie Lavery, Richard Carbone, Alessandra PLoS Comput Biol Research Article The Joint Evolutionary Trees (JET) method detects protein interfaces, the core residues involved in the folding process, and residues susceptible to site-directed mutagenesis and relevant to molecular recognition. The approach, based on the Evolutionary Trace (ET) method, introduces a novel way to treat evolutionary information. Families of homologous sequences are analyzed through a Gibbs-like sampling of distance trees to reduce effects of erroneous multiple alignment and impacts of weakly homologous sequences on distance tree construction. The sampling method makes sequence analysis more sensitive to functional and structural importance of individual residues by avoiding effects of the overrepresentation of highly homologous sequences and improves computational efficiency. A carefully designed clustering method is parametrized on the target structure to detect and extend patches on protein surfaces into predicted interaction sites. Clustering takes into account residues' physical-chemical properties as well as conservation. Large-scale application of JET requires the system to be adjustable for different datasets and to guarantee predictions even if the signal is low. Flexibility was achieved by a careful treatment of the number of retrieved sequences, the amino acid distance between sequences, and the selective thresholds for cluster identification. An iterative version of JET (iJET) that guarantees finding the most likely interface residues is proposed as the appropriate tool for large-scale predictions. Tests are carried out on the Huang database of 62 heterodimer, homodimer, and transient complexes and on 265 interfaces belonging to signal transduction proteins, enzymes, inhibitors, antibodies, antigens, and others. A specific set of proteins chosen for their special functional and structural properties illustrate JET behavior on a large variety of interactions covering proteins, ligands, DNA, and RNA. JET is compared at a large scale to ET and to Consurf, Rate4Site, siteFiNDER\|3D, and SCORECONS on specific structures. A significant improvement in performance and computational efficiency is shown. Public Library of Science 2009-01-23 /pmc/articles/PMC2613531/ /pubmed/19165315 http://dx.doi.org/10.1371/journal.pcbi.1000267 Text en Engelen et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle	Research Article Engelen, Stefan Trojan, Ladislas A. Sacquin-Mora, Sophie Lavery, Richard Carbone, Alessandra Joint Evolutionary Trees: A Large-Scale Method To Predict Protein Interfaces Based on Sequence Sampling
title	Joint Evolutionary Trees: A Large-Scale Method To Predict Protein Interfaces Based on Sequence Sampling
title_full	Joint Evolutionary Trees: A Large-Scale Method To Predict Protein Interfaces Based on Sequence Sampling
title_fullStr	Joint Evolutionary Trees: A Large-Scale Method To Predict Protein Interfaces Based on Sequence Sampling
title_full_unstemmed	Joint Evolutionary Trees: A Large-Scale Method To Predict Protein Interfaces Based on Sequence Sampling
title_short	Joint Evolutionary Trees: A Large-Scale Method To Predict Protein Interfaces Based on Sequence Sampling
title_sort	joint evolutionary trees: a large-scale method to predict protein interfaces based on sequence sampling
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2613531/ https://www.ncbi.nlm.nih.gov/pubmed/19165315 http://dx.doi.org/10.1371/journal.pcbi.1000267
work_keys_str_mv	AT engelenstefan jointevolutionarytreesalargescalemethodtopredictproteininterfacesbasedonsequencesampling AT trojanladislasa jointevolutionarytreesalargescalemethodtopredictproteininterfacesbasedonsequencesampling AT sacquinmorasophie jointevolutionarytreesalargescalemethodtopredictproteininterfacesbasedonsequencesampling AT laveryrichard jointevolutionarytreesalargescalemethodtopredictproteininterfacesbasedonsequencesampling AT carbonealessandra jointevolutionarytreesalargescalemethodtopredictproteininterfacesbasedonsequencesampling

Joint Evolutionary Trees: A Large-Scale Method To Predict Protein Interfaces Based on Sequence Sampling

Ejemplares similares