Cargando…

Improving protein-protein interaction prediction using evolutionary information from low-quality MSAs

Evolutionary information stored in multiple sequence alignments (MSAs) has been used to identify the interaction interface of protein complexes, by measuring either co-conservation or co-mutation of amino acid residues across the interface. Recently, maximum entropy related correlated mutation measu...

Descripción completa

Detalles Bibliográficos
Autores principales: Várnai, Csilla, Burkoff, Nikolas S., Wild, David L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5293240/
https://www.ncbi.nlm.nih.gov/pubmed/28166227
http://dx.doi.org/10.1371/journal.pone.0169356
_version_ 1782505046150217728
author Várnai, Csilla
Burkoff, Nikolas S.
Wild, David L.
author_facet Várnai, Csilla
Burkoff, Nikolas S.
Wild, David L.
author_sort Várnai, Csilla
collection PubMed
description Evolutionary information stored in multiple sequence alignments (MSAs) has been used to identify the interaction interface of protein complexes, by measuring either co-conservation or co-mutation of amino acid residues across the interface. Recently, maximum entropy related correlated mutation measures (CMMs) such as direct information, decoupling direct from indirect interactions, have been developed to identify residue pairs interacting across the protein complex interface. These studies have focussed on carefully selected protein complexes with large, good-quality MSAs. In this work, we study protein complexes with a more typical MSA consisting of fewer than 400 sequences, using a set of 79 intramolecular protein complexes. Using a maximum entropy based CMM at the residue level, we develop an interface level CMM score to be used in re-ranking docking decoys. We demonstrate that our interface level CMM score compares favourably to the complementarity trace score, an evolutionary information-based score measuring co-conservation, when combined with the number of interface residues, a knowledge-based potential and the variability score of individual amino acid sites. We also demonstrate, that, since co-mutation and co-complementarity in the MSA contain orthogonal information, the best prediction performance using evolutionary information can be achieved by combining the co-mutation information of the CMM with co-conservation information of a complementarity trace score, predicting a near-native structure as the top prediction for 41% of the dataset. The method presented is not restricted to small MSAs, and will likely improve interface prediction also for complexes with large and good-quality MSAs.
format Online
Article
Text
id pubmed-5293240
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-52932402017-02-17 Improving protein-protein interaction prediction using evolutionary information from low-quality MSAs Várnai, Csilla Burkoff, Nikolas S. Wild, David L. PLoS One Research Article Evolutionary information stored in multiple sequence alignments (MSAs) has been used to identify the interaction interface of protein complexes, by measuring either co-conservation or co-mutation of amino acid residues across the interface. Recently, maximum entropy related correlated mutation measures (CMMs) such as direct information, decoupling direct from indirect interactions, have been developed to identify residue pairs interacting across the protein complex interface. These studies have focussed on carefully selected protein complexes with large, good-quality MSAs. In this work, we study protein complexes with a more typical MSA consisting of fewer than 400 sequences, using a set of 79 intramolecular protein complexes. Using a maximum entropy based CMM at the residue level, we develop an interface level CMM score to be used in re-ranking docking decoys. We demonstrate that our interface level CMM score compares favourably to the complementarity trace score, an evolutionary information-based score measuring co-conservation, when combined with the number of interface residues, a knowledge-based potential and the variability score of individual amino acid sites. We also demonstrate, that, since co-mutation and co-complementarity in the MSA contain orthogonal information, the best prediction performance using evolutionary information can be achieved by combining the co-mutation information of the CMM with co-conservation information of a complementarity trace score, predicting a near-native structure as the top prediction for 41% of the dataset. The method presented is not restricted to small MSAs, and will likely improve interface prediction also for complexes with large and good-quality MSAs. Public Library of Science 2017-02-06 /pmc/articles/PMC5293240/ /pubmed/28166227 http://dx.doi.org/10.1371/journal.pone.0169356 Text en © 2017 Várnai et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Várnai, Csilla
Burkoff, Nikolas S.
Wild, David L.
Improving protein-protein interaction prediction using evolutionary information from low-quality MSAs
title Improving protein-protein interaction prediction using evolutionary information from low-quality MSAs
title_full Improving protein-protein interaction prediction using evolutionary information from low-quality MSAs
title_fullStr Improving protein-protein interaction prediction using evolutionary information from low-quality MSAs
title_full_unstemmed Improving protein-protein interaction prediction using evolutionary information from low-quality MSAs
title_short Improving protein-protein interaction prediction using evolutionary information from low-quality MSAs
title_sort improving protein-protein interaction prediction using evolutionary information from low-quality msas
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5293240/
https://www.ncbi.nlm.nih.gov/pubmed/28166227
http://dx.doi.org/10.1371/journal.pone.0169356
work_keys_str_mv AT varnaicsilla improvingproteinproteininteractionpredictionusingevolutionaryinformationfromlowqualitymsas
AT burkoffnikolass improvingproteinproteininteractionpredictionusingevolutionaryinformationfromlowqualitymsas
AT wilddavidl improvingproteinproteininteractionpredictionusingevolutionaryinformationfromlowqualitymsas