Cargando…

Large-scale discovery of protein interactions at residue resolution using co-evolution calculated from genomic sequences

Increasing numbers of protein interactions have been identified in high-throughput experiments, but only a small proportion have solved structures. Recently, sequence coevolution-based approaches have led to a breakthrough in predicting monomer protein structures and protein interaction interfaces....

Descripción completa

Detalles Bibliográficos
Autores principales: Green, Anna G., Elhabashy, Hadeer, Brock, Kelly P., Maddamsetti, Rohan, Kohlbacher, Oliver, Marks, Debora S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7925567/
https://www.ncbi.nlm.nih.gov/pubmed/33654096
http://dx.doi.org/10.1038/s41467-021-21636-z
_version_ 1783659296633913344
author Green, Anna G.
Elhabashy, Hadeer
Brock, Kelly P.
Maddamsetti, Rohan
Kohlbacher, Oliver
Marks, Debora S.
author_facet Green, Anna G.
Elhabashy, Hadeer
Brock, Kelly P.
Maddamsetti, Rohan
Kohlbacher, Oliver
Marks, Debora S.
author_sort Green, Anna G.
collection PubMed
description Increasing numbers of protein interactions have been identified in high-throughput experiments, but only a small proportion have solved structures. Recently, sequence coevolution-based approaches have led to a breakthrough in predicting monomer protein structures and protein interaction interfaces. Here, we address the challenges of large-scale interaction prediction at residue resolution with a fast alignment concatenation method and a probabilistic score for the interaction of residues. Importantly, this method (EVcomplex2) is able to assess the likelihood of a protein interaction, as we show here applied to large-scale experimental datasets where the pairwise interactions are unknown. We predict 504 interactions de novo in the E. coli membrane proteome, including 243 that are newly discovered. While EVcomplex2 does not require available structures, coevolving residue pairs can be used to produce structural models of protein interactions, as done here for membrane complexes including the Flagellar Hook-Filament Junction and the Tol/Pal complex.
format Online
Article
Text
id pubmed-7925567
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-79255672021-03-21 Large-scale discovery of protein interactions at residue resolution using co-evolution calculated from genomic sequences Green, Anna G. Elhabashy, Hadeer Brock, Kelly P. Maddamsetti, Rohan Kohlbacher, Oliver Marks, Debora S. Nat Commun Article Increasing numbers of protein interactions have been identified in high-throughput experiments, but only a small proportion have solved structures. Recently, sequence coevolution-based approaches have led to a breakthrough in predicting monomer protein structures and protein interaction interfaces. Here, we address the challenges of large-scale interaction prediction at residue resolution with a fast alignment concatenation method and a probabilistic score for the interaction of residues. Importantly, this method (EVcomplex2) is able to assess the likelihood of a protein interaction, as we show here applied to large-scale experimental datasets where the pairwise interactions are unknown. We predict 504 interactions de novo in the E. coli membrane proteome, including 243 that are newly discovered. While EVcomplex2 does not require available structures, coevolving residue pairs can be used to produce structural models of protein interactions, as done here for membrane complexes including the Flagellar Hook-Filament Junction and the Tol/Pal complex. Nature Publishing Group UK 2021-03-02 /pmc/articles/PMC7925567/ /pubmed/33654096 http://dx.doi.org/10.1038/s41467-021-21636-z Text en © The Author(s) 2021 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Green, Anna G.
Elhabashy, Hadeer
Brock, Kelly P.
Maddamsetti, Rohan
Kohlbacher, Oliver
Marks, Debora S.
Large-scale discovery of protein interactions at residue resolution using co-evolution calculated from genomic sequences
title Large-scale discovery of protein interactions at residue resolution using co-evolution calculated from genomic sequences
title_full Large-scale discovery of protein interactions at residue resolution using co-evolution calculated from genomic sequences
title_fullStr Large-scale discovery of protein interactions at residue resolution using co-evolution calculated from genomic sequences
title_full_unstemmed Large-scale discovery of protein interactions at residue resolution using co-evolution calculated from genomic sequences
title_short Large-scale discovery of protein interactions at residue resolution using co-evolution calculated from genomic sequences
title_sort large-scale discovery of protein interactions at residue resolution using co-evolution calculated from genomic sequences
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7925567/
https://www.ncbi.nlm.nih.gov/pubmed/33654096
http://dx.doi.org/10.1038/s41467-021-21636-z
work_keys_str_mv AT greenannag largescalediscoveryofproteininteractionsatresidueresolutionusingcoevolutioncalculatedfromgenomicsequences
AT elhabashyhadeer largescalediscoveryofproteininteractionsatresidueresolutionusingcoevolutioncalculatedfromgenomicsequences
AT brockkellyp largescalediscoveryofproteininteractionsatresidueresolutionusingcoevolutioncalculatedfromgenomicsequences
AT maddamsettirohan largescalediscoveryofproteininteractionsatresidueresolutionusingcoevolutioncalculatedfromgenomicsequences
AT kohlbacheroliver largescalediscoveryofproteininteractionsatresidueresolutionusingcoevolutioncalculatedfromgenomicsequences
AT marksdeboras largescalediscoveryofproteininteractionsatresidueresolutionusingcoevolutioncalculatedfromgenomicsequences