Cargando…

Benchmarking AlphaFold for protein complex modeling reveals accuracy determinants

High‐resolution experimental structural determination of protein–protein interactions has led to valuable mechanistic insights, yet due to the massive number of interactions and experimental limitations there is a need for computational methods that can accurately model their structures. Here we exp...

Descripción completa

Detalles Bibliográficos
Autores principales: Yin, Rui, Feng, Brandon Y., Varshney, Amitabh, Pierce, Brian G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley & Sons, Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9278006/
https://www.ncbi.nlm.nih.gov/pubmed/35900023
http://dx.doi.org/10.1002/pro.4379
_version_ 1784746108115746816
author Yin, Rui
Feng, Brandon Y.
Varshney, Amitabh
Pierce, Brian G.
author_facet Yin, Rui
Feng, Brandon Y.
Varshney, Amitabh
Pierce, Brian G.
author_sort Yin, Rui
collection PubMed
description High‐resolution experimental structural determination of protein–protein interactions has led to valuable mechanistic insights, yet due to the massive number of interactions and experimental limitations there is a need for computational methods that can accurately model their structures. Here we explore the use of the recently developed deep learning method, AlphaFold, to predict structures of protein complexes from sequence. With a benchmark of 152 diverse heterodimeric protein complexes, multiple implementations and parameters of AlphaFold were tested for accuracy. Remarkably, many cases (43%) had near‐native models (medium or high critical assessment of predicted interactions accuracy) generated as top‐ranked predictions by AlphaFold, greatly surpassing the performance of unbound protein–protein docking (9% success rate for near‐native top‐ranked models), however AlphaFold modeling of antibody–antigen complexes within our set was unsuccessful. We identified sequence and structural features associated with lack of AlphaFold success, and we also investigated the impact of multiple sequence alignment input. Benchmarking of a multimer‐optimized version of AlphaFold (AlphaFold‐Multimer) with a set of recently released antibody–antigen structures confirmed a low rate of success for antibody–antigen complexes (11% success), and we found that T cell receptor–antigen complexes are likewise not accurately modeled by that algorithm, showing that adaptive immune recognition poses a challenge for the current AlphaFold algorithm and model. Overall, our study demonstrates that end‐to‐end deep learning can accurately model many transient protein complexes, and highlights areas of improvement for future developments to reliably model any protein–protein interaction of interest.
format Online
Article
Text
id pubmed-9278006
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley & Sons, Inc.
record_format MEDLINE/PubMed
spelling pubmed-92780062022-07-15 Benchmarking AlphaFold for protein complex modeling reveals accuracy determinants Yin, Rui Feng, Brandon Y. Varshney, Amitabh Pierce, Brian G. Protein Sci Full‐length Papers High‐resolution experimental structural determination of protein–protein interactions has led to valuable mechanistic insights, yet due to the massive number of interactions and experimental limitations there is a need for computational methods that can accurately model their structures. Here we explore the use of the recently developed deep learning method, AlphaFold, to predict structures of protein complexes from sequence. With a benchmark of 152 diverse heterodimeric protein complexes, multiple implementations and parameters of AlphaFold were tested for accuracy. Remarkably, many cases (43%) had near‐native models (medium or high critical assessment of predicted interactions accuracy) generated as top‐ranked predictions by AlphaFold, greatly surpassing the performance of unbound protein–protein docking (9% success rate for near‐native top‐ranked models), however AlphaFold modeling of antibody–antigen complexes within our set was unsuccessful. We identified sequence and structural features associated with lack of AlphaFold success, and we also investigated the impact of multiple sequence alignment input. Benchmarking of a multimer‐optimized version of AlphaFold (AlphaFold‐Multimer) with a set of recently released antibody–antigen structures confirmed a low rate of success for antibody–antigen complexes (11% success), and we found that T cell receptor–antigen complexes are likewise not accurately modeled by that algorithm, showing that adaptive immune recognition poses a challenge for the current AlphaFold algorithm and model. Overall, our study demonstrates that end‐to‐end deep learning can accurately model many transient protein complexes, and highlights areas of improvement for future developments to reliably model any protein–protein interaction of interest. John Wiley & Sons, Inc. 2022-07-13 2022-08 /pmc/articles/PMC9278006/ /pubmed/35900023 http://dx.doi.org/10.1002/pro.4379 Text en © 2022 The Authors. Protein Science published by Wiley Periodicals LLC on behalf of The Protein Society. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Full‐length Papers
Yin, Rui
Feng, Brandon Y.
Varshney, Amitabh
Pierce, Brian G.
Benchmarking AlphaFold for protein complex modeling reveals accuracy determinants
title Benchmarking AlphaFold for protein complex modeling reveals accuracy determinants
title_full Benchmarking AlphaFold for protein complex modeling reveals accuracy determinants
title_fullStr Benchmarking AlphaFold for protein complex modeling reveals accuracy determinants
title_full_unstemmed Benchmarking AlphaFold for protein complex modeling reveals accuracy determinants
title_short Benchmarking AlphaFold for protein complex modeling reveals accuracy determinants
title_sort benchmarking alphafold for protein complex modeling reveals accuracy determinants
topic Full‐length Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9278006/
https://www.ncbi.nlm.nih.gov/pubmed/35900023
http://dx.doi.org/10.1002/pro.4379
work_keys_str_mv AT yinrui benchmarkingalphafoldforproteincomplexmodelingrevealsaccuracydeterminants
AT fengbrandony benchmarkingalphafoldforproteincomplexmodelingrevealsaccuracydeterminants
AT varshneyamitabh benchmarkingalphafoldforproteincomplexmodelingrevealsaccuracydeterminants
AT piercebriang benchmarkingalphafoldforproteincomplexmodelingrevealsaccuracydeterminants