Cargando…

Highly accurate protein structure prediction with AlphaFold

Proteins are essential to life, and understanding their structure can facilitate a mechanistic understanding of their function. Through an enormous experimental effort(1–4), the structures of around 100,000 unique proteins have been determined(5), but this represents a small fraction of the billions...

Descripción completa

Detalles Bibliográficos
Autores principales: Jumper, John, Evans, Richard, Pritzel, Alexander, Green, Tim, Figurnov, Michael, Ronneberger, Olaf, Tunyasuvunakool, Kathryn, Bates, Russ, Žídek, Augustin, Potapenko, Anna, Bridgland, Alex, Meyer, Clemens, Kohl, Simon A. A., Ballard, Andrew J., Cowie, Andrew, Romera-Paredes, Bernardino, Nikolov, Stanislav, Jain, Rishub, Adler, Jonas, Back, Trevor, Petersen, Stig, Reiman, David, Clancy, Ellen, Zielinski, Michal, Steinegger, Martin, Pacholska, Michalina, Berghammer, Tamas, Bodenstein, Sebastian, Silver, David, Vinyals, Oriol, Senior, Andrew W., Kavukcuoglu, Koray, Kohli, Pushmeet, Hassabis, Demis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8371605/
https://www.ncbi.nlm.nih.gov/pubmed/34265844
http://dx.doi.org/10.1038/s41586-021-03819-2
_version_ 1783739677993336832
author Jumper, John
Evans, Richard
Pritzel, Alexander
Green, Tim
Figurnov, Michael
Ronneberger, Olaf
Tunyasuvunakool, Kathryn
Bates, Russ
Žídek, Augustin
Potapenko, Anna
Bridgland, Alex
Meyer, Clemens
Kohl, Simon A. A.
Ballard, Andrew J.
Cowie, Andrew
Romera-Paredes, Bernardino
Nikolov, Stanislav
Jain, Rishub
Adler, Jonas
Back, Trevor
Petersen, Stig
Reiman, David
Clancy, Ellen
Zielinski, Michal
Steinegger, Martin
Pacholska, Michalina
Berghammer, Tamas
Bodenstein, Sebastian
Silver, David
Vinyals, Oriol
Senior, Andrew W.
Kavukcuoglu, Koray
Kohli, Pushmeet
Hassabis, Demis
author_facet Jumper, John
Evans, Richard
Pritzel, Alexander
Green, Tim
Figurnov, Michael
Ronneberger, Olaf
Tunyasuvunakool, Kathryn
Bates, Russ
Žídek, Augustin
Potapenko, Anna
Bridgland, Alex
Meyer, Clemens
Kohl, Simon A. A.
Ballard, Andrew J.
Cowie, Andrew
Romera-Paredes, Bernardino
Nikolov, Stanislav
Jain, Rishub
Adler, Jonas
Back, Trevor
Petersen, Stig
Reiman, David
Clancy, Ellen
Zielinski, Michal
Steinegger, Martin
Pacholska, Michalina
Berghammer, Tamas
Bodenstein, Sebastian
Silver, David
Vinyals, Oriol
Senior, Andrew W.
Kavukcuoglu, Koray
Kohli, Pushmeet
Hassabis, Demis
author_sort Jumper, John
collection PubMed
description Proteins are essential to life, and understanding their structure can facilitate a mechanistic understanding of their function. Through an enormous experimental effort(1–4), the structures of around 100,000 unique proteins have been determined(5), but this represents a small fraction of the billions of known protein sequences(6,7). Structural coverage is bottlenecked by the months to years of painstaking effort required to determine a single protein structure. Accurate computational approaches are needed to address this gap and to enable large-scale structural bioinformatics. Predicting the three-dimensional structure that a protein will adopt based solely on its amino acid sequence—the structure prediction component of the ‘protein folding problem’(8)—has been an important open research problem for more than 50 years(9). Despite recent progress(10–14), existing methods fall far short of atomic accuracy, especially when no homologous structure is available. Here we provide the first computational method that can regularly predict protein structures with atomic accuracy even in cases in which no similar structure is known. We validated an entirely redesigned version of our neural network-based model, AlphaFold, in the challenging 14th Critical Assessment of protein Structure Prediction (CASP14)(15), demonstrating accuracy competitive with experimental structures in a majority of cases and greatly outperforming other methods. Underpinning the latest version of AlphaFold is a novel machine learning approach that incorporates physical and biological knowledge about protein structure, leveraging multi-sequence alignments, into the design of the deep learning algorithm.
format Online
Article
Text
id pubmed-8371605
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-83716052021-08-18 Highly accurate protein structure prediction with AlphaFold Jumper, John Evans, Richard Pritzel, Alexander Green, Tim Figurnov, Michael Ronneberger, Olaf Tunyasuvunakool, Kathryn Bates, Russ Žídek, Augustin Potapenko, Anna Bridgland, Alex Meyer, Clemens Kohl, Simon A. A. Ballard, Andrew J. Cowie, Andrew Romera-Paredes, Bernardino Nikolov, Stanislav Jain, Rishub Adler, Jonas Back, Trevor Petersen, Stig Reiman, David Clancy, Ellen Zielinski, Michal Steinegger, Martin Pacholska, Michalina Berghammer, Tamas Bodenstein, Sebastian Silver, David Vinyals, Oriol Senior, Andrew W. Kavukcuoglu, Koray Kohli, Pushmeet Hassabis, Demis Nature Article Proteins are essential to life, and understanding their structure can facilitate a mechanistic understanding of their function. Through an enormous experimental effort(1–4), the structures of around 100,000 unique proteins have been determined(5), but this represents a small fraction of the billions of known protein sequences(6,7). Structural coverage is bottlenecked by the months to years of painstaking effort required to determine a single protein structure. Accurate computational approaches are needed to address this gap and to enable large-scale structural bioinformatics. Predicting the three-dimensional structure that a protein will adopt based solely on its amino acid sequence—the structure prediction component of the ‘protein folding problem’(8)—has been an important open research problem for more than 50 years(9). Despite recent progress(10–14), existing methods fall far short of atomic accuracy, especially when no homologous structure is available. Here we provide the first computational method that can regularly predict protein structures with atomic accuracy even in cases in which no similar structure is known. We validated an entirely redesigned version of our neural network-based model, AlphaFold, in the challenging 14th Critical Assessment of protein Structure Prediction (CASP14)(15), demonstrating accuracy competitive with experimental structures in a majority of cases and greatly outperforming other methods. Underpinning the latest version of AlphaFold is a novel machine learning approach that incorporates physical and biological knowledge about protein structure, leveraging multi-sequence alignments, into the design of the deep learning algorithm. Nature Publishing Group UK 2021-07-15 2021 /pmc/articles/PMC8371605/ /pubmed/34265844 http://dx.doi.org/10.1038/s41586-021-03819-2 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Jumper, John
Evans, Richard
Pritzel, Alexander
Green, Tim
Figurnov, Michael
Ronneberger, Olaf
Tunyasuvunakool, Kathryn
Bates, Russ
Žídek, Augustin
Potapenko, Anna
Bridgland, Alex
Meyer, Clemens
Kohl, Simon A. A.
Ballard, Andrew J.
Cowie, Andrew
Romera-Paredes, Bernardino
Nikolov, Stanislav
Jain, Rishub
Adler, Jonas
Back, Trevor
Petersen, Stig
Reiman, David
Clancy, Ellen
Zielinski, Michal
Steinegger, Martin
Pacholska, Michalina
Berghammer, Tamas
Bodenstein, Sebastian
Silver, David
Vinyals, Oriol
Senior, Andrew W.
Kavukcuoglu, Koray
Kohli, Pushmeet
Hassabis, Demis
Highly accurate protein structure prediction with AlphaFold
title Highly accurate protein structure prediction with AlphaFold
title_full Highly accurate protein structure prediction with AlphaFold
title_fullStr Highly accurate protein structure prediction with AlphaFold
title_full_unstemmed Highly accurate protein structure prediction with AlphaFold
title_short Highly accurate protein structure prediction with AlphaFold
title_sort highly accurate protein structure prediction with alphafold
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8371605/
https://www.ncbi.nlm.nih.gov/pubmed/34265844
http://dx.doi.org/10.1038/s41586-021-03819-2
work_keys_str_mv AT jumperjohn highlyaccurateproteinstructurepredictionwithalphafold
AT evansrichard highlyaccurateproteinstructurepredictionwithalphafold
AT pritzelalexander highlyaccurateproteinstructurepredictionwithalphafold
AT greentim highlyaccurateproteinstructurepredictionwithalphafold
AT figurnovmichael highlyaccurateproteinstructurepredictionwithalphafold
AT ronnebergerolaf highlyaccurateproteinstructurepredictionwithalphafold
AT tunyasuvunakoolkathryn highlyaccurateproteinstructurepredictionwithalphafold
AT batesruss highlyaccurateproteinstructurepredictionwithalphafold
AT zidekaugustin highlyaccurateproteinstructurepredictionwithalphafold
AT potapenkoanna highlyaccurateproteinstructurepredictionwithalphafold
AT bridglandalex highlyaccurateproteinstructurepredictionwithalphafold
AT meyerclemens highlyaccurateproteinstructurepredictionwithalphafold
AT kohlsimonaa highlyaccurateproteinstructurepredictionwithalphafold
AT ballardandrewj highlyaccurateproteinstructurepredictionwithalphafold
AT cowieandrew highlyaccurateproteinstructurepredictionwithalphafold
AT romeraparedesbernardino highlyaccurateproteinstructurepredictionwithalphafold
AT nikolovstanislav highlyaccurateproteinstructurepredictionwithalphafold
AT jainrishub highlyaccurateproteinstructurepredictionwithalphafold
AT adlerjonas highlyaccurateproteinstructurepredictionwithalphafold
AT backtrevor highlyaccurateproteinstructurepredictionwithalphafold
AT petersenstig highlyaccurateproteinstructurepredictionwithalphafold
AT reimandavid highlyaccurateproteinstructurepredictionwithalphafold
AT clancyellen highlyaccurateproteinstructurepredictionwithalphafold
AT zielinskimichal highlyaccurateproteinstructurepredictionwithalphafold
AT steineggermartin highlyaccurateproteinstructurepredictionwithalphafold
AT pacholskamichalina highlyaccurateproteinstructurepredictionwithalphafold
AT berghammertamas highlyaccurateproteinstructurepredictionwithalphafold
AT bodensteinsebastian highlyaccurateproteinstructurepredictionwithalphafold
AT silverdavid highlyaccurateproteinstructurepredictionwithalphafold
AT vinyalsoriol highlyaccurateproteinstructurepredictionwithalphafold
AT seniorandreww highlyaccurateproteinstructurepredictionwithalphafold
AT kavukcuoglukoray highlyaccurateproteinstructurepredictionwithalphafold
AT kohlipushmeet highlyaccurateproteinstructurepredictionwithalphafold
AT hassabisdemis highlyaccurateproteinstructurepredictionwithalphafold