Cargando…
Highly accurate protein structure prediction with AlphaFold
Proteins are essential to life, and understanding their structure can facilitate a mechanistic understanding of their function. Through an enormous experimental effort(1–4), the structures of around 100,000 unique proteins have been determined(5), but this represents a small fraction of the billions...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8371605/ https://www.ncbi.nlm.nih.gov/pubmed/34265844 http://dx.doi.org/10.1038/s41586-021-03819-2 |
_version_ | 1783739677993336832 |
---|---|
author | Jumper, John Evans, Richard Pritzel, Alexander Green, Tim Figurnov, Michael Ronneberger, Olaf Tunyasuvunakool, Kathryn Bates, Russ Žídek, Augustin Potapenko, Anna Bridgland, Alex Meyer, Clemens Kohl, Simon A. A. Ballard, Andrew J. Cowie, Andrew Romera-Paredes, Bernardino Nikolov, Stanislav Jain, Rishub Adler, Jonas Back, Trevor Petersen, Stig Reiman, David Clancy, Ellen Zielinski, Michal Steinegger, Martin Pacholska, Michalina Berghammer, Tamas Bodenstein, Sebastian Silver, David Vinyals, Oriol Senior, Andrew W. Kavukcuoglu, Koray Kohli, Pushmeet Hassabis, Demis |
author_facet | Jumper, John Evans, Richard Pritzel, Alexander Green, Tim Figurnov, Michael Ronneberger, Olaf Tunyasuvunakool, Kathryn Bates, Russ Žídek, Augustin Potapenko, Anna Bridgland, Alex Meyer, Clemens Kohl, Simon A. A. Ballard, Andrew J. Cowie, Andrew Romera-Paredes, Bernardino Nikolov, Stanislav Jain, Rishub Adler, Jonas Back, Trevor Petersen, Stig Reiman, David Clancy, Ellen Zielinski, Michal Steinegger, Martin Pacholska, Michalina Berghammer, Tamas Bodenstein, Sebastian Silver, David Vinyals, Oriol Senior, Andrew W. Kavukcuoglu, Koray Kohli, Pushmeet Hassabis, Demis |
author_sort | Jumper, John |
collection | PubMed |
description | Proteins are essential to life, and understanding their structure can facilitate a mechanistic understanding of their function. Through an enormous experimental effort(1–4), the structures of around 100,000 unique proteins have been determined(5), but this represents a small fraction of the billions of known protein sequences(6,7). Structural coverage is bottlenecked by the months to years of painstaking effort required to determine a single protein structure. Accurate computational approaches are needed to address this gap and to enable large-scale structural bioinformatics. Predicting the three-dimensional structure that a protein will adopt based solely on its amino acid sequence—the structure prediction component of the ‘protein folding problem’(8)—has been an important open research problem for more than 50 years(9). Despite recent progress(10–14), existing methods fall far short of atomic accuracy, especially when no homologous structure is available. Here we provide the first computational method that can regularly predict protein structures with atomic accuracy even in cases in which no similar structure is known. We validated an entirely redesigned version of our neural network-based model, AlphaFold, in the challenging 14th Critical Assessment of protein Structure Prediction (CASP14)(15), demonstrating accuracy competitive with experimental structures in a majority of cases and greatly outperforming other methods. Underpinning the latest version of AlphaFold is a novel machine learning approach that incorporates physical and biological knowledge about protein structure, leveraging multi-sequence alignments, into the design of the deep learning algorithm. |
format | Online Article Text |
id | pubmed-8371605 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-83716052021-08-18 Highly accurate protein structure prediction with AlphaFold Jumper, John Evans, Richard Pritzel, Alexander Green, Tim Figurnov, Michael Ronneberger, Olaf Tunyasuvunakool, Kathryn Bates, Russ Žídek, Augustin Potapenko, Anna Bridgland, Alex Meyer, Clemens Kohl, Simon A. A. Ballard, Andrew J. Cowie, Andrew Romera-Paredes, Bernardino Nikolov, Stanislav Jain, Rishub Adler, Jonas Back, Trevor Petersen, Stig Reiman, David Clancy, Ellen Zielinski, Michal Steinegger, Martin Pacholska, Michalina Berghammer, Tamas Bodenstein, Sebastian Silver, David Vinyals, Oriol Senior, Andrew W. Kavukcuoglu, Koray Kohli, Pushmeet Hassabis, Demis Nature Article Proteins are essential to life, and understanding their structure can facilitate a mechanistic understanding of their function. Through an enormous experimental effort(1–4), the structures of around 100,000 unique proteins have been determined(5), but this represents a small fraction of the billions of known protein sequences(6,7). Structural coverage is bottlenecked by the months to years of painstaking effort required to determine a single protein structure. Accurate computational approaches are needed to address this gap and to enable large-scale structural bioinformatics. Predicting the three-dimensional structure that a protein will adopt based solely on its amino acid sequence—the structure prediction component of the ‘protein folding problem’(8)—has been an important open research problem for more than 50 years(9). Despite recent progress(10–14), existing methods fall far short of atomic accuracy, especially when no homologous structure is available. Here we provide the first computational method that can regularly predict protein structures with atomic accuracy even in cases in which no similar structure is known. We validated an entirely redesigned version of our neural network-based model, AlphaFold, in the challenging 14th Critical Assessment of protein Structure Prediction (CASP14)(15), demonstrating accuracy competitive with experimental structures in a majority of cases and greatly outperforming other methods. Underpinning the latest version of AlphaFold is a novel machine learning approach that incorporates physical and biological knowledge about protein structure, leveraging multi-sequence alignments, into the design of the deep learning algorithm. Nature Publishing Group UK 2021-07-15 2021 /pmc/articles/PMC8371605/ /pubmed/34265844 http://dx.doi.org/10.1038/s41586-021-03819-2 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Jumper, John Evans, Richard Pritzel, Alexander Green, Tim Figurnov, Michael Ronneberger, Olaf Tunyasuvunakool, Kathryn Bates, Russ Žídek, Augustin Potapenko, Anna Bridgland, Alex Meyer, Clemens Kohl, Simon A. A. Ballard, Andrew J. Cowie, Andrew Romera-Paredes, Bernardino Nikolov, Stanislav Jain, Rishub Adler, Jonas Back, Trevor Petersen, Stig Reiman, David Clancy, Ellen Zielinski, Michal Steinegger, Martin Pacholska, Michalina Berghammer, Tamas Bodenstein, Sebastian Silver, David Vinyals, Oriol Senior, Andrew W. Kavukcuoglu, Koray Kohli, Pushmeet Hassabis, Demis Highly accurate protein structure prediction with AlphaFold |
title | Highly accurate protein structure prediction with AlphaFold |
title_full | Highly accurate protein structure prediction with AlphaFold |
title_fullStr | Highly accurate protein structure prediction with AlphaFold |
title_full_unstemmed | Highly accurate protein structure prediction with AlphaFold |
title_short | Highly accurate protein structure prediction with AlphaFold |
title_sort | highly accurate protein structure prediction with alphafold |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8371605/ https://www.ncbi.nlm.nih.gov/pubmed/34265844 http://dx.doi.org/10.1038/s41586-021-03819-2 |
work_keys_str_mv | AT jumperjohn highlyaccurateproteinstructurepredictionwithalphafold AT evansrichard highlyaccurateproteinstructurepredictionwithalphafold AT pritzelalexander highlyaccurateproteinstructurepredictionwithalphafold AT greentim highlyaccurateproteinstructurepredictionwithalphafold AT figurnovmichael highlyaccurateproteinstructurepredictionwithalphafold AT ronnebergerolaf highlyaccurateproteinstructurepredictionwithalphafold AT tunyasuvunakoolkathryn highlyaccurateproteinstructurepredictionwithalphafold AT batesruss highlyaccurateproteinstructurepredictionwithalphafold AT zidekaugustin highlyaccurateproteinstructurepredictionwithalphafold AT potapenkoanna highlyaccurateproteinstructurepredictionwithalphafold AT bridglandalex highlyaccurateproteinstructurepredictionwithalphafold AT meyerclemens highlyaccurateproteinstructurepredictionwithalphafold AT kohlsimonaa highlyaccurateproteinstructurepredictionwithalphafold AT ballardandrewj highlyaccurateproteinstructurepredictionwithalphafold AT cowieandrew highlyaccurateproteinstructurepredictionwithalphafold AT romeraparedesbernardino highlyaccurateproteinstructurepredictionwithalphafold AT nikolovstanislav highlyaccurateproteinstructurepredictionwithalphafold AT jainrishub highlyaccurateproteinstructurepredictionwithalphafold AT adlerjonas highlyaccurateproteinstructurepredictionwithalphafold AT backtrevor highlyaccurateproteinstructurepredictionwithalphafold AT petersenstig highlyaccurateproteinstructurepredictionwithalphafold AT reimandavid highlyaccurateproteinstructurepredictionwithalphafold AT clancyellen highlyaccurateproteinstructurepredictionwithalphafold AT zielinskimichal highlyaccurateproteinstructurepredictionwithalphafold AT steineggermartin highlyaccurateproteinstructurepredictionwithalphafold AT pacholskamichalina highlyaccurateproteinstructurepredictionwithalphafold AT berghammertamas highlyaccurateproteinstructurepredictionwithalphafold AT bodensteinsebastian highlyaccurateproteinstructurepredictionwithalphafold AT silverdavid highlyaccurateproteinstructurepredictionwithalphafold AT vinyalsoriol highlyaccurateproteinstructurepredictionwithalphafold AT seniorandreww highlyaccurateproteinstructurepredictionwithalphafold AT kavukcuoglukoray highlyaccurateproteinstructurepredictionwithalphafold AT kohlipushmeet highlyaccurateproteinstructurepredictionwithalphafold AT hassabisdemis highlyaccurateproteinstructurepredictionwithalphafold |