Cargando…
AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms
Deep-learning (DL) methods like DeepMind’s AlphaFold2 (AF2) have led to substantial improvements in protein structure prediction. We analyse confident AF2 models from 21 model organisms using a new classification protocol (CATH-Assign) which exploits novel DL methods for structural comparison and cl...
Autores principales: | , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9908985/ https://www.ncbi.nlm.nih.gov/pubmed/36755055 http://dx.doi.org/10.1038/s42003-023-04488-9 |
_version_ | 1784884470616162304 |
---|---|
author | Bordin, Nicola Sillitoe, Ian Nallapareddy, Vamsi Rauer, Clemens Lam, Su Datt Waman, Vaishali P. Sen, Neeladri Heinzinger, Michael Littmann, Maria Kim, Stephanie Velankar, Sameer Steinegger, Martin Rost, Burkhard Orengo, Christine |
author_facet | Bordin, Nicola Sillitoe, Ian Nallapareddy, Vamsi Rauer, Clemens Lam, Su Datt Waman, Vaishali P. Sen, Neeladri Heinzinger, Michael Littmann, Maria Kim, Stephanie Velankar, Sameer Steinegger, Martin Rost, Burkhard Orengo, Christine |
author_sort | Bordin, Nicola |
collection | PubMed |
description | Deep-learning (DL) methods like DeepMind’s AlphaFold2 (AF2) have led to substantial improvements in protein structure prediction. We analyse confident AF2 models from 21 model organisms using a new classification protocol (CATH-Assign) which exploits novel DL methods for structural comparison and classification. Of ~370,000 confident models, 92% can be assigned to 3253 superfamilies in our CATH domain superfamily classification. The remaining cluster into 2367 putative novel superfamilies. Detailed manual analysis on 618 of these, having at least one human relative, reveal extremely remote homologies and further unusual features. Only 25 novel superfamilies could be confirmed. Although most models map to existing superfamilies, AF2 domains expand CATH by 67% and increases the number of unique ‘global’ folds by 36% and will provide valuable insights on structure function relationships. CATH-Assign will harness the huge expansion in structural data provided by DeepMind to rationalise evolutionary changes driving functional divergence. |
format | Online Article Text |
id | pubmed-9908985 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-99089852023-02-10 AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms Bordin, Nicola Sillitoe, Ian Nallapareddy, Vamsi Rauer, Clemens Lam, Su Datt Waman, Vaishali P. Sen, Neeladri Heinzinger, Michael Littmann, Maria Kim, Stephanie Velankar, Sameer Steinegger, Martin Rost, Burkhard Orengo, Christine Commun Biol Article Deep-learning (DL) methods like DeepMind’s AlphaFold2 (AF2) have led to substantial improvements in protein structure prediction. We analyse confident AF2 models from 21 model organisms using a new classification protocol (CATH-Assign) which exploits novel DL methods for structural comparison and classification. Of ~370,000 confident models, 92% can be assigned to 3253 superfamilies in our CATH domain superfamily classification. The remaining cluster into 2367 putative novel superfamilies. Detailed manual analysis on 618 of these, having at least one human relative, reveal extremely remote homologies and further unusual features. Only 25 novel superfamilies could be confirmed. Although most models map to existing superfamilies, AF2 domains expand CATH by 67% and increases the number of unique ‘global’ folds by 36% and will provide valuable insights on structure function relationships. CATH-Assign will harness the huge expansion in structural data provided by DeepMind to rationalise evolutionary changes driving functional divergence. Nature Publishing Group UK 2023-02-08 /pmc/articles/PMC9908985/ /pubmed/36755055 http://dx.doi.org/10.1038/s42003-023-04488-9 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Bordin, Nicola Sillitoe, Ian Nallapareddy, Vamsi Rauer, Clemens Lam, Su Datt Waman, Vaishali P. Sen, Neeladri Heinzinger, Michael Littmann, Maria Kim, Stephanie Velankar, Sameer Steinegger, Martin Rost, Burkhard Orengo, Christine AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms |
title | AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms |
title_full | AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms |
title_fullStr | AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms |
title_full_unstemmed | AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms |
title_short | AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms |
title_sort | alphafold2 reveals commonalities and novelties in protein structure space for 21 model organisms |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9908985/ https://www.ncbi.nlm.nih.gov/pubmed/36755055 http://dx.doi.org/10.1038/s42003-023-04488-9 |
work_keys_str_mv | AT bordinnicola alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms AT sillitoeian alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms AT nallapareddyvamsi alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms AT rauerclemens alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms AT lamsudatt alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms AT wamanvaishalip alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms AT senneeladri alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms AT heinzingermichael alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms AT littmannmaria alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms AT kimstephanie alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms AT velankarsameer alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms AT steineggermartin alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms AT rostburkhard alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms AT orengochristine alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms |