Cargando…

AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms

Deep-learning (DL) methods like DeepMind’s AlphaFold2 (AF2) have led to substantial improvements in protein structure prediction. We analyse confident AF2 models from 21 model organisms using a new classification protocol (CATH-Assign) which exploits novel DL methods for structural comparison and cl...

Descripción completa

Detalles Bibliográficos
Autores principales: Bordin, Nicola, Sillitoe, Ian, Nallapareddy, Vamsi, Rauer, Clemens, Lam, Su Datt, Waman, Vaishali P., Sen, Neeladri, Heinzinger, Michael, Littmann, Maria, Kim, Stephanie, Velankar, Sameer, Steinegger, Martin, Rost, Burkhard, Orengo, Christine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9908985/
https://www.ncbi.nlm.nih.gov/pubmed/36755055
http://dx.doi.org/10.1038/s42003-023-04488-9
_version_ 1784884470616162304
author Bordin, Nicola
Sillitoe, Ian
Nallapareddy, Vamsi
Rauer, Clemens
Lam, Su Datt
Waman, Vaishali P.
Sen, Neeladri
Heinzinger, Michael
Littmann, Maria
Kim, Stephanie
Velankar, Sameer
Steinegger, Martin
Rost, Burkhard
Orengo, Christine
author_facet Bordin, Nicola
Sillitoe, Ian
Nallapareddy, Vamsi
Rauer, Clemens
Lam, Su Datt
Waman, Vaishali P.
Sen, Neeladri
Heinzinger, Michael
Littmann, Maria
Kim, Stephanie
Velankar, Sameer
Steinegger, Martin
Rost, Burkhard
Orengo, Christine
author_sort Bordin, Nicola
collection PubMed
description Deep-learning (DL) methods like DeepMind’s AlphaFold2 (AF2) have led to substantial improvements in protein structure prediction. We analyse confident AF2 models from 21 model organisms using a new classification protocol (CATH-Assign) which exploits novel DL methods for structural comparison and classification. Of ~370,000 confident models, 92% can be assigned to 3253 superfamilies in our CATH domain superfamily classification. The remaining cluster into 2367 putative novel superfamilies. Detailed manual analysis on 618 of these, having at least one human relative, reveal extremely remote homologies and further unusual features. Only 25 novel superfamilies could be confirmed. Although most models map to existing superfamilies, AF2 domains expand CATH by 67% and increases the number of unique ‘global’ folds by 36% and will provide valuable insights on structure function relationships. CATH-Assign will harness the huge expansion in structural data provided by DeepMind to rationalise evolutionary changes driving functional divergence.
format Online
Article
Text
id pubmed-9908985
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-99089852023-02-10 AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms Bordin, Nicola Sillitoe, Ian Nallapareddy, Vamsi Rauer, Clemens Lam, Su Datt Waman, Vaishali P. Sen, Neeladri Heinzinger, Michael Littmann, Maria Kim, Stephanie Velankar, Sameer Steinegger, Martin Rost, Burkhard Orengo, Christine Commun Biol Article Deep-learning (DL) methods like DeepMind’s AlphaFold2 (AF2) have led to substantial improvements in protein structure prediction. We analyse confident AF2 models from 21 model organisms using a new classification protocol (CATH-Assign) which exploits novel DL methods for structural comparison and classification. Of ~370,000 confident models, 92% can be assigned to 3253 superfamilies in our CATH domain superfamily classification. The remaining cluster into 2367 putative novel superfamilies. Detailed manual analysis on 618 of these, having at least one human relative, reveal extremely remote homologies and further unusual features. Only 25 novel superfamilies could be confirmed. Although most models map to existing superfamilies, AF2 domains expand CATH by 67% and increases the number of unique ‘global’ folds by 36% and will provide valuable insights on structure function relationships. CATH-Assign will harness the huge expansion in structural data provided by DeepMind to rationalise evolutionary changes driving functional divergence. Nature Publishing Group UK 2023-02-08 /pmc/articles/PMC9908985/ /pubmed/36755055 http://dx.doi.org/10.1038/s42003-023-04488-9 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Bordin, Nicola
Sillitoe, Ian
Nallapareddy, Vamsi
Rauer, Clemens
Lam, Su Datt
Waman, Vaishali P.
Sen, Neeladri
Heinzinger, Michael
Littmann, Maria
Kim, Stephanie
Velankar, Sameer
Steinegger, Martin
Rost, Burkhard
Orengo, Christine
AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms
title AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms
title_full AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms
title_fullStr AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms
title_full_unstemmed AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms
title_short AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms
title_sort alphafold2 reveals commonalities and novelties in protein structure space for 21 model organisms
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9908985/
https://www.ncbi.nlm.nih.gov/pubmed/36755055
http://dx.doi.org/10.1038/s42003-023-04488-9
work_keys_str_mv AT bordinnicola alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms
AT sillitoeian alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms
AT nallapareddyvamsi alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms
AT rauerclemens alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms
AT lamsudatt alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms
AT wamanvaishalip alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms
AT senneeladri alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms
AT heinzingermichael alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms
AT littmannmaria alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms
AT kimstephanie alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms
AT velankarsameer alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms
AT steineggermartin alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms
AT rostburkhard alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms
AT orengochristine alphafold2revealscommonalitiesandnoveltiesinproteinstructurespacefor21modelorganisms