Cargando…

Sequence-structure-function relationships in the microbial protein universe

For the past half-century, structural biologists relied on the notion that similar protein sequences give rise to similar structures and functions. While this assumption has driven research to explore certain parts of the protein universe, it disregards spaces that don’t rely on this assumption. Her...

Descripción completa

Detalles Bibliográficos
Autores principales: Koehler Leman, Julia, Szczerbiak, Pawel, Renfrew, P. Douglas, Gligorijevic, Vladimir, Berenberg, Daniel, Vatanen, Tommi, Taylor, Bryn C., Chandler, Chris, Janssen, Stefan, Pataki, Andras, Carriero, Nick, Fisk, Ian, Xavier, Ramnik J., Knight, Rob, Bonneau, Richard, Kosciolek, Tomasz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10133388/
https://www.ncbi.nlm.nih.gov/pubmed/37100781
http://dx.doi.org/10.1038/s41467-023-37896-w
_version_ 1785031558352076800
author Koehler Leman, Julia
Szczerbiak, Pawel
Renfrew, P. Douglas
Gligorijevic, Vladimir
Berenberg, Daniel
Vatanen, Tommi
Taylor, Bryn C.
Chandler, Chris
Janssen, Stefan
Pataki, Andras
Carriero, Nick
Fisk, Ian
Xavier, Ramnik J.
Knight, Rob
Bonneau, Richard
Kosciolek, Tomasz
author_facet Koehler Leman, Julia
Szczerbiak, Pawel
Renfrew, P. Douglas
Gligorijevic, Vladimir
Berenberg, Daniel
Vatanen, Tommi
Taylor, Bryn C.
Chandler, Chris
Janssen, Stefan
Pataki, Andras
Carriero, Nick
Fisk, Ian
Xavier, Ramnik J.
Knight, Rob
Bonneau, Richard
Kosciolek, Tomasz
author_sort Koehler Leman, Julia
collection PubMed
description For the past half-century, structural biologists relied on the notion that similar protein sequences give rise to similar structures and functions. While this assumption has driven research to explore certain parts of the protein universe, it disregards spaces that don’t rely on this assumption. Here we explore areas of the protein universe where similar protein functions can be achieved by different sequences and different structures. We predict ~200,000 structures for diverse protein sequences from 1,003 representative genomes across the microbial tree of life and annotate them functionally on a per-residue basis. Structure prediction is accomplished using the World Community Grid, a large-scale citizen science initiative. The resulting database of structural models is complementary to the AlphaFold database, with regards to domains of life as well as sequence diversity and sequence length. We identify 148 novel folds and describe examples where we map specific functions to structural motifs. We also show that the structural space is continuous and largely saturated, highlighting the need for a shift in focus across all branches of biology, from obtaining structures to putting them into context and from sequence-based to sequence-structure-function based meta-omics analyses.
format Online
Article
Text
id pubmed-10133388
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-101333882023-04-28 Sequence-structure-function relationships in the microbial protein universe Koehler Leman, Julia Szczerbiak, Pawel Renfrew, P. Douglas Gligorijevic, Vladimir Berenberg, Daniel Vatanen, Tommi Taylor, Bryn C. Chandler, Chris Janssen, Stefan Pataki, Andras Carriero, Nick Fisk, Ian Xavier, Ramnik J. Knight, Rob Bonneau, Richard Kosciolek, Tomasz Nat Commun Article For the past half-century, structural biologists relied on the notion that similar protein sequences give rise to similar structures and functions. While this assumption has driven research to explore certain parts of the protein universe, it disregards spaces that don’t rely on this assumption. Here we explore areas of the protein universe where similar protein functions can be achieved by different sequences and different structures. We predict ~200,000 structures for diverse protein sequences from 1,003 representative genomes across the microbial tree of life and annotate them functionally on a per-residue basis. Structure prediction is accomplished using the World Community Grid, a large-scale citizen science initiative. The resulting database of structural models is complementary to the AlphaFold database, with regards to domains of life as well as sequence diversity and sequence length. We identify 148 novel folds and describe examples where we map specific functions to structural motifs. We also show that the structural space is continuous and largely saturated, highlighting the need for a shift in focus across all branches of biology, from obtaining structures to putting them into context and from sequence-based to sequence-structure-function based meta-omics analyses. Nature Publishing Group UK 2023-04-26 /pmc/articles/PMC10133388/ /pubmed/37100781 http://dx.doi.org/10.1038/s41467-023-37896-w Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Koehler Leman, Julia
Szczerbiak, Pawel
Renfrew, P. Douglas
Gligorijevic, Vladimir
Berenberg, Daniel
Vatanen, Tommi
Taylor, Bryn C.
Chandler, Chris
Janssen, Stefan
Pataki, Andras
Carriero, Nick
Fisk, Ian
Xavier, Ramnik J.
Knight, Rob
Bonneau, Richard
Kosciolek, Tomasz
Sequence-structure-function relationships in the microbial protein universe
title Sequence-structure-function relationships in the microbial protein universe
title_full Sequence-structure-function relationships in the microbial protein universe
title_fullStr Sequence-structure-function relationships in the microbial protein universe
title_full_unstemmed Sequence-structure-function relationships in the microbial protein universe
title_short Sequence-structure-function relationships in the microbial protein universe
title_sort sequence-structure-function relationships in the microbial protein universe
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10133388/
https://www.ncbi.nlm.nih.gov/pubmed/37100781
http://dx.doi.org/10.1038/s41467-023-37896-w
work_keys_str_mv AT koehlerlemanjulia sequencestructurefunctionrelationshipsinthemicrobialproteinuniverse
AT szczerbiakpawel sequencestructurefunctionrelationshipsinthemicrobialproteinuniverse
AT renfrewpdouglas sequencestructurefunctionrelationshipsinthemicrobialproteinuniverse
AT gligorijevicvladimir sequencestructurefunctionrelationshipsinthemicrobialproteinuniverse
AT berenbergdaniel sequencestructurefunctionrelationshipsinthemicrobialproteinuniverse
AT vatanentommi sequencestructurefunctionrelationshipsinthemicrobialproteinuniverse
AT taylorbrync sequencestructurefunctionrelationshipsinthemicrobialproteinuniverse
AT chandlerchris sequencestructurefunctionrelationshipsinthemicrobialproteinuniverse
AT janssenstefan sequencestructurefunctionrelationshipsinthemicrobialproteinuniverse
AT patakiandras sequencestructurefunctionrelationshipsinthemicrobialproteinuniverse
AT carrieronick sequencestructurefunctionrelationshipsinthemicrobialproteinuniverse
AT fiskian sequencestructurefunctionrelationshipsinthemicrobialproteinuniverse
AT xavierramnikj sequencestructurefunctionrelationshipsinthemicrobialproteinuniverse
AT knightrob sequencestructurefunctionrelationshipsinthemicrobialproteinuniverse
AT bonneaurichard sequencestructurefunctionrelationshipsinthemicrobialproteinuniverse
AT kosciolektomasz sequencestructurefunctionrelationshipsinthemicrobialproteinuniverse