CC(+): A searchable database of validated coiled coils in PDB structures and AlphaFold2 models
α‐Helical coiled coils are common tertiary and quaternary elements of protein structure. In coiled coils, two or more α helices wrap around each other to form bundles. This apparently simple structural motif can generate many architectures and topologies. Coiled coil‐forming sequences can be predict...
Main authors: | Kumar, Prasun; Petrenas, Rokas; Dawson, William M.; Schweke, Hugo; Levy, Emmanuel D.; Woolfson, Derek N. |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | John Wiley & Sons, Inc. 2023 |
Subjects: | Tools for Protein Science |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588367/ https://www.ncbi.nlm.nih.gov/pubmed/37768271 http://dx.doi.org/10.1002/pro.4789 |
_version_ | 1785123567451504640 |
---|---|
author | Kumar, Prasun; Petrenas, Rokas; Dawson, William M.; Schweke, Hugo; Levy, Emmanuel D.; Woolfson, Derek N. |
author_facet | Kumar, Prasun; Petrenas, Rokas; Dawson, William M.; Schweke, Hugo; Levy, Emmanuel D.; Woolfson, Derek N. |
author_sort | Kumar, Prasun |
collection | PubMed |
description | α‐Helical coiled coils are common tertiary and quaternary elements of protein structure. In coiled coils, two or more α helices wrap around each other to form bundles. This apparently simple structural motif can generate many architectures and topologies. Coiled coil‐forming sequences can be predicted from heptad repeats of hydrophobic and polar residues, hpphppp, although this is not always reliable. Alternatively, coiled‐coil structures can be identified using the program SOCKET, which finds knobs‐into‐holes (KIH) packing between side chains of neighboring helices. SOCKET also classifies coiled‐coil architecture and topology, thus allowing sequence‐to‐structure relationships to be garnered. In 2009, we used SOCKET to create a relational database of coiled‐coil structures, CC(+), from the RCSB Protein Data Bank (PDB). Here, we report an update of CC(+) following an update of SOCKET (to Socket2), the recent explosion of structural data, and the success of AlphaFold2 in predicting protein structures from genome sequences. With the most‐stringent SOCKET parameters, CC(+) contains ≈12,000 coiled‐coil assemblies from experimentally determined structures, and ≈120,000 potential coiled‐coil structures within single‐chain models predicted by AlphaFold2 across 48 proteomes. CC(+) allows these and other less‐stringently defined coiled coils to be searched at various levels of structure, sequence, and side‐chain interactions. The identified coiled coils can be viewed directly from CC(+) using the Socket2 application, and their associated data can be downloaded for further analyses. CC(+) is freely available at http://coiledcoils.chm.bris.ac.uk/CCPlus/Home.html. It will be updated automatically. We envisage that CC(+) could be used to understand coiled‐coil assemblies and their sequence‐to‐structure relationships, and to aid protein design and engineering. |
format | Online Article Text |
id | pubmed-10588367 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | John Wiley & Sons, Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-10588367 2023-11-01 CC(+): A searchable database of validated coiled coils in PDB structures and AlphaFold2 models. Kumar, Prasun; Petrenas, Rokas; Dawson, William M.; Schweke, Hugo; Levy, Emmanuel D.; Woolfson, Derek N. Protein Sci. Tools for Protein Science. John Wiley & Sons, Inc. 2023-11-01 /pmc/articles/PMC10588367/ /pubmed/37768271 http://dx.doi.org/10.1002/pro.4789 Text en © 2023 The Authors. Protein Science published by Wiley Periodicals LLC on behalf of The Protein Society. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
title | CC(+): A searchable database of validated coiled coils in PDB structures and AlphaFold2 models |
topic | Tools for Protein Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588367/ https://www.ncbi.nlm.nih.gov/pubmed/37768271 http://dx.doi.org/10.1002/pro.4789 |
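The description notes that coiled coil‐forming sequences can be predicted from hpphppp heptad repeats, although not always reliably. As a rough illustration of that idea only, the sketch below maps a sequence to h/p labels and reports which of the seven possible registers best matches the repeat. It is not SOCKET, Socket2, or any method used to build CC(+); the hydrophobic residue set, the function names, and the scoring rule are all assumptions made for this example.

```python
# Minimal sketch of heptad-repeat matching (illustrative only, not SOCKET).
# Assumption: a simple hydrophobic alphabet; real predictors use richer scoring.
HYDROPHOBIC = set("AVILMFWC")
HEPTAD = "hpphppp"  # the repeat quoted in the description


def to_hp(sequence):
    """Map each residue to 'h' (hydrophobic) or 'p' (polar/other)."""
    return "".join("h" if aa in HYDROPHOBIC else "p" for aa in sequence.upper())


def best_heptad_register(sequence):
    """Return (offset, score) for the register that best fits hpphppp.

    offset is the position in HEPTAD assigned to the first residue;
    score is the fraction of residues whose h/p label agrees with the repeat.
    """
    hp = to_hp(sequence)
    best_offset, best_score = 0, -1.0
    for offset in range(7):
        matches = sum(
            1 for i, label in enumerate(hp) if label == HEPTAD[(i + offset) % 7]
        )
        score = matches / len(hp) if hp else 0.0
        if score > best_score:  # ties keep the lowest offset
            best_offset, best_score = offset, score
    return best_offset, best_score


if __name__ == "__main__":
    # Hypothetical idealized sequence, four perfect heptads.
    print(best_heptad_register("LKKLKKK" * 4))
```

A high score only shows that a sequence is compatible with the repeat; as the description says, such predictions are not always reliable, which is why CC(+) relies on structure-based KIH detection instead.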