Cargando…

CATH: increased structural coverage of functional space

CATH (https://www.cathdb.info) identifies domains in protein structures from wwPDB and classifies these into evolutionary superfamilies, thereby providing structural and functional annotations. There are two levels: CATH-B, a daily snapshot of the latest domain structures and superfamily assignments...

Descripción completa

Detalles Bibliográficos
Autores principales: Sillitoe, Ian, Bordin, Nicola, Dawson, Natalie, Waman, Vaishali P, Ashford, Paul, Scholes, Harry M, Pang, Camilla S M, Woodridge, Laurel, Rauer, Clemens, Sen, Neeladri, Abbasian, Mahnaz, Le Cornu, Sean, Lam, Su Datt, Berka, Karel, Varekova, Ivana Hutařová, Svobodova, Radka, Lees, Jon, Orengo, Christine A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7778904/
https://www.ncbi.nlm.nih.gov/pubmed/33237325
http://dx.doi.org/10.1093/nar/gkaa1079
_version_ 1783631220954890240
author Sillitoe, Ian
Bordin, Nicola
Dawson, Natalie
Waman, Vaishali P
Ashford, Paul
Scholes, Harry M
Pang, Camilla S M
Woodridge, Laurel
Rauer, Clemens
Sen, Neeladri
Abbasian, Mahnaz
Le Cornu, Sean
Lam, Su Datt
Berka, Karel
Varekova, Ivana Hutařová
Svobodova, Radka
Lees, Jon
Orengo, Christine A
author_facet Sillitoe, Ian
Bordin, Nicola
Dawson, Natalie
Waman, Vaishali P
Ashford, Paul
Scholes, Harry M
Pang, Camilla S M
Woodridge, Laurel
Rauer, Clemens
Sen, Neeladri
Abbasian, Mahnaz
Le Cornu, Sean
Lam, Su Datt
Berka, Karel
Varekova, Ivana Hutařová
Svobodova, Radka
Lees, Jon
Orengo, Christine A
author_sort Sillitoe, Ian
collection PubMed
description CATH (https://www.cathdb.info) identifies domains in protein structures from wwPDB and classifies these into evolutionary superfamilies, thereby providing structural and functional annotations. There are two levels: CATH-B, a daily snapshot of the latest domain structures and superfamily assignments, and CATH+, with additional derived data, such as predicted sequence domains, and functionally coherent sequence subsets (Functional Families or FunFams). The latest CATH+ release, version 4.3, significantly increases coverage of structural and sequence data, with an addition of 65,351 fully-classified domains structures (+15%), providing 500 238 structural domains, and 151 million predicted sequence domains (+59%) assigned to 5481 superfamilies. The FunFam generation pipeline has been re-engineered to cope with the increased influx of data. Three times more sequences are captured in FunFams, with a concomitant increase in functional purity, information content and structural coverage. FunFam expansion increases the structural annotations provided for experimental GO terms (+59%). We also present CATH-FunVar web-pages displaying variations in protein sequences and their proximity to known or predicted functional sites. We present two case studies (1) putative cancer drivers and (2) SARS-CoV-2 proteins. Finally, we have improved links to and from CATH including SCOP, InterPro, Aquaria and 2DProt.
format Online
Article
Text
id pubmed-7778904
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-77789042021-01-06 CATH: increased structural coverage of functional space Sillitoe, Ian Bordin, Nicola Dawson, Natalie Waman, Vaishali P Ashford, Paul Scholes, Harry M Pang, Camilla S M Woodridge, Laurel Rauer, Clemens Sen, Neeladri Abbasian, Mahnaz Le Cornu, Sean Lam, Su Datt Berka, Karel Varekova, Ivana Hutařová Svobodova, Radka Lees, Jon Orengo, Christine A Nucleic Acids Res Database Issue CATH (https://www.cathdb.info) identifies domains in protein structures from wwPDB and classifies these into evolutionary superfamilies, thereby providing structural and functional annotations. There are two levels: CATH-B, a daily snapshot of the latest domain structures and superfamily assignments, and CATH+, with additional derived data, such as predicted sequence domains, and functionally coherent sequence subsets (Functional Families or FunFams). The latest CATH+ release, version 4.3, significantly increases coverage of structural and sequence data, with an addition of 65,351 fully-classified domains structures (+15%), providing 500 238 structural domains, and 151 million predicted sequence domains (+59%) assigned to 5481 superfamilies. The FunFam generation pipeline has been re-engineered to cope with the increased influx of data. Three times more sequences are captured in FunFams, with a concomitant increase in functional purity, information content and structural coverage. FunFam expansion increases the structural annotations provided for experimental GO terms (+59%). We also present CATH-FunVar web-pages displaying variations in protein sequences and their proximity to known or predicted functional sites. We present two case studies (1) putative cancer drivers and (2) SARS-CoV-2 proteins. Finally, we have improved links to and from CATH including SCOP, InterPro, Aquaria and 2DProt. Oxford University Press 2020-11-25 /pmc/articles/PMC7778904/ /pubmed/33237325 http://dx.doi.org/10.1093/nar/gkaa1079 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database Issue
Sillitoe, Ian
Bordin, Nicola
Dawson, Natalie
Waman, Vaishali P
Ashford, Paul
Scholes, Harry M
Pang, Camilla S M
Woodridge, Laurel
Rauer, Clemens
Sen, Neeladri
Abbasian, Mahnaz
Le Cornu, Sean
Lam, Su Datt
Berka, Karel
Varekova, Ivana Hutařová
Svobodova, Radka
Lees, Jon
Orengo, Christine A
CATH: increased structural coverage of functional space
title CATH: increased structural coverage of functional space
title_full CATH: increased structural coverage of functional space
title_fullStr CATH: increased structural coverage of functional space
title_full_unstemmed CATH: increased structural coverage of functional space
title_short CATH: increased structural coverage of functional space
title_sort cath: increased structural coverage of functional space
topic Database Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7778904/
https://www.ncbi.nlm.nih.gov/pubmed/33237325
http://dx.doi.org/10.1093/nar/gkaa1079
work_keys_str_mv AT sillitoeian cathincreasedstructuralcoverageoffunctionalspace
AT bordinnicola cathincreasedstructuralcoverageoffunctionalspace
AT dawsonnatalie cathincreasedstructuralcoverageoffunctionalspace
AT wamanvaishalip cathincreasedstructuralcoverageoffunctionalspace
AT ashfordpaul cathincreasedstructuralcoverageoffunctionalspace
AT scholesharrym cathincreasedstructuralcoverageoffunctionalspace
AT pangcamillasm cathincreasedstructuralcoverageoffunctionalspace
AT woodridgelaurel cathincreasedstructuralcoverageoffunctionalspace
AT rauerclemens cathincreasedstructuralcoverageoffunctionalspace
AT senneeladri cathincreasedstructuralcoverageoffunctionalspace
AT abbasianmahnaz cathincreasedstructuralcoverageoffunctionalspace
AT lecornusean cathincreasedstructuralcoverageoffunctionalspace
AT lamsudatt cathincreasedstructuralcoverageoffunctionalspace
AT berkakarel cathincreasedstructuralcoverageoffunctionalspace
AT varekovaivanahutarova cathincreasedstructuralcoverageoffunctionalspace
AT svobodovaradka cathincreasedstructuralcoverageoffunctionalspace
AT leesjon cathincreasedstructuralcoverageoffunctionalspace
AT orengochristinea cathincreasedstructuralcoverageoffunctionalspace