Cargando…
CATH: increased structural coverage of functional space
CATH (https://www.cathdb.info) identifies domains in protein structures from wwPDB and classifies these into evolutionary superfamilies, thereby providing structural and functional annotations. There are two levels: CATH-B, a daily snapshot of the latest domain structures and superfamily assignments...
Autores principales: | , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7778904/ https://www.ncbi.nlm.nih.gov/pubmed/33237325 http://dx.doi.org/10.1093/nar/gkaa1079 |
_version_ | 1783631220954890240 |
---|---|
author | Sillitoe, Ian Bordin, Nicola Dawson, Natalie Waman, Vaishali P Ashford, Paul Scholes, Harry M Pang, Camilla S M Woodridge, Laurel Rauer, Clemens Sen, Neeladri Abbasian, Mahnaz Le Cornu, Sean Lam, Su Datt Berka, Karel Varekova, Ivana Hutařová Svobodova, Radka Lees, Jon Orengo, Christine A |
author_facet | Sillitoe, Ian Bordin, Nicola Dawson, Natalie Waman, Vaishali P Ashford, Paul Scholes, Harry M Pang, Camilla S M Woodridge, Laurel Rauer, Clemens Sen, Neeladri Abbasian, Mahnaz Le Cornu, Sean Lam, Su Datt Berka, Karel Varekova, Ivana Hutařová Svobodova, Radka Lees, Jon Orengo, Christine A |
author_sort | Sillitoe, Ian |
collection | PubMed |
description | CATH (https://www.cathdb.info) identifies domains in protein structures from wwPDB and classifies these into evolutionary superfamilies, thereby providing structural and functional annotations. There are two levels: CATH-B, a daily snapshot of the latest domain structures and superfamily assignments, and CATH+, with additional derived data, such as predicted sequence domains, and functionally coherent sequence subsets (Functional Families or FunFams). The latest CATH+ release, version 4.3, significantly increases coverage of structural and sequence data, with an addition of 65,351 fully-classified domains structures (+15%), providing 500 238 structural domains, and 151 million predicted sequence domains (+59%) assigned to 5481 superfamilies. The FunFam generation pipeline has been re-engineered to cope with the increased influx of data. Three times more sequences are captured in FunFams, with a concomitant increase in functional purity, information content and structural coverage. FunFam expansion increases the structural annotations provided for experimental GO terms (+59%). We also present CATH-FunVar web-pages displaying variations in protein sequences and their proximity to known or predicted functional sites. We present two case studies (1) putative cancer drivers and (2) SARS-CoV-2 proteins. Finally, we have improved links to and from CATH including SCOP, InterPro, Aquaria and 2DProt. |
format | Online Article Text |
id | pubmed-7778904 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-77789042021-01-06 CATH: increased structural coverage of functional space Sillitoe, Ian Bordin, Nicola Dawson, Natalie Waman, Vaishali P Ashford, Paul Scholes, Harry M Pang, Camilla S M Woodridge, Laurel Rauer, Clemens Sen, Neeladri Abbasian, Mahnaz Le Cornu, Sean Lam, Su Datt Berka, Karel Varekova, Ivana Hutařová Svobodova, Radka Lees, Jon Orengo, Christine A Nucleic Acids Res Database Issue CATH (https://www.cathdb.info) identifies domains in protein structures from wwPDB and classifies these into evolutionary superfamilies, thereby providing structural and functional annotations. There are two levels: CATH-B, a daily snapshot of the latest domain structures and superfamily assignments, and CATH+, with additional derived data, such as predicted sequence domains, and functionally coherent sequence subsets (Functional Families or FunFams). The latest CATH+ release, version 4.3, significantly increases coverage of structural and sequence data, with an addition of 65,351 fully-classified domains structures (+15%), providing 500 238 structural domains, and 151 million predicted sequence domains (+59%) assigned to 5481 superfamilies. The FunFam generation pipeline has been re-engineered to cope with the increased influx of data. Three times more sequences are captured in FunFams, with a concomitant increase in functional purity, information content and structural coverage. FunFam expansion increases the structural annotations provided for experimental GO terms (+59%). We also present CATH-FunVar web-pages displaying variations in protein sequences and their proximity to known or predicted functional sites. We present two case studies (1) putative cancer drivers and (2) SARS-CoV-2 proteins. Finally, we have improved links to and from CATH including SCOP, InterPro, Aquaria and 2DProt. Oxford University Press 2020-11-25 /pmc/articles/PMC7778904/ /pubmed/33237325 http://dx.doi.org/10.1093/nar/gkaa1079 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Database Issue Sillitoe, Ian Bordin, Nicola Dawson, Natalie Waman, Vaishali P Ashford, Paul Scholes, Harry M Pang, Camilla S M Woodridge, Laurel Rauer, Clemens Sen, Neeladri Abbasian, Mahnaz Le Cornu, Sean Lam, Su Datt Berka, Karel Varekova, Ivana Hutařová Svobodova, Radka Lees, Jon Orengo, Christine A CATH: increased structural coverage of functional space |
title | CATH: increased structural coverage of functional space |
title_full | CATH: increased structural coverage of functional space |
title_fullStr | CATH: increased structural coverage of functional space |
title_full_unstemmed | CATH: increased structural coverage of functional space |
title_short | CATH: increased structural coverage of functional space |
title_sort | cath: increased structural coverage of functional space |
topic | Database Issue |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7778904/ https://www.ncbi.nlm.nih.gov/pubmed/33237325 http://dx.doi.org/10.1093/nar/gkaa1079 |
work_keys_str_mv | AT sillitoeian cathincreasedstructuralcoverageoffunctionalspace AT bordinnicola cathincreasedstructuralcoverageoffunctionalspace AT dawsonnatalie cathincreasedstructuralcoverageoffunctionalspace AT wamanvaishalip cathincreasedstructuralcoverageoffunctionalspace AT ashfordpaul cathincreasedstructuralcoverageoffunctionalspace AT scholesharrym cathincreasedstructuralcoverageoffunctionalspace AT pangcamillasm cathincreasedstructuralcoverageoffunctionalspace AT woodridgelaurel cathincreasedstructuralcoverageoffunctionalspace AT rauerclemens cathincreasedstructuralcoverageoffunctionalspace AT senneeladri cathincreasedstructuralcoverageoffunctionalspace AT abbasianmahnaz cathincreasedstructuralcoverageoffunctionalspace AT lecornusean cathincreasedstructuralcoverageoffunctionalspace AT lamsudatt cathincreasedstructuralcoverageoffunctionalspace AT berkakarel cathincreasedstructuralcoverageoffunctionalspace AT varekovaivanahutarova cathincreasedstructuralcoverageoffunctionalspace AT svobodovaradka cathincreasedstructuralcoverageoffunctionalspace AT leesjon cathincreasedstructuralcoverageoffunctionalspace AT orengochristinea cathincreasedstructuralcoverageoffunctionalspace |