Cargando…
CATH: expanding the horizons of structure-based functional annotations for genome sequences
This article provides an update of the latest data and developments within the CATH protein structure classification database (http://www.cathdb.info). The resource provides two levels of release: CATH-B, a daily snapshot of the latest structural domain boundaries and superfamily assignments, and CA...
Autores principales: | , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6323983/ https://www.ncbi.nlm.nih.gov/pubmed/30398663 http://dx.doi.org/10.1093/nar/gky1097 |
_version_ | 1783385884622585856 |
---|---|
author | Sillitoe, Ian Dawson, Natalie Lewis, Tony E Das, Sayoni Lees, Jonathan G Ashford, Paul Tolulope, Adeyelu Scholes, Harry M Senatorov, Ilya Bujan, Andra Ceballos Rodriguez-Conde, Fatima Dowling, Benjamin Thornton, Janet Orengo, Christine A |
author_facet | Sillitoe, Ian Dawson, Natalie Lewis, Tony E Das, Sayoni Lees, Jonathan G Ashford, Paul Tolulope, Adeyelu Scholes, Harry M Senatorov, Ilya Bujan, Andra Ceballos Rodriguez-Conde, Fatima Dowling, Benjamin Thornton, Janet Orengo, Christine A |
author_sort | Sillitoe, Ian |
collection | PubMed |
description | This article provides an update of the latest data and developments within the CATH protein structure classification database (http://www.cathdb.info). The resource provides two levels of release: CATH-B, a daily snapshot of the latest structural domain boundaries and superfamily assignments, and CATH+, which adds layers of derived data, such as predicted sequence domains, functional annotations and functional clustering (known as Functional Families or FunFams). The most recent CATH+ release (version 4.2) provides a huge update in the coverage of structural data. This release increases the number of fully- classified domains by over 40% (from 308 999 to 434 857 structural domains), corresponding to an almost two- fold increase in sequence data (from 53 million to over 95 million predicted domains) organised into 6119 superfamilies. The coverage of high-resolution, protein PDB chains that contain at least one assigned CATH domain is now 90.2% (increased from 82.3% in the previous release). A number of highly requested features have also been implemented in our web pages: allowing the user to view an alignment between their query sequence and a representative FunFam structure and providing tools that make it easier to view the full structural context (multi-domain architecture) of domains and chains. |
format | Online Article Text |
id | pubmed-6323983 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-63239832019-01-10 CATH: expanding the horizons of structure-based functional annotations for genome sequences Sillitoe, Ian Dawson, Natalie Lewis, Tony E Das, Sayoni Lees, Jonathan G Ashford, Paul Tolulope, Adeyelu Scholes, Harry M Senatorov, Ilya Bujan, Andra Ceballos Rodriguez-Conde, Fatima Dowling, Benjamin Thornton, Janet Orengo, Christine A Nucleic Acids Res Database Issue This article provides an update of the latest data and developments within the CATH protein structure classification database (http://www.cathdb.info). The resource provides two levels of release: CATH-B, a daily snapshot of the latest structural domain boundaries and superfamily assignments, and CATH+, which adds layers of derived data, such as predicted sequence domains, functional annotations and functional clustering (known as Functional Families or FunFams). The most recent CATH+ release (version 4.2) provides a huge update in the coverage of structural data. This release increases the number of fully- classified domains by over 40% (from 308 999 to 434 857 structural domains), corresponding to an almost two- fold increase in sequence data (from 53 million to over 95 million predicted domains) organised into 6119 superfamilies. The coverage of high-resolution, protein PDB chains that contain at least one assigned CATH domain is now 90.2% (increased from 82.3% in the previous release). A number of highly requested features have also been implemented in our web pages: allowing the user to view an alignment between their query sequence and a representative FunFam structure and providing tools that make it easier to view the full structural context (multi-domain architecture) of domains and chains. Oxford University Press 2019-01-08 2018-11-06 /pmc/articles/PMC6323983/ /pubmed/30398663 http://dx.doi.org/10.1093/nar/gky1097 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Database Issue Sillitoe, Ian Dawson, Natalie Lewis, Tony E Das, Sayoni Lees, Jonathan G Ashford, Paul Tolulope, Adeyelu Scholes, Harry M Senatorov, Ilya Bujan, Andra Ceballos Rodriguez-Conde, Fatima Dowling, Benjamin Thornton, Janet Orengo, Christine A CATH: expanding the horizons of structure-based functional annotations for genome sequences |
title | CATH: expanding the horizons of structure-based functional annotations for genome sequences |
title_full | CATH: expanding the horizons of structure-based functional annotations for genome sequences |
title_fullStr | CATH: expanding the horizons of structure-based functional annotations for genome sequences |
title_full_unstemmed | CATH: expanding the horizons of structure-based functional annotations for genome sequences |
title_short | CATH: expanding the horizons of structure-based functional annotations for genome sequences |
title_sort | cath: expanding the horizons of structure-based functional annotations for genome sequences |
topic | Database Issue |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6323983/ https://www.ncbi.nlm.nih.gov/pubmed/30398663 http://dx.doi.org/10.1093/nar/gky1097 |
work_keys_str_mv | AT sillitoeian cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences AT dawsonnatalie cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences AT lewistonye cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences AT dassayoni cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences AT leesjonathang cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences AT ashfordpaul cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences AT tolulopeadeyelu cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences AT scholesharrym cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences AT senatorovilya cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences AT bujanandra cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences AT ceballosrodriguezcondefatima cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences AT dowlingbenjamin cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences AT thorntonjanet cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences AT orengochristinea cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences |