Cargando…

CATH: expanding the horizons of structure-based functional annotations for genome sequences

This article provides an update of the latest data and developments within the CATH protein structure classification database (http://www.cathdb.info). The resource provides two levels of release: CATH-B, a daily snapshot of the latest structural domain boundaries and superfamily assignments, and CA...

Descripción completa

Detalles Bibliográficos
Autores principales: Sillitoe, Ian, Dawson, Natalie, Lewis, Tony E, Das, Sayoni, Lees, Jonathan G, Ashford, Paul, Tolulope, Adeyelu, Scholes, Harry M, Senatorov, Ilya, Bujan, Andra, Ceballos Rodriguez-Conde, Fatima, Dowling, Benjamin, Thornton, Janet, Orengo, Christine A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6323983/
https://www.ncbi.nlm.nih.gov/pubmed/30398663
http://dx.doi.org/10.1093/nar/gky1097
_version_ 1783385884622585856
author Sillitoe, Ian
Dawson, Natalie
Lewis, Tony E
Das, Sayoni
Lees, Jonathan G
Ashford, Paul
Tolulope, Adeyelu
Scholes, Harry M
Senatorov, Ilya
Bujan, Andra
Ceballos Rodriguez-Conde, Fatima
Dowling, Benjamin
Thornton, Janet
Orengo, Christine A
author_facet Sillitoe, Ian
Dawson, Natalie
Lewis, Tony E
Das, Sayoni
Lees, Jonathan G
Ashford, Paul
Tolulope, Adeyelu
Scholes, Harry M
Senatorov, Ilya
Bujan, Andra
Ceballos Rodriguez-Conde, Fatima
Dowling, Benjamin
Thornton, Janet
Orengo, Christine A
author_sort Sillitoe, Ian
collection PubMed
description This article provides an update of the latest data and developments within the CATH protein structure classification database (http://www.cathdb.info). The resource provides two levels of release: CATH-B, a daily snapshot of the latest structural domain boundaries and superfamily assignments, and CATH+, which adds layers of derived data, such as predicted sequence domains, functional annotations and functional clustering (known as Functional Families or FunFams). The most recent CATH+ release (version 4.2) provides a huge update in the coverage of structural data. This release increases the number of fully- classified domains by over 40% (from 308 999 to 434 857 structural domains), corresponding to an almost two- fold increase in sequence data (from 53 million to over 95 million predicted domains) organised into 6119 superfamilies. The coverage of high-resolution, protein PDB chains that contain at least one assigned CATH domain is now 90.2% (increased from 82.3% in the previous release). A number of highly requested features have also been implemented in our web pages: allowing the user to view an alignment between their query sequence and a representative FunFam structure and providing tools that make it easier to view the full structural context (multi-domain architecture) of domains and chains.
format Online
Article
Text
id pubmed-6323983
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-63239832019-01-10 CATH: expanding the horizons of structure-based functional annotations for genome sequences Sillitoe, Ian Dawson, Natalie Lewis, Tony E Das, Sayoni Lees, Jonathan G Ashford, Paul Tolulope, Adeyelu Scholes, Harry M Senatorov, Ilya Bujan, Andra Ceballos Rodriguez-Conde, Fatima Dowling, Benjamin Thornton, Janet Orengo, Christine A Nucleic Acids Res Database Issue This article provides an update of the latest data and developments within the CATH protein structure classification database (http://www.cathdb.info). The resource provides two levels of release: CATH-B, a daily snapshot of the latest structural domain boundaries and superfamily assignments, and CATH+, which adds layers of derived data, such as predicted sequence domains, functional annotations and functional clustering (known as Functional Families or FunFams). The most recent CATH+ release (version 4.2) provides a huge update in the coverage of structural data. This release increases the number of fully- classified domains by over 40% (from 308 999 to 434 857 structural domains), corresponding to an almost two- fold increase in sequence data (from 53 million to over 95 million predicted domains) organised into 6119 superfamilies. The coverage of high-resolution, protein PDB chains that contain at least one assigned CATH domain is now 90.2% (increased from 82.3% in the previous release). A number of highly requested features have also been implemented in our web pages: allowing the user to view an alignment between their query sequence and a representative FunFam structure and providing tools that make it easier to view the full structural context (multi-domain architecture) of domains and chains. Oxford University Press 2019-01-08 2018-11-06 /pmc/articles/PMC6323983/ /pubmed/30398663 http://dx.doi.org/10.1093/nar/gky1097 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database Issue
Sillitoe, Ian
Dawson, Natalie
Lewis, Tony E
Das, Sayoni
Lees, Jonathan G
Ashford, Paul
Tolulope, Adeyelu
Scholes, Harry M
Senatorov, Ilya
Bujan, Andra
Ceballos Rodriguez-Conde, Fatima
Dowling, Benjamin
Thornton, Janet
Orengo, Christine A
CATH: expanding the horizons of structure-based functional annotations for genome sequences
title CATH: expanding the horizons of structure-based functional annotations for genome sequences
title_full CATH: expanding the horizons of structure-based functional annotations for genome sequences
title_fullStr CATH: expanding the horizons of structure-based functional annotations for genome sequences
title_full_unstemmed CATH: expanding the horizons of structure-based functional annotations for genome sequences
title_short CATH: expanding the horizons of structure-based functional annotations for genome sequences
title_sort cath: expanding the horizons of structure-based functional annotations for genome sequences
topic Database Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6323983/
https://www.ncbi.nlm.nih.gov/pubmed/30398663
http://dx.doi.org/10.1093/nar/gky1097
work_keys_str_mv AT sillitoeian cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences
AT dawsonnatalie cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences
AT lewistonye cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences
AT dassayoni cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences
AT leesjonathang cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences
AT ashfordpaul cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences
AT tolulopeadeyelu cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences
AT scholesharrym cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences
AT senatorovilya cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences
AT bujanandra cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences
AT ceballosrodriguezcondefatima cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences
AT dowlingbenjamin cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences
AT thorntonjanet cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences
AT orengochristinea cathexpandingthehorizonsofstructurebasedfunctionalannotationsforgenomesequences