Cargando…
The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis
The CATH database of protein domain structures (http://www.biochem.ucl.ac.uk/bsm/cath/) currently contains 43 229 domains classified into 1467 superfamilies and 5107 sequence families. Each structural family is expanded with sequence relatives from GenBank and completed genomes, using a variety of e...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2005
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC539978/ https://www.ncbi.nlm.nih.gov/pubmed/15608188 http://dx.doi.org/10.1093/nar/gki024 |
_version_ | 1782122095249981440 |
---|---|
author | Pearl, Frances Todd, Annabel Sillitoe, Ian Dibley, Mark Redfern, Oliver Lewis, Tony Bennett, Christopher Marsden, Russell Grant, Alistair Lee, David Akpor, Adrian Maibaum, Michael Harrison, Andrew Dallman, Timothy Reeves, Gabrielle Diboun, Ilhem Addou, Sarah Lise, Stefano Johnston, Caroline Sillero, Antonio Thornton, Janet Orengo, Christine |
author_facet | Pearl, Frances Todd, Annabel Sillitoe, Ian Dibley, Mark Redfern, Oliver Lewis, Tony Bennett, Christopher Marsden, Russell Grant, Alistair Lee, David Akpor, Adrian Maibaum, Michael Harrison, Andrew Dallman, Timothy Reeves, Gabrielle Diboun, Ilhem Addou, Sarah Lise, Stefano Johnston, Caroline Sillero, Antonio Thornton, Janet Orengo, Christine |
author_sort | Pearl, Frances |
collection | PubMed |
description | The CATH database of protein domain structures (http://www.biochem.ucl.ac.uk/bsm/cath/) currently contains 43 229 domains classified into 1467 superfamilies and 5107 sequence families. Each structural family is expanded with sequence relatives from GenBank and completed genomes, using a variety of efficient sequence search protocols and reliable thresholds. This extended CATH protein family database contains 616 470 domain sequences classified into 23 876 sequence families. This results in the significant expansion of the CATH HMM model library to include models built from the CATH sequence relatives, giving a 10% increase in coverage for detecting remote homologues. An improved Dictionary of Homologous superfamilies (DHS) (http://www.biochem.ucl.ac.uk/bsm/dhs/) containing specific sequence, structural and functional information for each superfamily in CATH considerably assists manual validation of homologues. Information on sequence relatives in CATH superfamilies, GenBank and completed genomes is presented in the CATH associated DHS and Gene3D resources. Domain partnership information can be obtained from Gene3D (http://www.biochem.ucl.ac.uk/bsm/cath/Gene3D/). A new CATH server has been implemented (http://www.biochem.ucl.ac.uk/cgi-bin/cath/CathServer.pl) providing automatic classification of newly determined sequences and structures using a suite of rapid sequence and structure comparison methods. The statistical significance of matches is assessed and links are provided to the putative superfamily or fold group to which the query sequence or structure is assigned. |
format | Text |
id | pubmed-539978 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2005 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-5399782005-01-04 The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis Pearl, Frances Todd, Annabel Sillitoe, Ian Dibley, Mark Redfern, Oliver Lewis, Tony Bennett, Christopher Marsden, Russell Grant, Alistair Lee, David Akpor, Adrian Maibaum, Michael Harrison, Andrew Dallman, Timothy Reeves, Gabrielle Diboun, Ilhem Addou, Sarah Lise, Stefano Johnston, Caroline Sillero, Antonio Thornton, Janet Orengo, Christine Nucleic Acids Res Articles The CATH database of protein domain structures (http://www.biochem.ucl.ac.uk/bsm/cath/) currently contains 43 229 domains classified into 1467 superfamilies and 5107 sequence families. Each structural family is expanded with sequence relatives from GenBank and completed genomes, using a variety of efficient sequence search protocols and reliable thresholds. This extended CATH protein family database contains 616 470 domain sequences classified into 23 876 sequence families. This results in the significant expansion of the CATH HMM model library to include models built from the CATH sequence relatives, giving a 10% increase in coverage for detecting remote homologues. An improved Dictionary of Homologous superfamilies (DHS) (http://www.biochem.ucl.ac.uk/bsm/dhs/) containing specific sequence, structural and functional information for each superfamily in CATH considerably assists manual validation of homologues. Information on sequence relatives in CATH superfamilies, GenBank and completed genomes is presented in the CATH associated DHS and Gene3D resources. Domain partnership information can be obtained from Gene3D (http://www.biochem.ucl.ac.uk/bsm/cath/Gene3D/). A new CATH server has been implemented (http://www.biochem.ucl.ac.uk/cgi-bin/cath/CathServer.pl) providing automatic classification of newly determined sequences and structures using a suite of rapid sequence and structure comparison methods. The statistical significance of matches is assessed and links are provided to the putative superfamily or fold group to which the query sequence or structure is assigned. Oxford University Press 2005-01-01 2004-12-17 /pmc/articles/PMC539978/ /pubmed/15608188 http://dx.doi.org/10.1093/nar/gki024 Text en Copyright © 2005 Oxford University Press |
spellingShingle | Articles Pearl, Frances Todd, Annabel Sillitoe, Ian Dibley, Mark Redfern, Oliver Lewis, Tony Bennett, Christopher Marsden, Russell Grant, Alistair Lee, David Akpor, Adrian Maibaum, Michael Harrison, Andrew Dallman, Timothy Reeves, Gabrielle Diboun, Ilhem Addou, Sarah Lise, Stefano Johnston, Caroline Sillero, Antonio Thornton, Janet Orengo, Christine The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis |
title | The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis |
title_full | The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis |
title_fullStr | The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis |
title_full_unstemmed | The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis |
title_short | The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis |
title_sort | cath domain structure database and related resources gene3d and dhs provide comprehensive domain family information for genome analysis |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC539978/ https://www.ncbi.nlm.nih.gov/pubmed/15608188 http://dx.doi.org/10.1093/nar/gki024 |
work_keys_str_mv | AT pearlfrances thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT toddannabel thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT sillitoeian thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT dibleymark thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT redfernoliver thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT lewistony thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT bennettchristopher thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT marsdenrussell thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT grantalistair thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT leedavid thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT akporadrian thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT maibaummichael thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT harrisonandrew thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT dallmantimothy thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT reevesgabrielle thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT dibounilhem thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT addousarah thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT lisestefano thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT johnstoncaroline thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT silleroantonio thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT thorntonjanet thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT orengochristine thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT pearlfrances cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT toddannabel cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT sillitoeian cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT dibleymark cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT redfernoliver cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT lewistony cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT bennettchristopher cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT marsdenrussell cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT grantalistair cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT leedavid cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT akporadrian cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT maibaummichael cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT harrisonandrew cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT dallmantimothy cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT reevesgabrielle cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT dibounilhem cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT addousarah cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT lisestefano cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT johnstoncaroline cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT silleroantonio cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT thorntonjanet cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis AT orengochristine cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis |