Cargando…

The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis

The CATH database of protein domain structures (http://www.biochem.ucl.ac.uk/bsm/cath/) currently contains 43 229 domains classified into 1467 superfamilies and 5107 sequence families. Each structural family is expanded with sequence relatives from GenBank and completed genomes, using a variety of e...

Descripción completa

Detalles Bibliográficos
Autores principales: Pearl, Frances, Todd, Annabel, Sillitoe, Ian, Dibley, Mark, Redfern, Oliver, Lewis, Tony, Bennett, Christopher, Marsden, Russell, Grant, Alistair, Lee, David, Akpor, Adrian, Maibaum, Michael, Harrison, Andrew, Dallman, Timothy, Reeves, Gabrielle, Diboun, Ilhem, Addou, Sarah, Lise, Stefano, Johnston, Caroline, Sillero, Antonio, Thornton, Janet, Orengo, Christine
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC539978/
https://www.ncbi.nlm.nih.gov/pubmed/15608188
http://dx.doi.org/10.1093/nar/gki024
_version_ 1782122095249981440
author Pearl, Frances
Todd, Annabel
Sillitoe, Ian
Dibley, Mark
Redfern, Oliver
Lewis, Tony
Bennett, Christopher
Marsden, Russell
Grant, Alistair
Lee, David
Akpor, Adrian
Maibaum, Michael
Harrison, Andrew
Dallman, Timothy
Reeves, Gabrielle
Diboun, Ilhem
Addou, Sarah
Lise, Stefano
Johnston, Caroline
Sillero, Antonio
Thornton, Janet
Orengo, Christine
author_facet Pearl, Frances
Todd, Annabel
Sillitoe, Ian
Dibley, Mark
Redfern, Oliver
Lewis, Tony
Bennett, Christopher
Marsden, Russell
Grant, Alistair
Lee, David
Akpor, Adrian
Maibaum, Michael
Harrison, Andrew
Dallman, Timothy
Reeves, Gabrielle
Diboun, Ilhem
Addou, Sarah
Lise, Stefano
Johnston, Caroline
Sillero, Antonio
Thornton, Janet
Orengo, Christine
author_sort Pearl, Frances
collection PubMed
description The CATH database of protein domain structures (http://www.biochem.ucl.ac.uk/bsm/cath/) currently contains 43 229 domains classified into 1467 superfamilies and 5107 sequence families. Each structural family is expanded with sequence relatives from GenBank and completed genomes, using a variety of efficient sequence search protocols and reliable thresholds. This extended CATH protein family database contains 616 470 domain sequences classified into 23 876 sequence families. This results in the significant expansion of the CATH HMM model library to include models built from the CATH sequence relatives, giving a 10% increase in coverage for detecting remote homologues. An improved Dictionary of Homologous superfamilies (DHS) (http://www.biochem.ucl.ac.uk/bsm/dhs/) containing specific sequence, structural and functional information for each superfamily in CATH considerably assists manual validation of homologues. Information on sequence relatives in CATH superfamilies, GenBank and completed genomes is presented in the CATH associated DHS and Gene3D resources. Domain partnership information can be obtained from Gene3D (http://www.biochem.ucl.ac.uk/bsm/cath/Gene3D/). A new CATH server has been implemented (http://www.biochem.ucl.ac.uk/cgi-bin/cath/CathServer.pl) providing automatic classification of newly determined sequences and structures using a suite of rapid sequence and structure comparison methods. The statistical significance of matches is assessed and links are provided to the putative superfamily or fold group to which the query sequence or structure is assigned.
format Text
id pubmed-539978
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-5399782005-01-04 The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis Pearl, Frances Todd, Annabel Sillitoe, Ian Dibley, Mark Redfern, Oliver Lewis, Tony Bennett, Christopher Marsden, Russell Grant, Alistair Lee, David Akpor, Adrian Maibaum, Michael Harrison, Andrew Dallman, Timothy Reeves, Gabrielle Diboun, Ilhem Addou, Sarah Lise, Stefano Johnston, Caroline Sillero, Antonio Thornton, Janet Orengo, Christine Nucleic Acids Res Articles The CATH database of protein domain structures (http://www.biochem.ucl.ac.uk/bsm/cath/) currently contains 43 229 domains classified into 1467 superfamilies and 5107 sequence families. Each structural family is expanded with sequence relatives from GenBank and completed genomes, using a variety of efficient sequence search protocols and reliable thresholds. This extended CATH protein family database contains 616 470 domain sequences classified into 23 876 sequence families. This results in the significant expansion of the CATH HMM model library to include models built from the CATH sequence relatives, giving a 10% increase in coverage for detecting remote homologues. An improved Dictionary of Homologous superfamilies (DHS) (http://www.biochem.ucl.ac.uk/bsm/dhs/) containing specific sequence, structural and functional information for each superfamily in CATH considerably assists manual validation of homologues. Information on sequence relatives in CATH superfamilies, GenBank and completed genomes is presented in the CATH associated DHS and Gene3D resources. Domain partnership information can be obtained from Gene3D (http://www.biochem.ucl.ac.uk/bsm/cath/Gene3D/). A new CATH server has been implemented (http://www.biochem.ucl.ac.uk/cgi-bin/cath/CathServer.pl) providing automatic classification of newly determined sequences and structures using a suite of rapid sequence and structure comparison methods. The statistical significance of matches is assessed and links are provided to the putative superfamily or fold group to which the query sequence or structure is assigned. Oxford University Press 2005-01-01 2004-12-17 /pmc/articles/PMC539978/ /pubmed/15608188 http://dx.doi.org/10.1093/nar/gki024 Text en Copyright © 2005 Oxford University Press
spellingShingle Articles
Pearl, Frances
Todd, Annabel
Sillitoe, Ian
Dibley, Mark
Redfern, Oliver
Lewis, Tony
Bennett, Christopher
Marsden, Russell
Grant, Alistair
Lee, David
Akpor, Adrian
Maibaum, Michael
Harrison, Andrew
Dallman, Timothy
Reeves, Gabrielle
Diboun, Ilhem
Addou, Sarah
Lise, Stefano
Johnston, Caroline
Sillero, Antonio
Thornton, Janet
Orengo, Christine
The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis
title The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis
title_full The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis
title_fullStr The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis
title_full_unstemmed The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis
title_short The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis
title_sort cath domain structure database and related resources gene3d and dhs provide comprehensive domain family information for genome analysis
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC539978/
https://www.ncbi.nlm.nih.gov/pubmed/15608188
http://dx.doi.org/10.1093/nar/gki024
work_keys_str_mv AT pearlfrances thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT toddannabel thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT sillitoeian thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT dibleymark thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT redfernoliver thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT lewistony thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT bennettchristopher thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT marsdenrussell thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT grantalistair thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT leedavid thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT akporadrian thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT maibaummichael thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT harrisonandrew thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT dallmantimothy thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT reevesgabrielle thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT dibounilhem thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT addousarah thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT lisestefano thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT johnstoncaroline thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT silleroantonio thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT thorntonjanet thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT orengochristine thecathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT pearlfrances cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT toddannabel cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT sillitoeian cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT dibleymark cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT redfernoliver cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT lewistony cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT bennettchristopher cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT marsdenrussell cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT grantalistair cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT leedavid cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT akporadrian cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT maibaummichael cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT harrisonandrew cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT dallmantimothy cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT reevesgabrielle cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT dibounilhem cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT addousarah cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT lisestefano cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT johnstoncaroline cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT silleroantonio cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT thorntonjanet cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis
AT orengochristine cathdomainstructuredatabaseandrelatedresourcesgene3danddhsprovidecomprehensivedomainfamilyinformationforgenomeanalysis