Cargando…

A Computational Strategy for Protein Function Assignment Which Addresses the Multidomain Problem

A method for assigning functions to unknown sequences based on finding correlations between short signals and functional annotations in a protein database is presented. This approach is based on keyword (KW) and feature (FT) information stored in the SWISS-PROT database. The former refers to particu...

Descripción completa

Detalles Bibliográficos
Autores principales: Pérez, A. J., Rodríguez, A., Trelles, O., Thode, G.
Formato: Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2002
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447339/
https://www.ncbi.nlm.nih.gov/pubmed/18629055
http://dx.doi.org/10.1002/cfg.208
Descripción
Sumario:A method for assigning functions to unknown sequences based on finding correlations between short signals and functional annotations in a protein database is presented. This approach is based on keyword (KW) and feature (FT) information stored in the SWISS-PROT database. The former refers to particular protein characteristics and the latter locates these characteristics at a specific sequence position. In this way, a certain keyword is only assigned to a sequence if sequence similarity is found in the position described by the FT field. Exhaustive tests performed over sequences with homologues (cluster set) and without homologues (singleton set) in the database show that assigning functions is much ’cleaner’ when information about domains (FT field) is used, than when only the keywords are used.