
A Computational Strategy for Protein Function Assignment Which Addresses the Multidomain Problem

A method for assigning functions to unknown sequences based on finding correlations between short signals and functional annotations in a protein database is presented. This approach is based on keyword (KW) and feature (FT) information stored in the SWISS-PROT database. The former refers to particu...


Bibliographic Details
Main Authors: Pérez, A. J., Rodríguez, A., Trelles, O., Thode, G.
Format: Text
Language: English
Published: Hindawi Publishing Corporation 2002
Subjects: Research Article
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447339/
https://www.ncbi.nlm.nih.gov/pubmed/18629055
http://dx.doi.org/10.1002/cfg.208
author Pérez, A. J.
Rodríguez, A.
Trelles, O.
Thode, G.
collection PubMed
description A method is presented for assigning functions to unknown sequences by finding correlations between short signals and functional annotations in a protein database. The approach relies on the keyword (KW) and feature (FT) information stored in the SWISS-PROT database. The former describes particular protein characteristics, and the latter locates those characteristics at specific sequence positions. A keyword is therefore assigned to a sequence only if sequence similarity is found at the position described by the FT field. Exhaustive tests on sequences with homologues (cluster set) and without homologues (singleton set) in the database show that function assignment is much 'cleaner' when domain information (FT field) is used than when only the keywords are used.
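The position-dependent rule in this description (a keyword is transferred only when the similarity falls on the region given by the FT field) can be illustrated with a minimal sketch. It is not the authors' implementation: the `Feature` record, the one-keyword-per-feature pairing, the `min_fraction` coverage threshold, and the example protein are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    """Hypothetical FT-style annotation: a keyword tied to a residue range
    on the database protein (1-based, inclusive)."""
    keyword: str
    start: int
    end: int


def overlap_fraction(feature, hit_start, hit_end):
    """Fraction of the feature's residues covered by the aligned region."""
    covered = min(feature.end, hit_end) - max(feature.start, hit_start) + 1
    return max(covered, 0) / (feature.end - feature.start + 1)


def assign_keywords(features, hit_start, hit_end, min_fraction=0.5):
    """Return the keywords whose feature region is sufficiently covered by
    the region of the database protein matched by the query.
    min_fraction is an assumed threshold, not a value from the paper."""
    return sorted({
        f.keyword
        for f in features
        if overlap_fraction(f, hit_start, hit_end) >= min_fraction
    })


# Hypothetical two-domain database protein.
features = [Feature("Kinase", 10, 110), Feature("DNA-binding", 200, 260)]

# The query matches only the first domain, so only that keyword is transferred.
print(assign_keywords(features, 5, 120))
# The query matches a linker region between the domains: nothing is transferred.
print(assign_keywords(features, 150, 180))
```

Assigning keywords from the whole entry would give both annotations to any hit against this protein; restricting the transfer to the matched region is what keeps a single-domain query from inheriting the other domain's function.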
format Text
id pubmed-2447339
institution National Center for Biotechnology Information
language English
publishDate 2002
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-2447339 2008-07-14 Comp Funct Genomics Research Article Hindawi Publishing Corporation 2002-10 /pmc/articles/PMC2447339/ /pubmed/18629055 http://dx.doi.org/10.1002/cfg.208 Text en Copyright © 2002 Hindawi Publishing Corporation. http://creativecommons.org/licenses/by/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title A Computational Strategy for Protein Function Assignment Which Addresses the Multidomain Problem
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447339/
https://www.ncbi.nlm.nih.gov/pubmed/18629055
http://dx.doi.org/10.1002/cfg.208