Cargando…
A Computational Strategy for Protein Function Assignment Which Addresses the Multidomain Problem
A method for assigning functions to unknown sequences based on finding correlations between short signals and functional annotations in a protein database is presented. This approach is based on keyword (KW) and feature (FT) information stored in the SWISS-PROT database. The former refers to particu...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2002
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447339/ https://www.ncbi.nlm.nih.gov/pubmed/18629055 http://dx.doi.org/10.1002/cfg.208 |
_version_ | 1782156915089866752 |
---|---|
author | Pérez, A. J. Rodríguez, A. Trelles, O. Thode, G. |
author_facet | Pérez, A. J. Rodríguez, A. Trelles, O. Thode, G. |
author_sort | Pérez, A. J. |
collection | PubMed |
description | A method for assigning functions to unknown sequences based on finding correlations between short signals and functional annotations in a protein database is presented. This approach is based on keyword (KW) and feature (FT) information stored in the SWISS-PROT database. The former refers to particular protein characteristics and the latter locates these characteristics at a specific sequence position. In this way, a certain keyword is only assigned to a sequence if sequence similarity is found in the position described by the FT field. Exhaustive tests performed over sequences with homologues (cluster set) and without homologues (singleton set) in the database show that assigning functions is much ’cleaner’ when information about domains (FT field) is used, than when only the keywords are used. |
format | Text |
id | pubmed-2447339 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2002 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-24473392008-07-14 A Computational Strategy for Protein Function Assignment Which Addresses the Multidomain Problem Pérez, A. J. Rodríguez, A. Trelles, O. Thode, G. Comp Funct Genomics Research Article A method for assigning functions to unknown sequences based on finding correlations between short signals and functional annotations in a protein database is presented. This approach is based on keyword (KW) and feature (FT) information stored in the SWISS-PROT database. The former refers to particular protein characteristics and the latter locates these characteristics at a specific sequence position. In this way, a certain keyword is only assigned to a sequence if sequence similarity is found in the position described by the FT field. Exhaustive tests performed over sequences with homologues (cluster set) and without homologues (singleton set) in the database show that assigning functions is much ’cleaner’ when information about domains (FT field) is used, than when only the keywords are used. Hindawi Publishing Corporation 2002-10 /pmc/articles/PMC2447339/ /pubmed/18629055 http://dx.doi.org/10.1002/cfg.208 Text en Copyright © 2002 Hindawi Publishing Corporation. http://creativecommons.org/licenses/by/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Pérez, A. J. Rodríguez, A. Trelles, O. Thode, G. A Computational Strategy for Protein Function Assignment Which Addresses the Multidomain Problem |
title | A Computational Strategy for Protein Function Assignment Which Addresses the Multidomain Problem |
title_full | A Computational Strategy for Protein Function Assignment Which Addresses the Multidomain Problem |
title_fullStr | A Computational Strategy for Protein Function Assignment Which Addresses the Multidomain Problem |
title_full_unstemmed | A Computational Strategy for Protein Function Assignment Which Addresses the Multidomain Problem |
title_short | A Computational Strategy for Protein Function Assignment Which Addresses the Multidomain Problem |
title_sort | computational strategy for protein function assignment which addresses the multidomain problem |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447339/ https://www.ncbi.nlm.nih.gov/pubmed/18629055 http://dx.doi.org/10.1002/cfg.208 |
work_keys_str_mv | AT perezaj acomputationalstrategyforproteinfunctionassignmentwhichaddressesthemultidomainproblem AT rodrigueza acomputationalstrategyforproteinfunctionassignmentwhichaddressesthemultidomainproblem AT trelleso acomputationalstrategyforproteinfunctionassignmentwhichaddressesthemultidomainproblem AT thodeg acomputationalstrategyforproteinfunctionassignmentwhichaddressesthemultidomainproblem AT perezaj computationalstrategyforproteinfunctionassignmentwhichaddressesthemultidomainproblem AT rodrigueza computationalstrategyforproteinfunctionassignmentwhichaddressesthemultidomainproblem AT trelleso computationalstrategyforproteinfunctionassignmentwhichaddressesthemultidomainproblem AT thodeg computationalstrategyforproteinfunctionassignmentwhichaddressesthemultidomainproblem |