Cargando…
A structural study for the optimisation of functional motifs encoded in protein sequences
BACKGROUND: A large number of PROSITE patterns select false positives and/or miss known true positives. It is possible that – at least in some cases – the weak specificity and/or sensitivity of a pattern is due to the fact that one, or maybe more, functional and/or structural key residues are not re...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2004
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC420233/ https://www.ncbi.nlm.nih.gov/pubmed/15119965 http://dx.doi.org/10.1186/1471-2105-5-50 |
_version_ | 1782121467470675968 |
---|---|
author | Via, Allegra Helmer-Citterich, Manuela |
author_facet | Via, Allegra Helmer-Citterich, Manuela |
author_sort | Via, Allegra |
collection | PubMed |
description | BACKGROUND: A large number of PROSITE patterns select false positives and/or miss known true positives. It is possible that – at least in some cases – the weak specificity and/or sensitivity of a pattern is due to the fact that one, or maybe more, functional and/or structural key residues are not represented in the pattern. Multiple sequence alignments are commonly used to build functional sequence patterns. If residues structurally conserved in proteins sharing a function cannot be aligned in a multiple sequence alignment, they are likely to be missed in a standard pattern construction procedure. RESULTS: Here we present a new procedure aimed at improving the sensitivity and/ or specificity of poorly-performing patterns. The procedure can be summarised as follows: 1. residues structurally conserved in different proteins, that are true positives for a pattern, are identified by means of a computational technique and by visual inspection. 2. the sequence positions of the structurally conserved residues falling outside the pattern are used to build extended sequence patterns. 3. the extended patterns are optimised on the SWISS-PROT database for their sensitivity and specificity. The method was applied to eight PROSITE patterns. Whenever structurally conserved residues are found in the surface region close to the pattern (seven out of eight cases), the addition of information inferred from structural analysis is shown to improve pattern selectivity and in some cases selectivity and sensitivity as well. In some of the cases considered the procedure allowed the identification of functionally interesting residues, whose biological role is also discussed. CONCLUSION: Our method can be applied to any type of functional motif or pattern (not only PROSITE ones) which is not able to select all and only the true positive hits and for which at least two true positive structures are available. The computational technique for the identification of structurally conserved residues is already available on request and will be soon accessible on our web server. The procedure is intended for the use of pattern database curators and of scientists interested in a specific protein family for which no specific or selective patterns are yet available. |
format | Text |
id | pubmed-420233 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2004 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-4202332004-06-06 A structural study for the optimisation of functional motifs encoded in protein sequences Via, Allegra Helmer-Citterich, Manuela BMC Bioinformatics Methodology Article BACKGROUND: A large number of PROSITE patterns select false positives and/or miss known true positives. It is possible that – at least in some cases – the weak specificity and/or sensitivity of a pattern is due to the fact that one, or maybe more, functional and/or structural key residues are not represented in the pattern. Multiple sequence alignments are commonly used to build functional sequence patterns. If residues structurally conserved in proteins sharing a function cannot be aligned in a multiple sequence alignment, they are likely to be missed in a standard pattern construction procedure. RESULTS: Here we present a new procedure aimed at improving the sensitivity and/ or specificity of poorly-performing patterns. The procedure can be summarised as follows: 1. residues structurally conserved in different proteins, that are true positives for a pattern, are identified by means of a computational technique and by visual inspection. 2. the sequence positions of the structurally conserved residues falling outside the pattern are used to build extended sequence patterns. 3. the extended patterns are optimised on the SWISS-PROT database for their sensitivity and specificity. The method was applied to eight PROSITE patterns. Whenever structurally conserved residues are found in the surface region close to the pattern (seven out of eight cases), the addition of information inferred from structural analysis is shown to improve pattern selectivity and in some cases selectivity and sensitivity as well. In some of the cases considered the procedure allowed the identification of functionally interesting residues, whose biological role is also discussed. CONCLUSION: Our method can be applied to any type of functional motif or pattern (not only PROSITE ones) which is not able to select all and only the true positive hits and for which at least two true positive structures are available. The computational technique for the identification of structurally conserved residues is already available on request and will be soon accessible on our web server. The procedure is intended for the use of pattern database curators and of scientists interested in a specific protein family for which no specific or selective patterns are yet available. BioMed Central 2004-04-30 /pmc/articles/PMC420233/ /pubmed/15119965 http://dx.doi.org/10.1186/1471-2105-5-50 Text en Copyright © 2004 Via and Helmer-Citterich; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL. |
spellingShingle | Methodology Article Via, Allegra Helmer-Citterich, Manuela A structural study for the optimisation of functional motifs encoded in protein sequences |
title | A structural study for the optimisation of functional motifs encoded in protein sequences |
title_full | A structural study for the optimisation of functional motifs encoded in protein sequences |
title_fullStr | A structural study for the optimisation of functional motifs encoded in protein sequences |
title_full_unstemmed | A structural study for the optimisation of functional motifs encoded in protein sequences |
title_short | A structural study for the optimisation of functional motifs encoded in protein sequences |
title_sort | structural study for the optimisation of functional motifs encoded in protein sequences |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC420233/ https://www.ncbi.nlm.nih.gov/pubmed/15119965 http://dx.doi.org/10.1186/1471-2105-5-50 |
work_keys_str_mv | AT viaallegra astructuralstudyfortheoptimisationoffunctionalmotifsencodedinproteinsequences AT helmercitterichmanuela astructuralstudyfortheoptimisationoffunctionalmotifsencodedinproteinsequences AT viaallegra structuralstudyfortheoptimisationoffunctionalmotifsencodedinproteinsequences AT helmercitterichmanuela structuralstudyfortheoptimisationoffunctionalmotifsencodedinproteinsequences |