Cargando…

A structural study for the optimisation of functional motifs encoded in protein sequences

BACKGROUND: A large number of PROSITE patterns select false positives and/or miss known true positives. It is possible that – at least in some cases – the weak specificity and/or sensitivity of a pattern is due to the fact that one, or maybe more, functional and/or structural key residues are not re...

Descripción completa

Detalles Bibliográficos
Autores principales: Via, Allegra, Helmer-Citterich, Manuela
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2004
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC420233/
https://www.ncbi.nlm.nih.gov/pubmed/15119965
http://dx.doi.org/10.1186/1471-2105-5-50
_version_ 1782121467470675968
author Via, Allegra
Helmer-Citterich, Manuela
author_facet Via, Allegra
Helmer-Citterich, Manuela
author_sort Via, Allegra
collection PubMed
description BACKGROUND: A large number of PROSITE patterns select false positives and/or miss known true positives. It is possible that – at least in some cases – the weak specificity and/or sensitivity of a pattern is due to the fact that one, or maybe more, functional and/or structural key residues are not represented in the pattern. Multiple sequence alignments are commonly used to build functional sequence patterns. If residues structurally conserved in proteins sharing a function cannot be aligned in a multiple sequence alignment, they are likely to be missed in a standard pattern construction procedure. RESULTS: Here we present a new procedure aimed at improving the sensitivity and/ or specificity of poorly-performing patterns. The procedure can be summarised as follows: 1. residues structurally conserved in different proteins, that are true positives for a pattern, are identified by means of a computational technique and by visual inspection. 2. the sequence positions of the structurally conserved residues falling outside the pattern are used to build extended sequence patterns. 3. the extended patterns are optimised on the SWISS-PROT database for their sensitivity and specificity. The method was applied to eight PROSITE patterns. Whenever structurally conserved residues are found in the surface region close to the pattern (seven out of eight cases), the addition of information inferred from structural analysis is shown to improve pattern selectivity and in some cases selectivity and sensitivity as well. In some of the cases considered the procedure allowed the identification of functionally interesting residues, whose biological role is also discussed. CONCLUSION: Our method can be applied to any type of functional motif or pattern (not only PROSITE ones) which is not able to select all and only the true positive hits and for which at least two true positive structures are available. The computational technique for the identification of structurally conserved residues is already available on request and will be soon accessible on our web server. The procedure is intended for the use of pattern database curators and of scientists interested in a specific protein family for which no specific or selective patterns are yet available.
format Text
id pubmed-420233
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-4202332004-06-06 A structural study for the optimisation of functional motifs encoded in protein sequences Via, Allegra Helmer-Citterich, Manuela BMC Bioinformatics Methodology Article BACKGROUND: A large number of PROSITE patterns select false positives and/or miss known true positives. It is possible that – at least in some cases – the weak specificity and/or sensitivity of a pattern is due to the fact that one, or maybe more, functional and/or structural key residues are not represented in the pattern. Multiple sequence alignments are commonly used to build functional sequence patterns. If residues structurally conserved in proteins sharing a function cannot be aligned in a multiple sequence alignment, they are likely to be missed in a standard pattern construction procedure. RESULTS: Here we present a new procedure aimed at improving the sensitivity and/ or specificity of poorly-performing patterns. The procedure can be summarised as follows: 1. residues structurally conserved in different proteins, that are true positives for a pattern, are identified by means of a computational technique and by visual inspection. 2. the sequence positions of the structurally conserved residues falling outside the pattern are used to build extended sequence patterns. 3. the extended patterns are optimised on the SWISS-PROT database for their sensitivity and specificity. The method was applied to eight PROSITE patterns. Whenever structurally conserved residues are found in the surface region close to the pattern (seven out of eight cases), the addition of information inferred from structural analysis is shown to improve pattern selectivity and in some cases selectivity and sensitivity as well. In some of the cases considered the procedure allowed the identification of functionally interesting residues, whose biological role is also discussed. CONCLUSION: Our method can be applied to any type of functional motif or pattern (not only PROSITE ones) which is not able to select all and only the true positive hits and for which at least two true positive structures are available. The computational technique for the identification of structurally conserved residues is already available on request and will be soon accessible on our web server. The procedure is intended for the use of pattern database curators and of scientists interested in a specific protein family for which no specific or selective patterns are yet available. BioMed Central 2004-04-30 /pmc/articles/PMC420233/ /pubmed/15119965 http://dx.doi.org/10.1186/1471-2105-5-50 Text en Copyright © 2004 Via and Helmer-Citterich; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.
spellingShingle Methodology Article
Via, Allegra
Helmer-Citterich, Manuela
A structural study for the optimisation of functional motifs encoded in protein sequences
title A structural study for the optimisation of functional motifs encoded in protein sequences
title_full A structural study for the optimisation of functional motifs encoded in protein sequences
title_fullStr A structural study for the optimisation of functional motifs encoded in protein sequences
title_full_unstemmed A structural study for the optimisation of functional motifs encoded in protein sequences
title_short A structural study for the optimisation of functional motifs encoded in protein sequences
title_sort structural study for the optimisation of functional motifs encoded in protein sequences
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC420233/
https://www.ncbi.nlm.nih.gov/pubmed/15119965
http://dx.doi.org/10.1186/1471-2105-5-50
work_keys_str_mv AT viaallegra astructuralstudyfortheoptimisationoffunctionalmotifsencodedinproteinsequences
AT helmercitterichmanuela astructuralstudyfortheoptimisationoffunctionalmotifsencodedinproteinsequences
AT viaallegra structuralstudyfortheoptimisationoffunctionalmotifsencodedinproteinsequences
AT helmercitterichmanuela structuralstudyfortheoptimisationoffunctionalmotifsencodedinproteinsequences