Cargando…
Recruitment of rare 3-grams at functional sites: Is this a mechanism for increasing enzyme specificity?
BACKGROUND: A wealth of unannotated and functionally unknown protein sequences has accumulated in recent years with rapid progresses in sequence genomics, giving rise to ever increasing demands for developing methods to efficiently assess functional sites. Sequence and structure conservations have t...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1950313/ https://www.ncbi.nlm.nih.gov/pubmed/17598909 http://dx.doi.org/10.1186/1471-2105-8-226 |
_version_ | 1782134542463664128 |
---|---|
author | Tobi, Dror Bahar, Ivet |
author_facet | Tobi, Dror Bahar, Ivet |
author_sort | Tobi, Dror |
collection | PubMed |
description | BACKGROUND: A wealth of unannotated and functionally unknown protein sequences has accumulated in recent years with rapid progresses in sequence genomics, giving rise to ever increasing demands for developing methods to efficiently assess functional sites. Sequence and structure conservations have traditionally been the major criteria adopted in various algorithms to identify functional sites. Here, we focus on the distributions of the 20(3 )different types of 3-grams (or triplets of sequentially contiguous amino acid) in the entire space of sequences accumulated to date in the UniProt database, and focus in particular on the rare 3-grams distinguished by their high entropy-based information content. RESULTS: Comparison of the UniProt distributions with those observed near/at the active sites on a non-redundant dataset of 59 enzyme/ligand complexes shows that the active sites preferentially recruit 3-grams distinguished by their low frequency in the UniProt. Three cases, Src kinase, hemoglobin, and tyrosyl-tRNA synthetase, are discussed in details to illustrate the biological significance of the results. CONCLUSION: The results suggest that recruitment of rare 3-grams may be an efficient mechanism for increasing specificity at functional sites. Rareness/scarcity emerges as a feature that may assist in identifying key sites for proteins function, providing information complementary to that derived from sequence alignments. In addition it provides us (for the first time) with a means of identifying potentially functional sites from sequence information alone, when sequence conservation properties are not available. |
format | Text |
id | pubmed-1950313 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-19503132007-08-21 Recruitment of rare 3-grams at functional sites: Is this a mechanism for increasing enzyme specificity? Tobi, Dror Bahar, Ivet BMC Bioinformatics Research Article BACKGROUND: A wealth of unannotated and functionally unknown protein sequences has accumulated in recent years with rapid progresses in sequence genomics, giving rise to ever increasing demands for developing methods to efficiently assess functional sites. Sequence and structure conservations have traditionally been the major criteria adopted in various algorithms to identify functional sites. Here, we focus on the distributions of the 20(3 )different types of 3-grams (or triplets of sequentially contiguous amino acid) in the entire space of sequences accumulated to date in the UniProt database, and focus in particular on the rare 3-grams distinguished by their high entropy-based information content. RESULTS: Comparison of the UniProt distributions with those observed near/at the active sites on a non-redundant dataset of 59 enzyme/ligand complexes shows that the active sites preferentially recruit 3-grams distinguished by their low frequency in the UniProt. Three cases, Src kinase, hemoglobin, and tyrosyl-tRNA synthetase, are discussed in details to illustrate the biological significance of the results. CONCLUSION: The results suggest that recruitment of rare 3-grams may be an efficient mechanism for increasing specificity at functional sites. Rareness/scarcity emerges as a feature that may assist in identifying key sites for proteins function, providing information complementary to that derived from sequence alignments. In addition it provides us (for the first time) with a means of identifying potentially functional sites from sequence information alone, when sequence conservation properties are not available. BioMed Central 2007-06-28 /pmc/articles/PMC1950313/ /pubmed/17598909 http://dx.doi.org/10.1186/1471-2105-8-226 Text en Copyright © 2007 Tobi and Bahar; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Tobi, Dror Bahar, Ivet Recruitment of rare 3-grams at functional sites: Is this a mechanism for increasing enzyme specificity? |
title | Recruitment of rare 3-grams at functional sites: Is this a mechanism for increasing enzyme specificity? |
title_full | Recruitment of rare 3-grams at functional sites: Is this a mechanism for increasing enzyme specificity? |
title_fullStr | Recruitment of rare 3-grams at functional sites: Is this a mechanism for increasing enzyme specificity? |
title_full_unstemmed | Recruitment of rare 3-grams at functional sites: Is this a mechanism for increasing enzyme specificity? |
title_short | Recruitment of rare 3-grams at functional sites: Is this a mechanism for increasing enzyme specificity? |
title_sort | recruitment of rare 3-grams at functional sites: is this a mechanism for increasing enzyme specificity? |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1950313/ https://www.ncbi.nlm.nih.gov/pubmed/17598909 http://dx.doi.org/10.1186/1471-2105-8-226 |
work_keys_str_mv | AT tobidror recruitmentofrare3gramsatfunctionalsitesisthisamechanismforincreasingenzymespecificity AT baharivet recruitmentofrare3gramsatfunctionalsitesisthisamechanismforincreasingenzymespecificity |