Cargando…
Sequence comparison by sequence harmony identifies subtype-specific functional sites
Multiple sequence alignments are often used to reveal functionally important residues within a protein family. They can be particularly useful for the identification of key residues that determine functional differences between protein subfamilies. We present a new entropy-based method, Sequence Har...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1702503/ https://www.ncbi.nlm.nih.gov/pubmed/17130172 http://dx.doi.org/10.1093/nar/gkl901 |
_version_ | 1782131260050636800 |
---|---|
author | Pirovano, Walter Feenstra, K. Anton Heringa, Jaap |
author_facet | Pirovano, Walter Feenstra, K. Anton Heringa, Jaap |
author_sort | Pirovano, Walter |
collection | PubMed |
description | Multiple sequence alignments are often used to reveal functionally important residues within a protein family. They can be particularly useful for the identification of key residues that determine functional differences between protein subfamilies. We present a new entropy-based method, Sequence Harmony (SH) that accurately detects subfamily-specific positions from a multiple sequence alignment. The SH algorithm implements a novel formula, able to score compositional differences between subfamilies, without imposing conservation, in a simple manner on an intuitive scale. We compare our method with the most important published methods, i.e. AMAS, TreeDet and SDP-pred, using three well-studied protein families: the receptor-binding domain (MH2) of the Smad family of transcription factors, the Ras-superfamily of small GTPases and the MIP-family of integral membrane transporters. We demonstrate that SH accurately selects known functional sites with higher coverage than the other methods for these test-cases. This shows that compositional differences between protein subfamilies provide sufficient basis for identification of functional sites. In addition, SH selects a number of sites of unknown function that could be interesting candidates for further experimental investigation. |
format | Text |
id | pubmed-1702503 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-17025032006-12-26 Sequence comparison by sequence harmony identifies subtype-specific functional sites Pirovano, Walter Feenstra, K. Anton Heringa, Jaap Nucleic Acids Res Computational Biology Multiple sequence alignments are often used to reveal functionally important residues within a protein family. They can be particularly useful for the identification of key residues that determine functional differences between protein subfamilies. We present a new entropy-based method, Sequence Harmony (SH) that accurately detects subfamily-specific positions from a multiple sequence alignment. The SH algorithm implements a novel formula, able to score compositional differences between subfamilies, without imposing conservation, in a simple manner on an intuitive scale. We compare our method with the most important published methods, i.e. AMAS, TreeDet and SDP-pred, using three well-studied protein families: the receptor-binding domain (MH2) of the Smad family of transcription factors, the Ras-superfamily of small GTPases and the MIP-family of integral membrane transporters. We demonstrate that SH accurately selects known functional sites with higher coverage than the other methods for these test-cases. This shows that compositional differences between protein subfamilies provide sufficient basis for identification of functional sites. In addition, SH selects a number of sites of unknown function that could be interesting candidates for further experimental investigation. Oxford University Press 2006-12 2006-11-27 /pmc/articles/PMC1702503/ /pubmed/17130172 http://dx.doi.org/10.1093/nar/gkl901 Text en © 2006 The Author(s). https://creativecommons.org/licenses/by-nc/2.0/uk/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/ (https://creativecommons.org/licenses/by-nc/2.0/uk/) ) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Computational Biology Pirovano, Walter Feenstra, K. Anton Heringa, Jaap Sequence comparison by sequence harmony identifies subtype-specific functional sites |
title | Sequence comparison by sequence harmony identifies subtype-specific functional sites |
title_full | Sequence comparison by sequence harmony identifies subtype-specific functional sites |
title_fullStr | Sequence comparison by sequence harmony identifies subtype-specific functional sites |
title_full_unstemmed | Sequence comparison by sequence harmony identifies subtype-specific functional sites |
title_short | Sequence comparison by sequence harmony identifies subtype-specific functional sites |
title_sort | sequence comparison by sequence harmony identifies subtype-specific functional sites |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1702503/ https://www.ncbi.nlm.nih.gov/pubmed/17130172 http://dx.doi.org/10.1093/nar/gkl901 |
work_keys_str_mv | AT pirovanowalter sequencecomparisonbysequenceharmonyidentifiessubtypespecificfunctionalsites AT feenstrakanton sequencecomparisonbysequenceharmonyidentifiessubtypespecificfunctionalsites AT heringajaap sequencecomparisonbysequenceharmonyidentifiessubtypespecificfunctionalsites |