Cargando…

Sequence comparison by sequence harmony identifies subtype-specific functional sites

Multiple sequence alignments are often used to reveal functionally important residues within a protein family. They can be particularly useful for the identification of key residues that determine functional differences between protein subfamilies. We present a new entropy-based method, Sequence Har...

Descripción completa

Detalles Bibliográficos
Autores principales: Pirovano, Walter, Feenstra, K. Anton, Heringa, Jaap
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1702503/
https://www.ncbi.nlm.nih.gov/pubmed/17130172
http://dx.doi.org/10.1093/nar/gkl901
_version_ 1782131260050636800
author Pirovano, Walter
Feenstra, K. Anton
Heringa, Jaap
author_facet Pirovano, Walter
Feenstra, K. Anton
Heringa, Jaap
author_sort Pirovano, Walter
collection PubMed
description Multiple sequence alignments are often used to reveal functionally important residues within a protein family. They can be particularly useful for the identification of key residues that determine functional differences between protein subfamilies. We present a new entropy-based method, Sequence Harmony (SH) that accurately detects subfamily-specific positions from a multiple sequence alignment. The SH algorithm implements a novel formula, able to score compositional differences between subfamilies, without imposing conservation, in a simple manner on an intuitive scale. We compare our method with the most important published methods, i.e. AMAS, TreeDet and SDP-pred, using three well-studied protein families: the receptor-binding domain (MH2) of the Smad family of transcription factors, the Ras-superfamily of small GTPases and the MIP-family of integral membrane transporters. We demonstrate that SH accurately selects known functional sites with higher coverage than the other methods for these test-cases. This shows that compositional differences between protein subfamilies provide sufficient basis for identification of functional sites. In addition, SH selects a number of sites of unknown function that could be interesting candidates for further experimental investigation.
format Text
id pubmed-1702503
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-17025032006-12-26 Sequence comparison by sequence harmony identifies subtype-specific functional sites Pirovano, Walter Feenstra, K. Anton Heringa, Jaap Nucleic Acids Res Computational Biology Multiple sequence alignments are often used to reveal functionally important residues within a protein family. They can be particularly useful for the identification of key residues that determine functional differences between protein subfamilies. We present a new entropy-based method, Sequence Harmony (SH) that accurately detects subfamily-specific positions from a multiple sequence alignment. The SH algorithm implements a novel formula, able to score compositional differences between subfamilies, without imposing conservation, in a simple manner on an intuitive scale. We compare our method with the most important published methods, i.e. AMAS, TreeDet and SDP-pred, using three well-studied protein families: the receptor-binding domain (MH2) of the Smad family of transcription factors, the Ras-superfamily of small GTPases and the MIP-family of integral membrane transporters. We demonstrate that SH accurately selects known functional sites with higher coverage than the other methods for these test-cases. This shows that compositional differences between protein subfamilies provide sufficient basis for identification of functional sites. In addition, SH selects a number of sites of unknown function that could be interesting candidates for further experimental investigation. Oxford University Press 2006-12 2006-11-27 /pmc/articles/PMC1702503/ /pubmed/17130172 http://dx.doi.org/10.1093/nar/gkl901 Text en © 2006 The Author(s). https://creativecommons.org/licenses/by-nc/2.0/uk/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/ (https://creativecommons.org/licenses/by-nc/2.0/uk/) ) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Pirovano, Walter
Feenstra, K. Anton
Heringa, Jaap
Sequence comparison by sequence harmony identifies subtype-specific functional sites
title Sequence comparison by sequence harmony identifies subtype-specific functional sites
title_full Sequence comparison by sequence harmony identifies subtype-specific functional sites
title_fullStr Sequence comparison by sequence harmony identifies subtype-specific functional sites
title_full_unstemmed Sequence comparison by sequence harmony identifies subtype-specific functional sites
title_short Sequence comparison by sequence harmony identifies subtype-specific functional sites
title_sort sequence comparison by sequence harmony identifies subtype-specific functional sites
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1702503/
https://www.ncbi.nlm.nih.gov/pubmed/17130172
http://dx.doi.org/10.1093/nar/gkl901
work_keys_str_mv AT pirovanowalter sequencecomparisonbysequenceharmonyidentifiessubtypespecificfunctionalsites
AT feenstrakanton sequencecomparisonbysequenceharmonyidentifiessubtypespecificfunctionalsites
AT heringajaap sequencecomparisonbysequenceharmonyidentifiessubtypespecificfunctionalsites