Cargando…

Supervised multivariate analysis of sequence groups to identify specificity determining residues

BACKGROUND: Proteins that evolve from a common ancestor can change functionality over time, and it is important to be able identify residues that cause this change. In this paper we show how a supervised multivariate statistical method, Between Group Analysis (BGA), can be used to identify these res...

Descripción completa

Detalles Bibliográficos
Autores principales: Wallace, Iain M, Higgins, Desmond G
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1878507/
https://www.ncbi.nlm.nih.gov/pubmed/17451607
http://dx.doi.org/10.1186/1471-2105-8-135
_version_ 1782133583597535232
author Wallace, Iain M
Higgins, Desmond G
author_facet Wallace, Iain M
Higgins, Desmond G
author_sort Wallace, Iain M
collection PubMed
description BACKGROUND: Proteins that evolve from a common ancestor can change functionality over time, and it is important to be able identify residues that cause this change. In this paper we show how a supervised multivariate statistical method, Between Group Analysis (BGA), can be used to identify these residues from families of proteins with different substrate specifities using multiple sequence alignments. RESULTS: We demonstrate the usefulness of this method on three different test cases. Two of these test cases, the Lactate/Malate dehydrogenase family and Nucleotidyl Cyclases, consist of two functional groups. The other family, Serine Proteases consists of three groups. BGA was used to analyse and visualise these three families using two different encoding schemes for the amino acids. CONCLUSION: This overall combination of methods in this paper is powerful and flexible while being computationally very fast and simple. BGA is especially useful because it can be used to analyse any number of functional classes. In the examples we used in this paper, we have only used 2 or 3 classes for demonstration purposes but any number can be used and visualised.
format Text
id pubmed-1878507
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-18785072007-05-29 Supervised multivariate analysis of sequence groups to identify specificity determining residues Wallace, Iain M Higgins, Desmond G BMC Bioinformatics Methodology Article BACKGROUND: Proteins that evolve from a common ancestor can change functionality over time, and it is important to be able identify residues that cause this change. In this paper we show how a supervised multivariate statistical method, Between Group Analysis (BGA), can be used to identify these residues from families of proteins with different substrate specifities using multiple sequence alignments. RESULTS: We demonstrate the usefulness of this method on three different test cases. Two of these test cases, the Lactate/Malate dehydrogenase family and Nucleotidyl Cyclases, consist of two functional groups. The other family, Serine Proteases consists of three groups. BGA was used to analyse and visualise these three families using two different encoding schemes for the amino acids. CONCLUSION: This overall combination of methods in this paper is powerful and flexible while being computationally very fast and simple. BGA is especially useful because it can be used to analyse any number of functional classes. In the examples we used in this paper, we have only used 2 or 3 classes for demonstration purposes but any number can be used and visualised. BioMed Central 2007-04-23 /pmc/articles/PMC1878507/ /pubmed/17451607 http://dx.doi.org/10.1186/1471-2105-8-135 Text en Copyright © 2007 Wallace and Higgins; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Wallace, Iain M
Higgins, Desmond G
Supervised multivariate analysis of sequence groups to identify specificity determining residues
title Supervised multivariate analysis of sequence groups to identify specificity determining residues
title_full Supervised multivariate analysis of sequence groups to identify specificity determining residues
title_fullStr Supervised multivariate analysis of sequence groups to identify specificity determining residues
title_full_unstemmed Supervised multivariate analysis of sequence groups to identify specificity determining residues
title_short Supervised multivariate analysis of sequence groups to identify specificity determining residues
title_sort supervised multivariate analysis of sequence groups to identify specificity determining residues
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1878507/
https://www.ncbi.nlm.nih.gov/pubmed/17451607
http://dx.doi.org/10.1186/1471-2105-8-135
work_keys_str_mv AT wallaceiainm supervisedmultivariateanalysisofsequencegroupstoidentifyspecificitydeterminingresidues
AT higginsdesmondg supervisedmultivariateanalysisofsequencegroupstoidentifyspecificitydeterminingresidues