Cargando…

Measuring the functional sequence complexity of proteins

BACKGROUND: Abel and Trevors have delineated three aspects of sequence complexity, Random Sequence Complexity (RSC), Ordered Sequence Complexity (OSC) and Functional Sequence Complexity (FSC) observed in biosequences such as proteins. In this paper, we provide a method to measure functional sequence...

Descripción completa

Detalles Bibliográficos
Autores principales: Durston, Kirk K, Chiu, David KY, Abel, David L, Trevors, Jack T
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2217542/
https://www.ncbi.nlm.nih.gov/pubmed/18062814
http://dx.doi.org/10.1186/1742-4682-4-47
_version_ 1782149276858580992
author Durston, Kirk K
Chiu, David KY
Abel, David L
Trevors, Jack T
author_facet Durston, Kirk K
Chiu, David KY
Abel, David L
Trevors, Jack T
author_sort Durston, Kirk K
collection PubMed
description BACKGROUND: Abel and Trevors have delineated three aspects of sequence complexity, Random Sequence Complexity (RSC), Ordered Sequence Complexity (OSC) and Functional Sequence Complexity (FSC) observed in biosequences such as proteins. In this paper, we provide a method to measure functional sequence complexity. METHODS AND RESULTS: We have extended Shannon uncertainty by incorporating the data variable with a functionality variable. The resulting measured unit, which we call Functional bit (Fit), is calculated from the sequence data jointly with the defined functionality variable. To demonstrate the relevance to functional bioinformatics, a method to measure functional sequence complexity was developed and applied to 35 protein families. Considerations were made in determining how the measure can be used to correlate functionality when relating to the whole molecule and sub-molecule. In the experiment, we show that when the proposed measure is applied to the aligned protein sequences of ubiquitin, 6 of the 7 highest value sites correlate with the binding domain. CONCLUSION: For future extensions, measures of functional bioinformatics may provide a means to evaluate potential evolving pathways from effects such as mutations, as well as analyzing the internal structural and functional relationships within the 3-D structure of proteins.
format Text
id pubmed-2217542
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-22175422008-01-30 Measuring the functional sequence complexity of proteins Durston, Kirk K Chiu, David KY Abel, David L Trevors, Jack T Theor Biol Med Model Research BACKGROUND: Abel and Trevors have delineated three aspects of sequence complexity, Random Sequence Complexity (RSC), Ordered Sequence Complexity (OSC) and Functional Sequence Complexity (FSC) observed in biosequences such as proteins. In this paper, we provide a method to measure functional sequence complexity. METHODS AND RESULTS: We have extended Shannon uncertainty by incorporating the data variable with a functionality variable. The resulting measured unit, which we call Functional bit (Fit), is calculated from the sequence data jointly with the defined functionality variable. To demonstrate the relevance to functional bioinformatics, a method to measure functional sequence complexity was developed and applied to 35 protein families. Considerations were made in determining how the measure can be used to correlate functionality when relating to the whole molecule and sub-molecule. In the experiment, we show that when the proposed measure is applied to the aligned protein sequences of ubiquitin, 6 of the 7 highest value sites correlate with the binding domain. CONCLUSION: For future extensions, measures of functional bioinformatics may provide a means to evaluate potential evolving pathways from effects such as mutations, as well as analyzing the internal structural and functional relationships within the 3-D structure of proteins. BioMed Central 2007-12-06 /pmc/articles/PMC2217542/ /pubmed/18062814 http://dx.doi.org/10.1186/1742-4682-4-47 Text en Copyright © 2007 Durston et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Durston, Kirk K
Chiu, David KY
Abel, David L
Trevors, Jack T
Measuring the functional sequence complexity of proteins
title Measuring the functional sequence complexity of proteins
title_full Measuring the functional sequence complexity of proteins
title_fullStr Measuring the functional sequence complexity of proteins
title_full_unstemmed Measuring the functional sequence complexity of proteins
title_short Measuring the functional sequence complexity of proteins
title_sort measuring the functional sequence complexity of proteins
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2217542/
https://www.ncbi.nlm.nih.gov/pubmed/18062814
http://dx.doi.org/10.1186/1742-4682-4-47
work_keys_str_mv AT durstonkirkk measuringthefunctionalsequencecomplexityofproteins
AT chiudavidky measuringthefunctionalsequencecomplexityofproteins
AT abeldavidl measuringthefunctionalsequencecomplexityofproteins
AT trevorsjackt measuringthefunctionalsequencecomplexityofproteins