Cargando…

Universal partitioning of the hierarchical fold network of 50-residue segments in proteins

BACKGROUND: Several studies have demonstrated that protein fold space is structured hierarchically and that power-law statistics are satisfied in relation between the numbers of protein families and protein folds (or superfamilies). We examined the internal structure and statistics in the fold space...

Descripción completa

Detalles Bibliográficos
Autores principales: Ito, Jun-ichi, Sonobe, Yuki, Ikeda, Kazuyoshi, Tomii, Kentaro, Higo, Junichi
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2693521/
https://www.ncbi.nlm.nih.gov/pubmed/19454039
http://dx.doi.org/10.1186/1472-6807-9-34
_version_ 1782167965403185152
author Ito, Jun-ichi
Sonobe, Yuki
Ikeda, Kazuyoshi
Tomii, Kentaro
Higo, Junichi
author_facet Ito, Jun-ichi
Sonobe, Yuki
Ikeda, Kazuyoshi
Tomii, Kentaro
Higo, Junichi
author_sort Ito, Jun-ichi
collection PubMed
description BACKGROUND: Several studies have demonstrated that protein fold space is structured hierarchically and that power-law statistics are satisfied in relation between the numbers of protein families and protein folds (or superfamilies). We examined the internal structure and statistics in the fold space of 50 amino-acid residue segments taken from various protein folds. We used inter-residue contact patterns to measure the tertiary structural similarity among segments. Using this similarity measure, the segments were classified into a number (K(c)) of clusters. We examined various K(c )values for the clustering. The special resolution to differentiate the segment tertiary structures increases with increasing K(c). Furthermore, we constructed networks by linking structurally similar clusters. RESULTS: The network was partitioned persistently into four regions for K(c )≥ 1000. This main partitioning is consistent with results of earlier studies, where similar partitioning was reported in classifying protein domain structures. Furthermore, the network was partitioned naturally into several dozens of sub-networks (i.e., communities). Therefore, intra-sub-network clusters were mutually connected with numerous links, although inter-sub-network ones were rarely done with few links. For K(c )≥ 1000, the major sub-networks were about 40; the contents of the major sub-networks were conserved. This sub-partitioning is a novel finding, suggesting that the network is structured hierarchically: Segments construct a cluster, clusters form a sub-network, and sub-networks constitute a region. Additionally, the network was characterized by non-power-law statistics, which is also a novel finding. CONCLUSION: Main findings are: (1) The universe of 50 residue segments found here was characterized by non-power-law statistics. Therefore, the universe differs from those ever reported for the protein domains. (2) The 50-residue segments were partitioned persistently and universally into some dozens (ca. 40) of major sub-networks, irrespective of the number of clusters. (3) These major sub-networks encompassed 90% of all segments. Consequently, the protein tertiary structure is constructed using the dozens of elements (sub-networks).
format Text
id pubmed-2693521
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26935212009-06-08 Universal partitioning of the hierarchical fold network of 50-residue segments in proteins Ito, Jun-ichi Sonobe, Yuki Ikeda, Kazuyoshi Tomii, Kentaro Higo, Junichi BMC Struct Biol Research Article BACKGROUND: Several studies have demonstrated that protein fold space is structured hierarchically and that power-law statistics are satisfied in relation between the numbers of protein families and protein folds (or superfamilies). We examined the internal structure and statistics in the fold space of 50 amino-acid residue segments taken from various protein folds. We used inter-residue contact patterns to measure the tertiary structural similarity among segments. Using this similarity measure, the segments were classified into a number (K(c)) of clusters. We examined various K(c )values for the clustering. The special resolution to differentiate the segment tertiary structures increases with increasing K(c). Furthermore, we constructed networks by linking structurally similar clusters. RESULTS: The network was partitioned persistently into four regions for K(c )≥ 1000. This main partitioning is consistent with results of earlier studies, where similar partitioning was reported in classifying protein domain structures. Furthermore, the network was partitioned naturally into several dozens of sub-networks (i.e., communities). Therefore, intra-sub-network clusters were mutually connected with numerous links, although inter-sub-network ones were rarely done with few links. For K(c )≥ 1000, the major sub-networks were about 40; the contents of the major sub-networks were conserved. This sub-partitioning is a novel finding, suggesting that the network is structured hierarchically: Segments construct a cluster, clusters form a sub-network, and sub-networks constitute a region. Additionally, the network was characterized by non-power-law statistics, which is also a novel finding. CONCLUSION: Main findings are: (1) The universe of 50 residue segments found here was characterized by non-power-law statistics. Therefore, the universe differs from those ever reported for the protein domains. (2) The 50-residue segments were partitioned persistently and universally into some dozens (ca. 40) of major sub-networks, irrespective of the number of clusters. (3) These major sub-networks encompassed 90% of all segments. Consequently, the protein tertiary structure is constructed using the dozens of elements (sub-networks). BioMed Central 2009-05-20 /pmc/articles/PMC2693521/ /pubmed/19454039 http://dx.doi.org/10.1186/1472-6807-9-34 Text en Copyright © 2009 Ito et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Ito, Jun-ichi
Sonobe, Yuki
Ikeda, Kazuyoshi
Tomii, Kentaro
Higo, Junichi
Universal partitioning of the hierarchical fold network of 50-residue segments in proteins
title Universal partitioning of the hierarchical fold network of 50-residue segments in proteins
title_full Universal partitioning of the hierarchical fold network of 50-residue segments in proteins
title_fullStr Universal partitioning of the hierarchical fold network of 50-residue segments in proteins
title_full_unstemmed Universal partitioning of the hierarchical fold network of 50-residue segments in proteins
title_short Universal partitioning of the hierarchical fold network of 50-residue segments in proteins
title_sort universal partitioning of the hierarchical fold network of 50-residue segments in proteins
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2693521/
https://www.ncbi.nlm.nih.gov/pubmed/19454039
http://dx.doi.org/10.1186/1472-6807-9-34
work_keys_str_mv AT itojunichi universalpartitioningofthehierarchicalfoldnetworkof50residuesegmentsinproteins
AT sonobeyuki universalpartitioningofthehierarchicalfoldnetworkof50residuesegmentsinproteins
AT ikedakazuyoshi universalpartitioningofthehierarchicalfoldnetworkof50residuesegmentsinproteins
AT tomiikentaro universalpartitioningofthehierarchicalfoldnetworkof50residuesegmentsinproteins
AT higojunichi universalpartitioningofthehierarchicalfoldnetworkof50residuesegmentsinproteins