Cargando…
Features analysis for identification of date and party hubs in protein interaction network of Saccharomyces Cerevisiae
BACKGROUND: It has been understood that biological networks have modular organizations which are the sources of their observed complexity. Analysis of networks and motifs has shown that two types of hubs, party hubs and date hubs, are responsible for this complexity. Party hubs are local coordinator...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3018396/ https://www.ncbi.nlm.nih.gov/pubmed/21167069 http://dx.doi.org/10.1186/1752-0509-4-172 |
_version_ | 1782196056005541888 |
---|---|
author | Mirzarezaee, Mitra Araabi, Babak N Sadeghi, Mehdi |
author_facet | Mirzarezaee, Mitra Araabi, Babak N Sadeghi, Mehdi |
author_sort | Mirzarezaee, Mitra |
collection | PubMed |
description | BACKGROUND: It has been understood that biological networks have modular organizations which are the sources of their observed complexity. Analysis of networks and motifs has shown that two types of hubs, party hubs and date hubs, are responsible for this complexity. Party hubs are local coordinators because of their high co-expressions with their partners, whereas date hubs display low co-expressions and are assumed as global connectors. However there is no mutual agreement on these concepts in related literature with different studies reporting their results on different data sets. We investigated whether there is a relation between the biological features of Saccharomyces Cerevisiae's proteins and their roles as non-hubs, intermediately connected, party hubs, and date hubs. We propose a classifier that separates these four classes. RESULTS: We extracted different biological characteristics including amino acid sequences, domain contents, repeated domains, functional categories, biological processes, cellular compartments, disordered regions, and position specific scoring matrix from various sources. Several classifiers are examined and the best feature-sets based on average correct classification rate and correlation coefficients of the results are selected. We show that fusion of five feature-sets including domains, Position Specific Scoring Matrix-400, cellular compartments level one, and composition pairs with two and one gaps provide the best discrimination with an average correct classification rate of 77%. CONCLUSIONS: We study a variety of known biological feature-sets of the proteins and show that there is a relation between domains, Position Specific Scoring Matrix-400, cellular compartments level one, composition pairs with two and one gaps of Saccharomyces Cerevisiae's proteins, and their roles in the protein interaction network as non-hubs, intermediately connected, party hubs and date hubs. This study also confirms the possibility of predicting non-hubs, party hubs and date hubs based on their biological features with acceptable accuracy. If such a hypothesis is correct for other species as well, similar methods can be applied to predict the roles of proteins in those species. |
format | Text |
id | pubmed-3018396 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-30183962011-01-24 Features analysis for identification of date and party hubs in protein interaction network of Saccharomyces Cerevisiae Mirzarezaee, Mitra Araabi, Babak N Sadeghi, Mehdi BMC Syst Biol Research Article BACKGROUND: It has been understood that biological networks have modular organizations which are the sources of their observed complexity. Analysis of networks and motifs has shown that two types of hubs, party hubs and date hubs, are responsible for this complexity. Party hubs are local coordinators because of their high co-expressions with their partners, whereas date hubs display low co-expressions and are assumed as global connectors. However there is no mutual agreement on these concepts in related literature with different studies reporting their results on different data sets. We investigated whether there is a relation between the biological features of Saccharomyces Cerevisiae's proteins and their roles as non-hubs, intermediately connected, party hubs, and date hubs. We propose a classifier that separates these four classes. RESULTS: We extracted different biological characteristics including amino acid sequences, domain contents, repeated domains, functional categories, biological processes, cellular compartments, disordered regions, and position specific scoring matrix from various sources. Several classifiers are examined and the best feature-sets based on average correct classification rate and correlation coefficients of the results are selected. We show that fusion of five feature-sets including domains, Position Specific Scoring Matrix-400, cellular compartments level one, and composition pairs with two and one gaps provide the best discrimination with an average correct classification rate of 77%. CONCLUSIONS: We study a variety of known biological feature-sets of the proteins and show that there is a relation between domains, Position Specific Scoring Matrix-400, cellular compartments level one, composition pairs with two and one gaps of Saccharomyces Cerevisiae's proteins, and their roles in the protein interaction network as non-hubs, intermediately connected, party hubs and date hubs. This study also confirms the possibility of predicting non-hubs, party hubs and date hubs based on their biological features with acceptable accuracy. If such a hypothesis is correct for other species as well, similar methods can be applied to predict the roles of proteins in those species. BioMed Central 2010-12-19 /pmc/articles/PMC3018396/ /pubmed/21167069 http://dx.doi.org/10.1186/1752-0509-4-172 Text en Copyright ©2010 Mirzarezaee et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Mirzarezaee, Mitra Araabi, Babak N Sadeghi, Mehdi Features analysis for identification of date and party hubs in protein interaction network of Saccharomyces Cerevisiae |
title | Features analysis for identification of date and party hubs in protein interaction network of Saccharomyces Cerevisiae |
title_full | Features analysis for identification of date and party hubs in protein interaction network of Saccharomyces Cerevisiae |
title_fullStr | Features analysis for identification of date and party hubs in protein interaction network of Saccharomyces Cerevisiae |
title_full_unstemmed | Features analysis for identification of date and party hubs in protein interaction network of Saccharomyces Cerevisiae |
title_short | Features analysis for identification of date and party hubs in protein interaction network of Saccharomyces Cerevisiae |
title_sort | features analysis for identification of date and party hubs in protein interaction network of saccharomyces cerevisiae |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3018396/ https://www.ncbi.nlm.nih.gov/pubmed/21167069 http://dx.doi.org/10.1186/1752-0509-4-172 |
work_keys_str_mv | AT mirzarezaeemitra featuresanalysisforidentificationofdateandpartyhubsinproteininteractionnetworkofsaccharomycescerevisiae AT araabibabakn featuresanalysisforidentificationofdateandpartyhubsinproteininteractionnetworkofsaccharomycescerevisiae AT sadeghimehdi featuresanalysisforidentificationofdateandpartyhubsinproteininteractionnetworkofsaccharomycescerevisiae |