Cargando…

An novel frequent probability pattern mining algorithm based on circuit simulation method in uncertain biological networks

BACKGROUND: Motif mining has always been a hot research topic in bioinformatics. Most of current research on biological networks focuses on exact motif mining. However, due to the inevitable experimental error and noisy data, biological network data represented as the probability model could better...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Jieyue, Wang, Chunyan, Qiu, Kunpu, Zhong, Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4243085/
https://www.ncbi.nlm.nih.gov/pubmed/25350277
http://dx.doi.org/10.1186/1752-0509-8-S3-S6
_version_ 1782346055814217728
author He, Jieyue
Wang, Chunyan
Qiu, Kunpu
Zhong, Wei
author_facet He, Jieyue
Wang, Chunyan
Qiu, Kunpu
Zhong, Wei
author_sort He, Jieyue
collection PubMed
description BACKGROUND: Motif mining has always been a hot research topic in bioinformatics. Most of current research on biological networks focuses on exact motif mining. However, due to the inevitable experimental error and noisy data, biological network data represented as the probability model could better reflect the authenticity and biological significance, therefore, it is more biological meaningful to discover probability motif in uncertain biological networks. One of the key steps in probability motif mining is frequent pattern discovery which is usually based on the possible world model having a relatively high computational complexity. METHODS: In this paper, we present a novel method for detecting frequent probability patterns based on circuit simulation in the uncertain biological networks. First, the partition based efficient search is applied to the non-tree like subgraph mining where the probability of occurrence in random networks is small. Then, an algorithm of probability isomorphic based on circuit simulation is proposed. The probability isomorphic combines the analysis of circuit topology structure with related physical properties of voltage in order to evaluate the probability isomorphism between probability subgraphs. The circuit simulation based probability isomorphic can avoid using traditional possible world model. Finally, based on the algorithm of probability subgraph isomorphism, two-step hierarchical clustering method is used to cluster subgraphs, and discover frequent probability patterns from the clusters. RESULTS: The experiment results on data sets of the Protein-Protein Interaction (PPI) networks and the transcriptional regulatory networks of E. coli and S. cerevisiae show that the proposed method can efficiently discover the frequent probability subgraphs. The discovered subgraphs in our study contain all probability motifs reported in the experiments published in other related papers. CONCLUSIONS: The algorithm of probability graph isomorphism evaluation based on circuit simulation method excludes most of subgraphs which are not probability isomorphism and reduces the search space of the probability isomorphism subgraphs using the mismatch values in the node voltage set. It is an innovative way to find the frequent probability patterns, which can be efficiently applied to probability motif discovery problems in the further studies.
format Online
Article
Text
id pubmed-4243085
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42430852014-11-26 An novel frequent probability pattern mining algorithm based on circuit simulation method in uncertain biological networks He, Jieyue Wang, Chunyan Qiu, Kunpu Zhong, Wei BMC Syst Biol Research BACKGROUND: Motif mining has always been a hot research topic in bioinformatics. Most of current research on biological networks focuses on exact motif mining. However, due to the inevitable experimental error and noisy data, biological network data represented as the probability model could better reflect the authenticity and biological significance, therefore, it is more biological meaningful to discover probability motif in uncertain biological networks. One of the key steps in probability motif mining is frequent pattern discovery which is usually based on the possible world model having a relatively high computational complexity. METHODS: In this paper, we present a novel method for detecting frequent probability patterns based on circuit simulation in the uncertain biological networks. First, the partition based efficient search is applied to the non-tree like subgraph mining where the probability of occurrence in random networks is small. Then, an algorithm of probability isomorphic based on circuit simulation is proposed. The probability isomorphic combines the analysis of circuit topology structure with related physical properties of voltage in order to evaluate the probability isomorphism between probability subgraphs. The circuit simulation based probability isomorphic can avoid using traditional possible world model. Finally, based on the algorithm of probability subgraph isomorphism, two-step hierarchical clustering method is used to cluster subgraphs, and discover frequent probability patterns from the clusters. RESULTS: The experiment results on data sets of the Protein-Protein Interaction (PPI) networks and the transcriptional regulatory networks of E. coli and S. cerevisiae show that the proposed method can efficiently discover the frequent probability subgraphs. The discovered subgraphs in our study contain all probability motifs reported in the experiments published in other related papers. CONCLUSIONS: The algorithm of probability graph isomorphism evaluation based on circuit simulation method excludes most of subgraphs which are not probability isomorphism and reduces the search space of the probability isomorphism subgraphs using the mismatch values in the node voltage set. It is an innovative way to find the frequent probability patterns, which can be efficiently applied to probability motif discovery problems in the further studies. BioMed Central 2014-10-22 /pmc/articles/PMC4243085/ /pubmed/25350277 http://dx.doi.org/10.1186/1752-0509-8-S3-S6 Text en Copyright © 2014 He et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
He, Jieyue
Wang, Chunyan
Qiu, Kunpu
Zhong, Wei
An novel frequent probability pattern mining algorithm based on circuit simulation method in uncertain biological networks
title An novel frequent probability pattern mining algorithm based on circuit simulation method in uncertain biological networks
title_full An novel frequent probability pattern mining algorithm based on circuit simulation method in uncertain biological networks
title_fullStr An novel frequent probability pattern mining algorithm based on circuit simulation method in uncertain biological networks
title_full_unstemmed An novel frequent probability pattern mining algorithm based on circuit simulation method in uncertain biological networks
title_short An novel frequent probability pattern mining algorithm based on circuit simulation method in uncertain biological networks
title_sort novel frequent probability pattern mining algorithm based on circuit simulation method in uncertain biological networks
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4243085/
https://www.ncbi.nlm.nih.gov/pubmed/25350277
http://dx.doi.org/10.1186/1752-0509-8-S3-S6
work_keys_str_mv AT hejieyue annovelfrequentprobabilitypatternminingalgorithmbasedoncircuitsimulationmethodinuncertainbiologicalnetworks
AT wangchunyan annovelfrequentprobabilitypatternminingalgorithmbasedoncircuitsimulationmethodinuncertainbiologicalnetworks
AT qiukunpu annovelfrequentprobabilitypatternminingalgorithmbasedoncircuitsimulationmethodinuncertainbiologicalnetworks
AT zhongwei annovelfrequentprobabilitypatternminingalgorithmbasedoncircuitsimulationmethodinuncertainbiologicalnetworks
AT hejieyue novelfrequentprobabilitypatternminingalgorithmbasedoncircuitsimulationmethodinuncertainbiologicalnetworks
AT wangchunyan novelfrequentprobabilitypatternminingalgorithmbasedoncircuitsimulationmethodinuncertainbiologicalnetworks
AT qiukunpu novelfrequentprobabilitypatternminingalgorithmbasedoncircuitsimulationmethodinuncertainbiologicalnetworks
AT zhongwei novelfrequentprobabilitypatternminingalgorithmbasedoncircuitsimulationmethodinuncertainbiologicalnetworks