Cargando…

An Improved Pearson's Correlation Proximity-Based Hierarchical Clustering for Mining Biological Association between Genes

Microarray gene expression datasets has concerned great awareness among molecular biologist, statisticians, and computer scientists. Data mining that extracts the hidden and usual information from datasets fails to identify the most significant biological associations between genes. A search made wi...

Descripción completa

Detalles Bibliográficos
Autores principales: Booma, P. M., Prabhakaran, S., Dhanalakshmi, R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4083291/
https://www.ncbi.nlm.nih.gov/pubmed/25136661
http://dx.doi.org/10.1155/2014/357873
_version_ 1782324354693988352
author Booma, P. M.
Prabhakaran, S.
Dhanalakshmi, R.
author_facet Booma, P. M.
Prabhakaran, S.
Dhanalakshmi, R.
author_sort Booma, P. M.
collection PubMed
description Microarray gene expression datasets has concerned great awareness among molecular biologist, statisticians, and computer scientists. Data mining that extracts the hidden and usual information from datasets fails to identify the most significant biological associations between genes. A search made with heuristic for standard biological process measures only the gene expression level, threshold, and response time. Heuristic search identifies and mines the best biological solution, but the association process was not efficiently addressed. To monitor higher rate of expression levels between genes, a hierarchical clustering model was proposed, where the biological association between genes is measured simultaneously using proximity measure of improved Pearson's correlation (PCPHC). Additionally, the Seed Augment algorithm adopts average linkage methods on rows and columns in order to expand a seed PCPHC model into a maximal global PCPHC (GL-PCPHC) model and to identify association between the clusters. Moreover, a GL-PCPHC applies pattern growing method to mine the PCPHC patterns. Compared to existing gene expression analysis, the PCPHC model achieves better performance. Experimental evaluations are conducted for GL-PCPHC model with standard benchmark gene expression datasets extracted from UCI repository and GenBank database in terms of execution time, size of pattern, significance level, biological association efficiency, and pattern quality.
format Online
Article
Text
id pubmed-4083291
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-40832912014-08-18 An Improved Pearson's Correlation Proximity-Based Hierarchical Clustering for Mining Biological Association between Genes Booma, P. M. Prabhakaran, S. Dhanalakshmi, R. ScientificWorldJournal Research Article Microarray gene expression datasets has concerned great awareness among molecular biologist, statisticians, and computer scientists. Data mining that extracts the hidden and usual information from datasets fails to identify the most significant biological associations between genes. A search made with heuristic for standard biological process measures only the gene expression level, threshold, and response time. Heuristic search identifies and mines the best biological solution, but the association process was not efficiently addressed. To monitor higher rate of expression levels between genes, a hierarchical clustering model was proposed, where the biological association between genes is measured simultaneously using proximity measure of improved Pearson's correlation (PCPHC). Additionally, the Seed Augment algorithm adopts average linkage methods on rows and columns in order to expand a seed PCPHC model into a maximal global PCPHC (GL-PCPHC) model and to identify association between the clusters. Moreover, a GL-PCPHC applies pattern growing method to mine the PCPHC patterns. Compared to existing gene expression analysis, the PCPHC model achieves better performance. Experimental evaluations are conducted for GL-PCPHC model with standard benchmark gene expression datasets extracted from UCI repository and GenBank database in terms of execution time, size of pattern, significance level, biological association efficiency, and pattern quality. Hindawi Publishing Corporation 2014 2014-06-16 /pmc/articles/PMC4083291/ /pubmed/25136661 http://dx.doi.org/10.1155/2014/357873 Text en Copyright © 2014 P. M. Booma et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Booma, P. M.
Prabhakaran, S.
Dhanalakshmi, R.
An Improved Pearson's Correlation Proximity-Based Hierarchical Clustering for Mining Biological Association between Genes
title An Improved Pearson's Correlation Proximity-Based Hierarchical Clustering for Mining Biological Association between Genes
title_full An Improved Pearson's Correlation Proximity-Based Hierarchical Clustering for Mining Biological Association between Genes
title_fullStr An Improved Pearson's Correlation Proximity-Based Hierarchical Clustering for Mining Biological Association between Genes
title_full_unstemmed An Improved Pearson's Correlation Proximity-Based Hierarchical Clustering for Mining Biological Association between Genes
title_short An Improved Pearson's Correlation Proximity-Based Hierarchical Clustering for Mining Biological Association between Genes
title_sort improved pearson's correlation proximity-based hierarchical clustering for mining biological association between genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4083291/
https://www.ncbi.nlm.nih.gov/pubmed/25136661
http://dx.doi.org/10.1155/2014/357873
work_keys_str_mv AT boomapm animprovedpearsonscorrelationproximitybasedhierarchicalclusteringforminingbiologicalassociationbetweengenes
AT prabhakarans animprovedpearsonscorrelationproximitybasedhierarchicalclusteringforminingbiologicalassociationbetweengenes
AT dhanalakshmir animprovedpearsonscorrelationproximitybasedhierarchicalclusteringforminingbiologicalassociationbetweengenes
AT boomapm improvedpearsonscorrelationproximitybasedhierarchicalclusteringforminingbiologicalassociationbetweengenes
AT prabhakarans improvedpearsonscorrelationproximitybasedhierarchicalclusteringforminingbiologicalassociationbetweengenes
AT dhanalakshmir improvedpearsonscorrelationproximitybasedhierarchicalclusteringforminingbiologicalassociationbetweengenes