Cargando…

Evaluation of gene-expression clustering via mutual information distance measure

BACKGROUND: The definition of a distance measure plays a key role in the evaluation of different clustering solutions of gene expression profiles. In this empirical study we compare different clustering solutions when using the Mutual Information (MI) measure versus the use of the well known Euclide...

Descripción completa

Detalles Bibliográficos
Autores principales: Priness, Ido, Maimon, Oded, Ben-Gal, Irad
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1858704/
https://www.ncbi.nlm.nih.gov/pubmed/17397530
http://dx.doi.org/10.1186/1471-2105-8-111
_version_ 1782133177457836032
author Priness, Ido
Maimon, Oded
Ben-Gal, Irad
author_facet Priness, Ido
Maimon, Oded
Ben-Gal, Irad
author_sort Priness, Ido
collection PubMed
description BACKGROUND: The definition of a distance measure plays a key role in the evaluation of different clustering solutions of gene expression profiles. In this empirical study we compare different clustering solutions when using the Mutual Information (MI) measure versus the use of the well known Euclidean distance and Pearson correlation coefficient. RESULTS: Relying on several public gene expression datasets, we evaluate the homogeneity and separation scores of different clustering solutions. It was found that the use of the MI measure yields a more significant differentiation among erroneous clustering solutions. The proposed measure was also used to analyze the performance of several known clustering algorithms. A comparative study of these algorithms reveals that their "best solutions" are ranked almost oppositely when using different distance measures, despite the found correspondence between these measures when analysing the averaged scores of groups of solutions. CONCLUSION: In view of the results, further attention should be paid to the selection of a proper distance measure for analyzing the clustering of gene expression data.
format Text
id pubmed-1858704
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-18587042007-04-30 Evaluation of gene-expression clustering via mutual information distance measure Priness, Ido Maimon, Oded Ben-Gal, Irad BMC Bioinformatics Research Article BACKGROUND: The definition of a distance measure plays a key role in the evaluation of different clustering solutions of gene expression profiles. In this empirical study we compare different clustering solutions when using the Mutual Information (MI) measure versus the use of the well known Euclidean distance and Pearson correlation coefficient. RESULTS: Relying on several public gene expression datasets, we evaluate the homogeneity and separation scores of different clustering solutions. It was found that the use of the MI measure yields a more significant differentiation among erroneous clustering solutions. The proposed measure was also used to analyze the performance of several known clustering algorithms. A comparative study of these algorithms reveals that their "best solutions" are ranked almost oppositely when using different distance measures, despite the found correspondence between these measures when analysing the averaged scores of groups of solutions. CONCLUSION: In view of the results, further attention should be paid to the selection of a proper distance measure for analyzing the clustering of gene expression data. BioMed Central 2007-03-30 /pmc/articles/PMC1858704/ /pubmed/17397530 http://dx.doi.org/10.1186/1471-2105-8-111 Text en Copyright © 2007 Priness et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Priness, Ido
Maimon, Oded
Ben-Gal, Irad
Evaluation of gene-expression clustering via mutual information distance measure
title Evaluation of gene-expression clustering via mutual information distance measure
title_full Evaluation of gene-expression clustering via mutual information distance measure
title_fullStr Evaluation of gene-expression clustering via mutual information distance measure
title_full_unstemmed Evaluation of gene-expression clustering via mutual information distance measure
title_short Evaluation of gene-expression clustering via mutual information distance measure
title_sort evaluation of gene-expression clustering via mutual information distance measure
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1858704/
https://www.ncbi.nlm.nih.gov/pubmed/17397530
http://dx.doi.org/10.1186/1471-2105-8-111
work_keys_str_mv AT prinessido evaluationofgeneexpressionclusteringviamutualinformationdistancemeasure
AT maimonoded evaluationofgeneexpressionclusteringviamutualinformationdistancemeasure
AT bengalirad evaluationofgeneexpressionclusteringviamutualinformationdistancemeasure