Cargando…
Evaluation of gene-expression clustering via mutual information distance measure
BACKGROUND: The definition of a distance measure plays a key role in the evaluation of different clustering solutions of gene expression profiles. In this empirical study we compare different clustering solutions when using the Mutual Information (MI) measure versus the use of the well known Euclide...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1858704/ https://www.ncbi.nlm.nih.gov/pubmed/17397530 http://dx.doi.org/10.1186/1471-2105-8-111 |
_version_ | 1782133177457836032 |
---|---|
author | Priness, Ido Maimon, Oded Ben-Gal, Irad |
author_facet | Priness, Ido Maimon, Oded Ben-Gal, Irad |
author_sort | Priness, Ido |
collection | PubMed |
description | BACKGROUND: The definition of a distance measure plays a key role in the evaluation of different clustering solutions of gene expression profiles. In this empirical study we compare different clustering solutions when using the Mutual Information (MI) measure versus the use of the well known Euclidean distance and Pearson correlation coefficient. RESULTS: Relying on several public gene expression datasets, we evaluate the homogeneity and separation scores of different clustering solutions. It was found that the use of the MI measure yields a more significant differentiation among erroneous clustering solutions. The proposed measure was also used to analyze the performance of several known clustering algorithms. A comparative study of these algorithms reveals that their "best solutions" are ranked almost oppositely when using different distance measures, despite the found correspondence between these measures when analysing the averaged scores of groups of solutions. CONCLUSION: In view of the results, further attention should be paid to the selection of a proper distance measure for analyzing the clustering of gene expression data. |
format | Text |
id | pubmed-1858704 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-18587042007-04-30 Evaluation of gene-expression clustering via mutual information distance measure Priness, Ido Maimon, Oded Ben-Gal, Irad BMC Bioinformatics Research Article BACKGROUND: The definition of a distance measure plays a key role in the evaluation of different clustering solutions of gene expression profiles. In this empirical study we compare different clustering solutions when using the Mutual Information (MI) measure versus the use of the well known Euclidean distance and Pearson correlation coefficient. RESULTS: Relying on several public gene expression datasets, we evaluate the homogeneity and separation scores of different clustering solutions. It was found that the use of the MI measure yields a more significant differentiation among erroneous clustering solutions. The proposed measure was also used to analyze the performance of several known clustering algorithms. A comparative study of these algorithms reveals that their "best solutions" are ranked almost oppositely when using different distance measures, despite the found correspondence between these measures when analysing the averaged scores of groups of solutions. CONCLUSION: In view of the results, further attention should be paid to the selection of a proper distance measure for analyzing the clustering of gene expression data. BioMed Central 2007-03-30 /pmc/articles/PMC1858704/ /pubmed/17397530 http://dx.doi.org/10.1186/1471-2105-8-111 Text en Copyright © 2007 Priness et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Priness, Ido Maimon, Oded Ben-Gal, Irad Evaluation of gene-expression clustering via mutual information distance measure |
title | Evaluation of gene-expression clustering via mutual information distance measure |
title_full | Evaluation of gene-expression clustering via mutual information distance measure |
title_fullStr | Evaluation of gene-expression clustering via mutual information distance measure |
title_full_unstemmed | Evaluation of gene-expression clustering via mutual information distance measure |
title_short | Evaluation of gene-expression clustering via mutual information distance measure |
title_sort | evaluation of gene-expression clustering via mutual information distance measure |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1858704/ https://www.ncbi.nlm.nih.gov/pubmed/17397530 http://dx.doi.org/10.1186/1471-2105-8-111 |
work_keys_str_mv | AT prinessido evaluationofgeneexpressionclusteringviamutualinformationdistancemeasure AT maimonoded evaluationofgeneexpressionclusteringviamutualinformationdistancemeasure AT bengalirad evaluationofgeneexpressionclusteringviamutualinformationdistancemeasure |