Cargando…

CLIC: clustering analysis of large microarray datasets with individual dimension-based clustering

Large microarray data sets have recently become common. However, most available clustering methods do not easily handle large microarray data sets due to their very large computational complexity and memory requirements. Furthermore, typical clustering methods construct oversimplified clusters that...

Descripción completa

Detalles Bibliográficos
Autores principales: Yun, Taegyun, Hwang, Taeho, Cha, Kihoon, Yi, Gwan-Su
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2896182/
https://www.ncbi.nlm.nih.gov/pubmed/20529873
http://dx.doi.org/10.1093/nar/gkq516
_version_ 1782183329680850944
author Yun, Taegyun
Hwang, Taeho
Cha, Kihoon
Yi, Gwan-Su
author_facet Yun, Taegyun
Hwang, Taeho
Cha, Kihoon
Yi, Gwan-Su
author_sort Yun, Taegyun
collection PubMed
description Large microarray data sets have recently become common. However, most available clustering methods do not easily handle large microarray data sets due to their very large computational complexity and memory requirements. Furthermore, typical clustering methods construct oversimplified clusters that ignore subtle but meaningful changes in the expression patterns present in large microarray data sets. It is necessary to develop an efficient clustering method that identifies both absolute expression differences and expression profile patterns in different expression levels for large microarray data sets. This study presents CLIC, which meets the requirements of clustering analysis particularly but not limited to large microarray data sets. CLIC is based on a novel concept in which genes are clustered in individual dimensions first and in which the ordinal labels of clusters in each dimension are then used for further full dimension-wide clustering. CLIC enables iterative sub-clustering into more homogeneous groups and the identification of common expression patterns among the genes separated in different groups due to the large difference in the expression levels. In addition, the computation of clustering is parallelized, the number of clusters is automatically detected, and the functional enrichment for each cluster and pattern is provided. CLIC is freely available at http://gexp2.kaist.ac.kr/clic.
format Text
id pubmed-2896182
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-28961822010-07-02 CLIC: clustering analysis of large microarray datasets with individual dimension-based clustering Yun, Taegyun Hwang, Taeho Cha, Kihoon Yi, Gwan-Su Nucleic Acids Res Articles Large microarray data sets have recently become common. However, most available clustering methods do not easily handle large microarray data sets due to their very large computational complexity and memory requirements. Furthermore, typical clustering methods construct oversimplified clusters that ignore subtle but meaningful changes in the expression patterns present in large microarray data sets. It is necessary to develop an efficient clustering method that identifies both absolute expression differences and expression profile patterns in different expression levels for large microarray data sets. This study presents CLIC, which meets the requirements of clustering analysis particularly but not limited to large microarray data sets. CLIC is based on a novel concept in which genes are clustered in individual dimensions first and in which the ordinal labels of clusters in each dimension are then used for further full dimension-wide clustering. CLIC enables iterative sub-clustering into more homogeneous groups and the identification of common expression patterns among the genes separated in different groups due to the large difference in the expression levels. In addition, the computation of clustering is parallelized, the number of clusters is automatically detected, and the functional enrichment for each cluster and pattern is provided. CLIC is freely available at http://gexp2.kaist.ac.kr/clic. Oxford University Press 2010-07-01 2010-06-06 /pmc/articles/PMC2896182/ /pubmed/20529873 http://dx.doi.org/10.1093/nar/gkq516 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Yun, Taegyun
Hwang, Taeho
Cha, Kihoon
Yi, Gwan-Su
CLIC: clustering analysis of large microarray datasets with individual dimension-based clustering
title CLIC: clustering analysis of large microarray datasets with individual dimension-based clustering
title_full CLIC: clustering analysis of large microarray datasets with individual dimension-based clustering
title_fullStr CLIC: clustering analysis of large microarray datasets with individual dimension-based clustering
title_full_unstemmed CLIC: clustering analysis of large microarray datasets with individual dimension-based clustering
title_short CLIC: clustering analysis of large microarray datasets with individual dimension-based clustering
title_sort clic: clustering analysis of large microarray datasets with individual dimension-based clustering
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2896182/
https://www.ncbi.nlm.nih.gov/pubmed/20529873
http://dx.doi.org/10.1093/nar/gkq516
work_keys_str_mv AT yuntaegyun clicclusteringanalysisoflargemicroarraydatasetswithindividualdimensionbasedclustering
AT hwangtaeho clicclusteringanalysisoflargemicroarraydatasetswithindividualdimensionbasedclustering
AT chakihoon clicclusteringanalysisoflargemicroarraydatasetswithindividualdimensionbasedclustering
AT yigwansu clicclusteringanalysisoflargemicroarraydatasetswithindividualdimensionbasedclustering