Cargando…

Dynamically weighted clustering with noise set

Motivation: Various clustering methods have been applied to microarray gene expression data for identifying genes with similar expression profiles. As the biological annotation data accumulated, more and more genes have been organized into functional categories. Functionally related genes may be reg...

Descripción completa

Detalles Bibliográficos
Autores principales: Shen, Yijing, Sun, Wei, Li, Ker-Chau
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2815660/
https://www.ncbi.nlm.nih.gov/pubmed/20007256
http://dx.doi.org/10.1093/bioinformatics/btp671
_version_ 1782177029533204480
author Shen, Yijing
Sun, Wei
Li, Ker-Chau
author_facet Shen, Yijing
Sun, Wei
Li, Ker-Chau
author_sort Shen, Yijing
collection PubMed
description Motivation: Various clustering methods have been applied to microarray gene expression data for identifying genes with similar expression profiles. As the biological annotation data accumulated, more and more genes have been organized into functional categories. Functionally related genes may be regulated by common cellular signals, thus likely to be co-expressed. Consequently, utilizing the rapidly increasing functional annotation resources such as Gene Ontology (GO) to improve the performance of clustering methods is of great interest. On the opposite side of clustering, there are genes that have distinct expression profiles and do not co-express with other genes. Identification of these scattered genes could enhance the performance of clustering methods. Results: We developed a new clustering algorithm, Dynamically Weighted Clustering with Noise set (DWCN), which makes use of gene annotation information and allows for a set of scattered genes, the noise set, to be left out of the main clusters. We tested the DWCN method and contrasted its results with those obtained using several common clustering techniques on a simulated dataset as well as on two public datasets: the Stanford yeast cell-cycle gene expression data, and a gene expression dataset for a group of genetically different yeast segregants. Conclusion: Our method produces clusters with more consistent functional annotations and more coherent expression patterns than existing clustering techniques. Contact: yshen@stat.ucla.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-2815660
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-28156602010-02-03 Dynamically weighted clustering with noise set Shen, Yijing Sun, Wei Li, Ker-Chau Bioinformatics Original Papers Motivation: Various clustering methods have been applied to microarray gene expression data for identifying genes with similar expression profiles. As the biological annotation data accumulated, more and more genes have been organized into functional categories. Functionally related genes may be regulated by common cellular signals, thus likely to be co-expressed. Consequently, utilizing the rapidly increasing functional annotation resources such as Gene Ontology (GO) to improve the performance of clustering methods is of great interest. On the opposite side of clustering, there are genes that have distinct expression profiles and do not co-express with other genes. Identification of these scattered genes could enhance the performance of clustering methods. Results: We developed a new clustering algorithm, Dynamically Weighted Clustering with Noise set (DWCN), which makes use of gene annotation information and allows for a set of scattered genes, the noise set, to be left out of the main clusters. We tested the DWCN method and contrasted its results with those obtained using several common clustering techniques on a simulated dataset as well as on two public datasets: the Stanford yeast cell-cycle gene expression data, and a gene expression dataset for a group of genetically different yeast segregants. Conclusion: Our method produces clusters with more consistent functional annotations and more coherent expression patterns than existing clustering techniques. Contact: yshen@stat.ucla.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2010-02-01 2009-12-09 /pmc/articles/PMC2815660/ /pubmed/20007256 http://dx.doi.org/10.1093/bioinformatics/btp671 Text en © The Author(s) 2009. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Shen, Yijing
Sun, Wei
Li, Ker-Chau
Dynamically weighted clustering with noise set
title Dynamically weighted clustering with noise set
title_full Dynamically weighted clustering with noise set
title_fullStr Dynamically weighted clustering with noise set
title_full_unstemmed Dynamically weighted clustering with noise set
title_short Dynamically weighted clustering with noise set
title_sort dynamically weighted clustering with noise set
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2815660/
https://www.ncbi.nlm.nih.gov/pubmed/20007256
http://dx.doi.org/10.1093/bioinformatics/btp671
work_keys_str_mv AT shenyijing dynamicallyweightedclusteringwithnoiseset
AT sunwei dynamicallyweightedclusteringwithnoiseset
AT likerchau dynamicallyweightedclusteringwithnoiseset