
Supervised clustering of genes

BACKGROUND: We focus on microarray data where experiments monitor gene expression in different tissues and where each experiment is equipped with an additional response variable such as a cancer type. Although the number of measured genes is in the thousands, it is assumed that only a few marker com...


Bibliographic Details
Main authors: Dettling, Marcel; Bühlmann, Peter
Format: Text
Language: English
Published: BioMed Central 2002
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC151171/
https://www.ncbi.nlm.nih.gov/pubmed/12537558
_version_ 1782120660350271488
author Dettling, Marcel
Bühlmann, Peter
author_facet Dettling, Marcel
Bühlmann, Peter
author_sort Dettling, Marcel
collection PubMed
description BACKGROUND: We focus on microarray data where experiments monitor gene expression in different tissues and where each experiment is equipped with an additional response variable such as a cancer type. Although the number of measured genes is in the thousands, it is assumed that only a few marker components of gene subsets determine the type of a tissue. Here we present a new method for finding such groups of genes by directly incorporating the response variables into the grouping process, yielding a supervised clustering algorithm for genes. RESULTS: An empirical study on eight publicly available microarray datasets shows that our algorithm identifies gene clusters with excellent predictive potential, often superior to classification with state-of-the-art methods based on single genes. Permutation tests and bootstrapping provide evidence that the output is reasonably stable and more than a noise artifact. CONCLUSIONS: In contrast to other methods such as hierarchical clustering, our algorithm identifies several gene clusters whose expression levels clearly distinguish the different tissue types. The identification of such gene clusters is potentially useful for medical diagnostics and may at the same time reveal insights into functional genomics.
format Text
id pubmed-151171
institution National Center for Biotechnology Information
language English
publishDate 2002
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-151171 2003-03-13 Supervised clustering of genes Dettling, Marcel Bühlmann, Peter Genome Biol Research BACKGROUND: We focus on microarray data where experiments monitor gene expression in different tissues and where each experiment is equipped with an additional response variable such as a cancer type. Although the number of measured genes is in the thousands, it is assumed that only a few marker components of gene subsets determine the type of a tissue. Here we present a new method for finding such groups of genes by directly incorporating the response variables into the grouping process, yielding a supervised clustering algorithm for genes. RESULTS: An empirical study on eight publicly available microarray datasets shows that our algorithm identifies gene clusters with excellent predictive potential, often superior to classification with state-of-the-art methods based on single genes. Permutation tests and bootstrapping provide evidence that the output is reasonably stable and more than a noise artifact. CONCLUSIONS: In contrast to other methods such as hierarchical clustering, our algorithm identifies several gene clusters whose expression levels clearly distinguish the different tissue types. The identification of such gene clusters is potentially useful for medical diagnostics and may at the same time reveal insights into functional genomics. BioMed Central 2002 2002-11-25 /pmc/articles/PMC151171/ /pubmed/12537558 Text en Copyright © 2002 Dettling and Bühlmann, licensee BioMed Central Ltd
spellingShingle Research
Dettling, Marcel
Bühlmann, Peter
Supervised clustering of genes
title Supervised clustering of genes
title_full Supervised clustering of genes
title_fullStr Supervised clustering of genes
title_full_unstemmed Supervised clustering of genes
title_short Supervised clustering of genes
title_sort supervised clustering of genes
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC151171/
https://www.ncbi.nlm.nih.gov/pubmed/12537558
work_keys_str_mv AT dettlingmarcel supervisedclusteringofgenes
AT buhlmannpeter supervisedclusteringofgenes