Cargando…

Finding distinct biclusters from background in gene expression matrices

Biclustering, or the discovery of subsets of samples and genes that are homogeneous and distinct from the background, has become an important technique in analyzing current microarray datasets. Most existing biclustering methods define a bicluster type as a fixed (predefined) pattern and then trying...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Zhengpeng, Ao, Jiangni, Zhang, Xuegong
Formato: Texto
Lenguaje:English
Publicado: Biomedical Informatics Publishing Group 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2241930/
https://www.ncbi.nlm.nih.gov/pubmed/18305830
_version_ 1782150559159025664
author Wu, Zhengpeng
Ao, Jiangni
Zhang, Xuegong
author_facet Wu, Zhengpeng
Ao, Jiangni
Zhang, Xuegong
author_sort Wu, Zhengpeng
collection PubMed
description Biclustering, or the discovery of subsets of samples and genes that are homogeneous and distinct from the background, has become an important technique in analyzing current microarray datasets. Most existing biclustering methods define a bicluster type as a fixed (predefined) pattern and then trying to get results in some searching process. In this work, we propose a novel method for finding biclusters or 2-dimensional patterns that are significantly distinct from the background without the need for pre-defining a pattern within the bicluster. The method named Distinct 2-Dimensional Pattern Finder (D2D) is composed of an iterative reordering step of the rows and columns in the matrix using a new similarity measure, and a flexible scanning-and-growing step to identify the biclusters. Experiments on a large variety of simulation data show that the method works consistently well under different conditions, whereas the existing methods compared may work well under some certain conditions but fail under some other conditions. The impact of noise levels, overlapping degrees between clusters and different setting of parameters were also investigated, which indicated that the D2D method is robust against these factors. The proposed D2D method can efficiently discover many different types of biclusters given that they have distinctive features from the background. The computer program is available upon request.
format Text
id pubmed-2241930
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Biomedical Informatics Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-22419302008-02-27 Finding distinct biclusters from background in gene expression matrices Wu, Zhengpeng Ao, Jiangni Zhang, Xuegong Bioinformation Prediction Model Biclustering, or the discovery of subsets of samples and genes that are homogeneous and distinct from the background, has become an important technique in analyzing current microarray datasets. Most existing biclustering methods define a bicluster type as a fixed (predefined) pattern and then trying to get results in some searching process. In this work, we propose a novel method for finding biclusters or 2-dimensional patterns that are significantly distinct from the background without the need for pre-defining a pattern within the bicluster. The method named Distinct 2-Dimensional Pattern Finder (D2D) is composed of an iterative reordering step of the rows and columns in the matrix using a new similarity measure, and a flexible scanning-and-growing step to identify the biclusters. Experiments on a large variety of simulation data show that the method works consistently well under different conditions, whereas the existing methods compared may work well under some certain conditions but fail under some other conditions. The impact of noise levels, overlapping degrees between clusters and different setting of parameters were also investigated, which indicated that the D2D method is robust against these factors. The proposed D2D method can efficiently discover many different types of biclusters given that they have distinctive features from the background. The computer program is available upon request. Biomedical Informatics Publishing Group 2007-12-30 /pmc/articles/PMC2241930/ /pubmed/18305830 Text en © 2007 Biomedical Informatics Publishing Group This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited.
spellingShingle Prediction Model
Wu, Zhengpeng
Ao, Jiangni
Zhang, Xuegong
Finding distinct biclusters from background in gene expression matrices
title Finding distinct biclusters from background in gene expression matrices
title_full Finding distinct biclusters from background in gene expression matrices
title_fullStr Finding distinct biclusters from background in gene expression matrices
title_full_unstemmed Finding distinct biclusters from background in gene expression matrices
title_short Finding distinct biclusters from background in gene expression matrices
title_sort finding distinct biclusters from background in gene expression matrices
topic Prediction Model
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2241930/
https://www.ncbi.nlm.nih.gov/pubmed/18305830
work_keys_str_mv AT wuzhengpeng findingdistinctbiclustersfrombackgroundingeneexpressionmatrices
AT aojiangni findingdistinctbiclustersfrombackgroundingeneexpressionmatrices
AT zhangxuegong findingdistinctbiclustersfrombackgroundingeneexpressionmatrices