Cargando…

DeBi: Discovering Differentially Expressed Biclusters using a Frequent Itemset Approach

BACKGROUND: The analysis of massive high throughput data via clustering algorithms is very important for elucidating gene functions in biological systems. However, traditional clustering methods have several drawbacks. Biclustering overcomes these limitations by grouping genes and samples simultaneo...

Descripción completa

Detalles Bibliográficos
Autores principales:	Serin, Akdes, Vingron, Martin
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2011
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3152888/ https://www.ncbi.nlm.nih.gov/pubmed/21699691 http://dx.doi.org/10.1186/1748-7188-6-18

_version_	1782209812118896640
author	Serin, Akdes Vingron, Martin
author_facet	Serin, Akdes Vingron, Martin
author_sort	Serin, Akdes
collection	PubMed
description	BACKGROUND: The analysis of massive high throughput data via clustering algorithms is very important for elucidating gene functions in biological systems. However, traditional clustering methods have several drawbacks. Biclustering overcomes these limitations by grouping genes and samples simultaneously. It discovers subsets of genes that are co-expressed in certain samples. Recent studies showed that biclustering has a great potential in detecting marker genes that are associated with certain tissues or diseases. Several biclustering algorithms have been proposed. However, it is still a challenge to find biclusters that are significant based on biological validation measures. Besides that, there is a need for a biclustering algorithm that is capable of analyzing very large datasets in reasonable time. RESULTS: Here we present a fast biclustering algorithm called DeBi (Differentially Expressed BIclusters). The algorithm is based on a well known data mining approach called frequent itemset. It discovers maximum size homogeneous biclusters in which each gene is strongly associated with a subset of samples. We evaluate the performance of DeBi on a yeast dataset, on synthetic datasets and on human datasets. CONCLUSIONS: We demonstrate that the DeBi algorithm provides functionally more coherent gene sets compared to standard clustering or biclustering algorithms using biological validation measures such as Gene Ontology term and Transcription Factor Binding Site enrichment. We show that DeBi is a computationally efficient and powerful tool in analyzing large datasets. The method is also applicable on multiple gene expression datasets coming from different labs or platforms.
format	Online Article Text
id	pubmed-3152888
institution	National Center for Biotechnology Information
language	English
publishDate	2011
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-31528882011-08-10 DeBi: Discovering Differentially Expressed Biclusters using a Frequent Itemset Approach Serin, Akdes Vingron, Martin Algorithms Mol Biol Research BACKGROUND: The analysis of massive high throughput data via clustering algorithms is very important for elucidating gene functions in biological systems. However, traditional clustering methods have several drawbacks. Biclustering overcomes these limitations by grouping genes and samples simultaneously. It discovers subsets of genes that are co-expressed in certain samples. Recent studies showed that biclustering has a great potential in detecting marker genes that are associated with certain tissues or diseases. Several biclustering algorithms have been proposed. However, it is still a challenge to find biclusters that are significant based on biological validation measures. Besides that, there is a need for a biclustering algorithm that is capable of analyzing very large datasets in reasonable time. RESULTS: Here we present a fast biclustering algorithm called DeBi (Differentially Expressed BIclusters). The algorithm is based on a well known data mining approach called frequent itemset. It discovers maximum size homogeneous biclusters in which each gene is strongly associated with a subset of samples. We evaluate the performance of DeBi on a yeast dataset, on synthetic datasets and on human datasets. CONCLUSIONS: We demonstrate that the DeBi algorithm provides functionally more coherent gene sets compared to standard clustering or biclustering algorithms using biological validation measures such as Gene Ontology term and Transcription Factor Binding Site enrichment. We show that DeBi is a computationally efficient and powerful tool in analyzing large datasets. The method is also applicable on multiple gene expression datasets coming from different labs or platforms. BioMed Central 2011-06-23 /pmc/articles/PMC3152888/ /pubmed/21699691 http://dx.doi.org/10.1186/1748-7188-6-18 Text en Copyright ©2011 Serin and Vingron; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Serin, Akdes Vingron, Martin DeBi: Discovering Differentially Expressed Biclusters using a Frequent Itemset Approach
title	DeBi: Discovering Differentially Expressed Biclusters using a Frequent Itemset Approach
title_full	DeBi: Discovering Differentially Expressed Biclusters using a Frequent Itemset Approach
title_fullStr	DeBi: Discovering Differentially Expressed Biclusters using a Frequent Itemset Approach
title_full_unstemmed	DeBi: Discovering Differentially Expressed Biclusters using a Frequent Itemset Approach
title_short	DeBi: Discovering Differentially Expressed Biclusters using a Frequent Itemset Approach
title_sort	debi: discovering differentially expressed biclusters using a frequent itemset approach
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3152888/ https://www.ncbi.nlm.nih.gov/pubmed/21699691 http://dx.doi.org/10.1186/1748-7188-6-18
work_keys_str_mv	AT serinakdes debidiscoveringdifferentiallyexpressedbiclustersusingafrequentitemsetapproach AT vingronmartin debidiscoveringdifferentiallyexpressedbiclustersusingafrequentitemsetapproach

DeBi: Discovering Differentially Expressed Biclusters using a Frequent Itemset Approach

Ejemplares similares