Cargando…

A novel biclustering approach with iterative optimization to analyze gene expression data

OBJECTIVE: With the dramatic increase in microarray data, biclustering has become a promising tool for gene expression analysis. Biclustering has been proven to be superior over clustering in identifying multifunctional genes and searching for co-expressed genes under a few specific conditions; that...

Descripción completa

Detalles Bibliográficos
Autores principales: Sutheeworapong, Sawannee, Ota, Motonori, Ohta, Hiroyuki, Kinoshita, Kengo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Dove Medical Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3459542/
https://www.ncbi.nlm.nih.gov/pubmed/23055751
http://dx.doi.org/10.2147/AABC.S32622
_version_ 1782244817738137600
author Sutheeworapong, Sawannee
Ota, Motonori
Ohta, Hiroyuki
Kinoshita, Kengo
author_facet Sutheeworapong, Sawannee
Ota, Motonori
Ohta, Hiroyuki
Kinoshita, Kengo
author_sort Sutheeworapong, Sawannee
collection PubMed
description OBJECTIVE: With the dramatic increase in microarray data, biclustering has become a promising tool for gene expression analysis. Biclustering has been proven to be superior over clustering in identifying multifunctional genes and searching for co-expressed genes under a few specific conditions; that is, a subgroup of all conditions. Biclustering based on a genetic algorithm (GA) has shown better performance than greedy algorithms, but the overlap state for biclusters must be treated more systematically. RESULTS: We developed a new biclustering algorithm (binary-iterative genetic algorithm [BIGA]), based on an iterative GA, by introducing a novel, ternary-digit chromosome encoding function. BIGA searches for a set of biclusters by iterative binary divisions that allow the overlap state to be explicitly considered. In addition, the average of the Pearson’s correlation coefficient was employed to measure the relationship of genes within a bicluster, instead of the mean square residual, the popular classical index. As compared to the six existing algorithms, BIGA found highly correlated biclusters, with large gene coverage and reasonable gene overlap. The gene ontology (GO) enrichment showed that most of the biclusters are significant, with at least one GO term over represented. CONCLUSION: BIGA is a powerful tool to analyze large amounts of gene expression data, and will facilitate the elucidation of the underlying functional mechanisms in living organisms.
format Online
Article
Text
id pubmed-3459542
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Dove Medical Press
record_format MEDLINE/PubMed
spelling pubmed-34595422012-10-10 A novel biclustering approach with iterative optimization to analyze gene expression data Sutheeworapong, Sawannee Ota, Motonori Ohta, Hiroyuki Kinoshita, Kengo Adv Appl Bioinform Chem Methodology OBJECTIVE: With the dramatic increase in microarray data, biclustering has become a promising tool for gene expression analysis. Biclustering has been proven to be superior over clustering in identifying multifunctional genes and searching for co-expressed genes under a few specific conditions; that is, a subgroup of all conditions. Biclustering based on a genetic algorithm (GA) has shown better performance than greedy algorithms, but the overlap state for biclusters must be treated more systematically. RESULTS: We developed a new biclustering algorithm (binary-iterative genetic algorithm [BIGA]), based on an iterative GA, by introducing a novel, ternary-digit chromosome encoding function. BIGA searches for a set of biclusters by iterative binary divisions that allow the overlap state to be explicitly considered. In addition, the average of the Pearson’s correlation coefficient was employed to measure the relationship of genes within a bicluster, instead of the mean square residual, the popular classical index. As compared to the six existing algorithms, BIGA found highly correlated biclusters, with large gene coverage and reasonable gene overlap. The gene ontology (GO) enrichment showed that most of the biclusters are significant, with at least one GO term over represented. CONCLUSION: BIGA is a powerful tool to analyze large amounts of gene expression data, and will facilitate the elucidation of the underlying functional mechanisms in living organisms. Dove Medical Press 2012-09-07 /pmc/articles/PMC3459542/ /pubmed/23055751 http://dx.doi.org/10.2147/AABC.S32622 Text en © 2012 Sutheeworapong et al, publisher and licensee Dove Medical Press Ltd. This is an Open Access article which permits unrestricted noncommercial use, provided the original work is properly cited.
spellingShingle Methodology
Sutheeworapong, Sawannee
Ota, Motonori
Ohta, Hiroyuki
Kinoshita, Kengo
A novel biclustering approach with iterative optimization to analyze gene expression data
title A novel biclustering approach with iterative optimization to analyze gene expression data
title_full A novel biclustering approach with iterative optimization to analyze gene expression data
title_fullStr A novel biclustering approach with iterative optimization to analyze gene expression data
title_full_unstemmed A novel biclustering approach with iterative optimization to analyze gene expression data
title_short A novel biclustering approach with iterative optimization to analyze gene expression data
title_sort novel biclustering approach with iterative optimization to analyze gene expression data
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3459542/
https://www.ncbi.nlm.nih.gov/pubmed/23055751
http://dx.doi.org/10.2147/AABC.S32622
work_keys_str_mv AT sutheeworapongsawannee anovelbiclusteringapproachwithiterativeoptimizationtoanalyzegeneexpressiondata
AT otamotonori anovelbiclusteringapproachwithiterativeoptimizationtoanalyzegeneexpressiondata
AT ohtahiroyuki anovelbiclusteringapproachwithiterativeoptimizationtoanalyzegeneexpressiondata
AT kinoshitakengo anovelbiclusteringapproachwithiterativeoptimizationtoanalyzegeneexpressiondata
AT sutheeworapongsawannee novelbiclusteringapproachwithiterativeoptimizationtoanalyzegeneexpressiondata
AT otamotonori novelbiclusteringapproachwithiterativeoptimizationtoanalyzegeneexpressiondata
AT ohtahiroyuki novelbiclusteringapproachwithiterativeoptimizationtoanalyzegeneexpressiondata
AT kinoshitakengo novelbiclusteringapproachwithiterativeoptimizationtoanalyzegeneexpressiondata