Cargando…
A novel biclustering approach with iterative optimization to analyze gene expression data
OBJECTIVE: With the dramatic increase in microarray data, biclustering has become a promising tool for gene expression analysis. Biclustering has been proven to be superior over clustering in identifying multifunctional genes and searching for co-expressed genes under a few specific conditions; that...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Dove Medical Press
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3459542/ https://www.ncbi.nlm.nih.gov/pubmed/23055751 http://dx.doi.org/10.2147/AABC.S32622 |
_version_ | 1782244817738137600 |
---|---|
author | Sutheeworapong, Sawannee Ota, Motonori Ohta, Hiroyuki Kinoshita, Kengo |
author_facet | Sutheeworapong, Sawannee Ota, Motonori Ohta, Hiroyuki Kinoshita, Kengo |
author_sort | Sutheeworapong, Sawannee |
collection | PubMed |
description | OBJECTIVE: With the dramatic increase in microarray data, biclustering has become a promising tool for gene expression analysis. Biclustering has been proven to be superior over clustering in identifying multifunctional genes and searching for co-expressed genes under a few specific conditions; that is, a subgroup of all conditions. Biclustering based on a genetic algorithm (GA) has shown better performance than greedy algorithms, but the overlap state for biclusters must be treated more systematically. RESULTS: We developed a new biclustering algorithm (binary-iterative genetic algorithm [BIGA]), based on an iterative GA, by introducing a novel, ternary-digit chromosome encoding function. BIGA searches for a set of biclusters by iterative binary divisions that allow the overlap state to be explicitly considered. In addition, the average of the Pearson’s correlation coefficient was employed to measure the relationship of genes within a bicluster, instead of the mean square residual, the popular classical index. As compared to the six existing algorithms, BIGA found highly correlated biclusters, with large gene coverage and reasonable gene overlap. The gene ontology (GO) enrichment showed that most of the biclusters are significant, with at least one GO term over represented. CONCLUSION: BIGA is a powerful tool to analyze large amounts of gene expression data, and will facilitate the elucidation of the underlying functional mechanisms in living organisms. |
format | Online Article Text |
id | pubmed-3459542 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Dove Medical Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-34595422012-10-10 A novel biclustering approach with iterative optimization to analyze gene expression data Sutheeworapong, Sawannee Ota, Motonori Ohta, Hiroyuki Kinoshita, Kengo Adv Appl Bioinform Chem Methodology OBJECTIVE: With the dramatic increase in microarray data, biclustering has become a promising tool for gene expression analysis. Biclustering has been proven to be superior over clustering in identifying multifunctional genes and searching for co-expressed genes under a few specific conditions; that is, a subgroup of all conditions. Biclustering based on a genetic algorithm (GA) has shown better performance than greedy algorithms, but the overlap state for biclusters must be treated more systematically. RESULTS: We developed a new biclustering algorithm (binary-iterative genetic algorithm [BIGA]), based on an iterative GA, by introducing a novel, ternary-digit chromosome encoding function. BIGA searches for a set of biclusters by iterative binary divisions that allow the overlap state to be explicitly considered. In addition, the average of the Pearson’s correlation coefficient was employed to measure the relationship of genes within a bicluster, instead of the mean square residual, the popular classical index. As compared to the six existing algorithms, BIGA found highly correlated biclusters, with large gene coverage and reasonable gene overlap. The gene ontology (GO) enrichment showed that most of the biclusters are significant, with at least one GO term over represented. CONCLUSION: BIGA is a powerful tool to analyze large amounts of gene expression data, and will facilitate the elucidation of the underlying functional mechanisms in living organisms. Dove Medical Press 2012-09-07 /pmc/articles/PMC3459542/ /pubmed/23055751 http://dx.doi.org/10.2147/AABC.S32622 Text en © 2012 Sutheeworapong et al, publisher and licensee Dove Medical Press Ltd. This is an Open Access article which permits unrestricted noncommercial use, provided the original work is properly cited. |
spellingShingle | Methodology Sutheeworapong, Sawannee Ota, Motonori Ohta, Hiroyuki Kinoshita, Kengo A novel biclustering approach with iterative optimization to analyze gene expression data |
title | A novel biclustering approach with iterative optimization to analyze gene expression data |
title_full | A novel biclustering approach with iterative optimization to analyze gene expression data |
title_fullStr | A novel biclustering approach with iterative optimization to analyze gene expression data |
title_full_unstemmed | A novel biclustering approach with iterative optimization to analyze gene expression data |
title_short | A novel biclustering approach with iterative optimization to analyze gene expression data |
title_sort | novel biclustering approach with iterative optimization to analyze gene expression data |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3459542/ https://www.ncbi.nlm.nih.gov/pubmed/23055751 http://dx.doi.org/10.2147/AABC.S32622 |
work_keys_str_mv | AT sutheeworapongsawannee anovelbiclusteringapproachwithiterativeoptimizationtoanalyzegeneexpressiondata AT otamotonori anovelbiclusteringapproachwithiterativeoptimizationtoanalyzegeneexpressiondata AT ohtahiroyuki anovelbiclusteringapproachwithiterativeoptimizationtoanalyzegeneexpressiondata AT kinoshitakengo anovelbiclusteringapproachwithiterativeoptimizationtoanalyzegeneexpressiondata AT sutheeworapongsawannee novelbiclusteringapproachwithiterativeoptimizationtoanalyzegeneexpressiondata AT otamotonori novelbiclusteringapproachwithiterativeoptimizationtoanalyzegeneexpressiondata AT ohtahiroyuki novelbiclusteringapproachwithiterativeoptimizationtoanalyzegeneexpressiondata AT kinoshitakengo novelbiclusteringapproachwithiterativeoptimizationtoanalyzegeneexpressiondata |