Cargando…

Gracob: a novel graph-based constant-column biclustering method for mining growth phenotype data

MOTIVATION: Growth phenotype profiling of genome-wide gene-deletion strains over stress conditions can offer a clear picture that the essentiality of genes depends on environmental conditions. Systematically identifying groups of genes from such high-throughput data that share similar patterns of co...

Descripción completa

Detalles Bibliográficos
Autores principales: Alzahrani, Majed, Kuwahara, Hiroyuki, Wang, Wei, Gao, Xin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5870648/
https://www.ncbi.nlm.nih.gov/pubmed/28379298
http://dx.doi.org/10.1093/bioinformatics/btx199
_version_ 1783309526000205824
author Alzahrani, Majed
Kuwahara, Hiroyuki
Wang, Wei
Gao, Xin
author_facet Alzahrani, Majed
Kuwahara, Hiroyuki
Wang, Wei
Gao, Xin
author_sort Alzahrani, Majed
collection PubMed
description MOTIVATION: Growth phenotype profiling of genome-wide gene-deletion strains over stress conditions can offer a clear picture that the essentiality of genes depends on environmental conditions. Systematically identifying groups of genes from such high-throughput data that share similar patterns of conditional essentiality and dispensability under various environmental conditions can elucidate how genetic interactions of the growth phenotype are regulated in response to the environment. RESULTS: We first demonstrate that detecting such ‘co-fit’ gene groups can be cast as a less well-studied problem in biclustering, i.e. constant-column biclustering. Despite significant advances in biclustering techniques, very few were designed for mining in growth phenotype data. Here, we propose Gracob, a novel, efficient graph-based method that casts and solves the constant-column biclustering problem as a maximal clique finding problem in a multipartite graph. We compared Gracob with a large collection of widely used biclustering methods that cover different types of algorithms designed to detect different types of biclusters. Gracob showed superior performance on finding co-fit genes over all the existing methods on both a variety of synthetic data sets with a wide range of settings, and three real growth phenotype datasets for E. coli, proteobacteria and yeast. AVAILABILITY AND IMPLEMENTATION: Our program is freely available for download at http://sfb.kaust.edu.sa/Pages/Software.aspx. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-5870648
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58706482018-04-05 Gracob: a novel graph-based constant-column biclustering method for mining growth phenotype data Alzahrani, Majed Kuwahara, Hiroyuki Wang, Wei Gao, Xin Bioinformatics Original Papers MOTIVATION: Growth phenotype profiling of genome-wide gene-deletion strains over stress conditions can offer a clear picture that the essentiality of genes depends on environmental conditions. Systematically identifying groups of genes from such high-throughput data that share similar patterns of conditional essentiality and dispensability under various environmental conditions can elucidate how genetic interactions of the growth phenotype are regulated in response to the environment. RESULTS: We first demonstrate that detecting such ‘co-fit’ gene groups can be cast as a less well-studied problem in biclustering, i.e. constant-column biclustering. Despite significant advances in biclustering techniques, very few were designed for mining in growth phenotype data. Here, we propose Gracob, a novel, efficient graph-based method that casts and solves the constant-column biclustering problem as a maximal clique finding problem in a multipartite graph. We compared Gracob with a large collection of widely used biclustering methods that cover different types of algorithms designed to detect different types of biclusters. Gracob showed superior performance on finding co-fit genes over all the existing methods on both a variety of synthetic data sets with a wide range of settings, and three real growth phenotype datasets for E. coli, proteobacteria and yeast. AVAILABILITY AND IMPLEMENTATION: Our program is freely available for download at http://sfb.kaust.edu.sa/Pages/Software.aspx. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2017-08-15 2017-04-04 /pmc/articles/PMC5870648/ /pubmed/28379298 http://dx.doi.org/10.1093/bioinformatics/btx199 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Alzahrani, Majed
Kuwahara, Hiroyuki
Wang, Wei
Gao, Xin
Gracob: a novel graph-based constant-column biclustering method for mining growth phenotype data
title Gracob: a novel graph-based constant-column biclustering method for mining growth phenotype data
title_full Gracob: a novel graph-based constant-column biclustering method for mining growth phenotype data
title_fullStr Gracob: a novel graph-based constant-column biclustering method for mining growth phenotype data
title_full_unstemmed Gracob: a novel graph-based constant-column biclustering method for mining growth phenotype data
title_short Gracob: a novel graph-based constant-column biclustering method for mining growth phenotype data
title_sort gracob: a novel graph-based constant-column biclustering method for mining growth phenotype data
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5870648/
https://www.ncbi.nlm.nih.gov/pubmed/28379298
http://dx.doi.org/10.1093/bioinformatics/btx199
work_keys_str_mv AT alzahranimajed gracobanovelgraphbasedconstantcolumnbiclusteringmethodformininggrowthphenotypedata
AT kuwaharahiroyuki gracobanovelgraphbasedconstantcolumnbiclusteringmethodformininggrowthphenotypedata
AT wangwei gracobanovelgraphbasedconstantcolumnbiclusteringmethodformininggrowthphenotypedata
AT gaoxin gracobanovelgraphbasedconstantcolumnbiclusteringmethodformininggrowthphenotypedata