Cargando…

Semi-supervised gene shaving method for predicting low variation biological pathways from genome-wide data

BACKGROUND: The gene shaving algorithm and many other clustering algorithms identify gene clusters showing high variation across samples. However, gene expression in many signaling pathways show only modest and concordant changes that fail to be identified by these methods. The increasingly availabl...

Descripción completa

Detalles Bibliográficos
Autor principal: Zhu, Dongxiao
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648790/
https://www.ncbi.nlm.nih.gov/pubmed/19208157
http://dx.doi.org/10.1186/1471-2105-10-S1-S54
_version_ 1782164988937371648
author Zhu, Dongxiao
author_facet Zhu, Dongxiao
author_sort Zhu, Dongxiao
collection PubMed
description BACKGROUND: The gene shaving algorithm and many other clustering algorithms identify gene clusters showing high variation across samples. However, gene expression in many signaling pathways show only modest and concordant changes that fail to be identified by these methods. The increasingly available signaling pathway prior knowledge provide new opportunity to solve this problem. RESULTS: We propose an innovative semi-supervised gene clustering algorithm, where the original gene shaving algorithm was extended and generalized so that prior knowledge of signaling pathways can be incorporated. Different from other methods, our method identifies gene clusters showing concerted and modest expression variation as well as strong expression correlation. Using available pathway gene sets as prior knowledge, whether complete or incomplete, our algorithm is capable of forming tightly regulated gene clusters showing modest variation across samples. We demonstrate the advantages of our algorithm over the original gene shaving algorithm using two microarray data sets. The stability of the gene clusters was accessed using a jackknife approach. CONCLUSION: Our algorithm represents one of the first clustering algorithms that is particularly designed to identify signaling pathways of low and concordant gene expression variation. The discriminating power is achieved by manufacturing a principal component enriched by signaling pathways.
format Text
id pubmed-2648790
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26487902009-03-03 Semi-supervised gene shaving method for predicting low variation biological pathways from genome-wide data Zhu, Dongxiao BMC Bioinformatics Research BACKGROUND: The gene shaving algorithm and many other clustering algorithms identify gene clusters showing high variation across samples. However, gene expression in many signaling pathways show only modest and concordant changes that fail to be identified by these methods. The increasingly available signaling pathway prior knowledge provide new opportunity to solve this problem. RESULTS: We propose an innovative semi-supervised gene clustering algorithm, where the original gene shaving algorithm was extended and generalized so that prior knowledge of signaling pathways can be incorporated. Different from other methods, our method identifies gene clusters showing concerted and modest expression variation as well as strong expression correlation. Using available pathway gene sets as prior knowledge, whether complete or incomplete, our algorithm is capable of forming tightly regulated gene clusters showing modest variation across samples. We demonstrate the advantages of our algorithm over the original gene shaving algorithm using two microarray data sets. The stability of the gene clusters was accessed using a jackknife approach. CONCLUSION: Our algorithm represents one of the first clustering algorithms that is particularly designed to identify signaling pathways of low and concordant gene expression variation. The discriminating power is achieved by manufacturing a principal component enriched by signaling pathways. BioMed Central 2009-01-30 /pmc/articles/PMC2648790/ /pubmed/19208157 http://dx.doi.org/10.1186/1471-2105-10-S1-S54 Text en Copyright © 2009 Zhu; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Zhu, Dongxiao
Semi-supervised gene shaving method for predicting low variation biological pathways from genome-wide data
title Semi-supervised gene shaving method for predicting low variation biological pathways from genome-wide data
title_full Semi-supervised gene shaving method for predicting low variation biological pathways from genome-wide data
title_fullStr Semi-supervised gene shaving method for predicting low variation biological pathways from genome-wide data
title_full_unstemmed Semi-supervised gene shaving method for predicting low variation biological pathways from genome-wide data
title_short Semi-supervised gene shaving method for predicting low variation biological pathways from genome-wide data
title_sort semi-supervised gene shaving method for predicting low variation biological pathways from genome-wide data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648790/
https://www.ncbi.nlm.nih.gov/pubmed/19208157
http://dx.doi.org/10.1186/1471-2105-10-S1-S54
work_keys_str_mv AT zhudongxiao semisupervisedgeneshavingmethodforpredictinglowvariationbiologicalpathwaysfromgenomewidedata