Cargando…

biDCG: A New Method for Discovering Global Features of DNA Microarray Data via an Iterative Re-Clustering Procedure

Biclustering techniques have become very popular in cancer genetics studies, as they are tools that are expected to connect phenotypes to genotypes, i.e. to identify subgroups of cancer patients based on the fact that they share similar gene expression patterns as well as to identify subgroups of ge...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Chia-Pei, Fushing, Hsieh, Atwill, Rob, Koehl, Patrice
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4105625/
https://www.ncbi.nlm.nih.gov/pubmed/25047553
http://dx.doi.org/10.1371/journal.pone.0102445
_version_ 1782327404988989440
author Chen, Chia-Pei
Fushing, Hsieh
Atwill, Rob
Koehl, Patrice
author_facet Chen, Chia-Pei
Fushing, Hsieh
Atwill, Rob
Koehl, Patrice
author_sort Chen, Chia-Pei
collection PubMed
description Biclustering techniques have become very popular in cancer genetics studies, as they are tools that are expected to connect phenotypes to genotypes, i.e. to identify subgroups of cancer patients based on the fact that they share similar gene expression patterns as well as to identify subgroups of genes that are specific to these subtypes of cancer and therefore could serve as biomarkers. In this paper we propose a new approach for identifying such relationships or biclusters between patients and gene expression profiles. This method, named biDCG, rests on two key concepts. First, it uses a new clustering technique, DCG-tree [Fushing et al, PLos One, 8, e56259 (2013)] that generates ultrametric topological spaces that capture the geometries of both the patient data set and the gene data set. Second, it optimizes the definitions of bicluster membership through an iterative two-way reclustering procedure in which patients and genes are reclustered in turn, based respectively on subsets of genes and patients defined in the previous round. We have validated biDCG on simulated and real data. Based on the simulated data we have shown that biDCG compares favorably to other biclustering techniques applied to cancer genomics data. The results on the real data sets have shown that biDCG is able to retrieve relevant biological information.
format Online
Article
Text
id pubmed-4105625
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-41056252014-07-23 biDCG: A New Method for Discovering Global Features of DNA Microarray Data via an Iterative Re-Clustering Procedure Chen, Chia-Pei Fushing, Hsieh Atwill, Rob Koehl, Patrice PLoS One Research Article Biclustering techniques have become very popular in cancer genetics studies, as they are tools that are expected to connect phenotypes to genotypes, i.e. to identify subgroups of cancer patients based on the fact that they share similar gene expression patterns as well as to identify subgroups of genes that are specific to these subtypes of cancer and therefore could serve as biomarkers. In this paper we propose a new approach for identifying such relationships or biclusters between patients and gene expression profiles. This method, named biDCG, rests on two key concepts. First, it uses a new clustering technique, DCG-tree [Fushing et al, PLos One, 8, e56259 (2013)] that generates ultrametric topological spaces that capture the geometries of both the patient data set and the gene data set. Second, it optimizes the definitions of bicluster membership through an iterative two-way reclustering procedure in which patients and genes are reclustered in turn, based respectively on subsets of genes and patients defined in the previous round. We have validated biDCG on simulated and real data. Based on the simulated data we have shown that biDCG compares favorably to other biclustering techniques applied to cancer genomics data. The results on the real data sets have shown that biDCG is able to retrieve relevant biological information. Public Library of Science 2014-07-21 /pmc/articles/PMC4105625/ /pubmed/25047553 http://dx.doi.org/10.1371/journal.pone.0102445 Text en © 2014 Chen et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Chen, Chia-Pei
Fushing, Hsieh
Atwill, Rob
Koehl, Patrice
biDCG: A New Method for Discovering Global Features of DNA Microarray Data via an Iterative Re-Clustering Procedure
title biDCG: A New Method for Discovering Global Features of DNA Microarray Data via an Iterative Re-Clustering Procedure
title_full biDCG: A New Method for Discovering Global Features of DNA Microarray Data via an Iterative Re-Clustering Procedure
title_fullStr biDCG: A New Method for Discovering Global Features of DNA Microarray Data via an Iterative Re-Clustering Procedure
title_full_unstemmed biDCG: A New Method for Discovering Global Features of DNA Microarray Data via an Iterative Re-Clustering Procedure
title_short biDCG: A New Method for Discovering Global Features of DNA Microarray Data via an Iterative Re-Clustering Procedure
title_sort bidcg: a new method for discovering global features of dna microarray data via an iterative re-clustering procedure
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4105625/
https://www.ncbi.nlm.nih.gov/pubmed/25047553
http://dx.doi.org/10.1371/journal.pone.0102445
work_keys_str_mv AT chenchiapei bidcganewmethodfordiscoveringglobalfeaturesofdnamicroarraydataviaaniterativereclusteringprocedure
AT fushinghsieh bidcganewmethodfordiscoveringglobalfeaturesofdnamicroarraydataviaaniterativereclusteringprocedure
AT atwillrob bidcganewmethodfordiscoveringglobalfeaturesofdnamicroarraydataviaaniterativereclusteringprocedure
AT koehlpatrice bidcganewmethodfordiscoveringglobalfeaturesofdnamicroarraydataviaaniterativereclusteringprocedure