Cargando…
biDCG: A New Method for Discovering Global Features of DNA Microarray Data via an Iterative Re-Clustering Procedure
Biclustering techniques have become very popular in cancer genetics studies, as they are tools that are expected to connect phenotypes to genotypes, i.e. to identify subgroups of cancer patients based on the fact that they share similar gene expression patterns as well as to identify subgroups of ge...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4105625/ https://www.ncbi.nlm.nih.gov/pubmed/25047553 http://dx.doi.org/10.1371/journal.pone.0102445 |
_version_ | 1782327404988989440 |
---|---|
author | Chen, Chia-Pei Fushing, Hsieh Atwill, Rob Koehl, Patrice |
author_facet | Chen, Chia-Pei Fushing, Hsieh Atwill, Rob Koehl, Patrice |
author_sort | Chen, Chia-Pei |
collection | PubMed |
description | Biclustering techniques have become very popular in cancer genetics studies, as they are tools that are expected to connect phenotypes to genotypes, i.e. to identify subgroups of cancer patients based on the fact that they share similar gene expression patterns as well as to identify subgroups of genes that are specific to these subtypes of cancer and therefore could serve as biomarkers. In this paper we propose a new approach for identifying such relationships or biclusters between patients and gene expression profiles. This method, named biDCG, rests on two key concepts. First, it uses a new clustering technique, DCG-tree [Fushing et al, PLos One, 8, e56259 (2013)] that generates ultrametric topological spaces that capture the geometries of both the patient data set and the gene data set. Second, it optimizes the definitions of bicluster membership through an iterative two-way reclustering procedure in which patients and genes are reclustered in turn, based respectively on subsets of genes and patients defined in the previous round. We have validated biDCG on simulated and real data. Based on the simulated data we have shown that biDCG compares favorably to other biclustering techniques applied to cancer genomics data. The results on the real data sets have shown that biDCG is able to retrieve relevant biological information. |
format | Online Article Text |
id | pubmed-4105625 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-41056252014-07-23 biDCG: A New Method for Discovering Global Features of DNA Microarray Data via an Iterative Re-Clustering Procedure Chen, Chia-Pei Fushing, Hsieh Atwill, Rob Koehl, Patrice PLoS One Research Article Biclustering techniques have become very popular in cancer genetics studies, as they are tools that are expected to connect phenotypes to genotypes, i.e. to identify subgroups of cancer patients based on the fact that they share similar gene expression patterns as well as to identify subgroups of genes that are specific to these subtypes of cancer and therefore could serve as biomarkers. In this paper we propose a new approach for identifying such relationships or biclusters between patients and gene expression profiles. This method, named biDCG, rests on two key concepts. First, it uses a new clustering technique, DCG-tree [Fushing et al, PLos One, 8, e56259 (2013)] that generates ultrametric topological spaces that capture the geometries of both the patient data set and the gene data set. Second, it optimizes the definitions of bicluster membership through an iterative two-way reclustering procedure in which patients and genes are reclustered in turn, based respectively on subsets of genes and patients defined in the previous round. We have validated biDCG on simulated and real data. Based on the simulated data we have shown that biDCG compares favorably to other biclustering techniques applied to cancer genomics data. The results on the real data sets have shown that biDCG is able to retrieve relevant biological information. Public Library of Science 2014-07-21 /pmc/articles/PMC4105625/ /pubmed/25047553 http://dx.doi.org/10.1371/journal.pone.0102445 Text en © 2014 Chen et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Chen, Chia-Pei Fushing, Hsieh Atwill, Rob Koehl, Patrice biDCG: A New Method for Discovering Global Features of DNA Microarray Data via an Iterative Re-Clustering Procedure |
title | biDCG: A New Method for Discovering Global Features of DNA Microarray Data via an Iterative Re-Clustering Procedure |
title_full | biDCG: A New Method for Discovering Global Features of DNA Microarray Data via an Iterative Re-Clustering Procedure |
title_fullStr | biDCG: A New Method for Discovering Global Features of DNA Microarray Data via an Iterative Re-Clustering Procedure |
title_full_unstemmed | biDCG: A New Method for Discovering Global Features of DNA Microarray Data via an Iterative Re-Clustering Procedure |
title_short | biDCG: A New Method for Discovering Global Features of DNA Microarray Data via an Iterative Re-Clustering Procedure |
title_sort | bidcg: a new method for discovering global features of dna microarray data via an iterative re-clustering procedure |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4105625/ https://www.ncbi.nlm.nih.gov/pubmed/25047553 http://dx.doi.org/10.1371/journal.pone.0102445 |
work_keys_str_mv | AT chenchiapei bidcganewmethodfordiscoveringglobalfeaturesofdnamicroarraydataviaaniterativereclusteringprocedure AT fushinghsieh bidcganewmethodfordiscoveringglobalfeaturesofdnamicroarraydataviaaniterativereclusteringprocedure AT atwillrob bidcganewmethodfordiscoveringglobalfeaturesofdnamicroarraydataviaaniterativereclusteringprocedure AT koehlpatrice bidcganewmethodfordiscoveringglobalfeaturesofdnamicroarraydataviaaniterativereclusteringprocedure |