Cargando…

Clustering gene expression data using a diffraction‐inspired framework

BACKGROUND: The recent developments in microarray technology has allowed for the simultaneous measurement of gene expression levels. The large amount of captured data challenges conventional statistical tools for analysing and finding inherent correlations between genes and samples. The unsupervised...

Descripción completa

Detalles Bibliográficos
Autores principales: Dinger, Steven C, Van Wyk, Michael A, Carmona, Sergio, Rubin, David M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3549897/
https://www.ncbi.nlm.nih.gov/pubmed/23164195
http://dx.doi.org/10.1186/1475-925X-11-85
_version_ 1782256493636091904
author Dinger, Steven C
Van Wyk, Michael A
Carmona, Sergio
Rubin, David M
author_facet Dinger, Steven C
Van Wyk, Michael A
Carmona, Sergio
Rubin, David M
author_sort Dinger, Steven C
collection PubMed
description BACKGROUND: The recent developments in microarray technology has allowed for the simultaneous measurement of gene expression levels. The large amount of captured data challenges conventional statistical tools for analysing and finding inherent correlations between genes and samples. The unsupervised clustering approach is often used, resulting in the development of a wide variety of algorithms. Typical clustering algorithms require selecting certain parameters to operate, for instance the number of expected clusters, as well as defining a similarity measure to quantify the distance between data points. The diffraction‐based clustering algorithm however is designed to overcome this necessity for user‐defined parameters, as it is able to automatically search the data for any underlying structure. METHODS: The diffraction‐based clustering algorithm presented in this paper is tested using five well‐known expression datasets pertaining to cancerous tissue samples. The clustering results are then compared to those results obtained from conventional algorithms such as the k‐means, fuzzy c‐means, self‐organising map, hierarchical clustering algorithm, Gaussian mixture model and density‐based spatial clustering of applications with noise (DBSCAN). The performance of each algorithm is measured using an average external criterion and an average validity index. RESULTS: The diffraction‐based clustering algorithm is shown to be independent of the number of clusters as the algorithm searches the feature space and requires no form of parameter selection. The results show that the diffraction‐based clustering algorithm performs significantly better on the real biological datasets compared to the other existing algorithms. CONCLUSION: The results of the diffraction‐based clustering algorithm presented in this paper suggest that the method can provide researchers with a new tool for successfully analysing microarray data.
format Online
Article
Text
id pubmed-3549897
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35498972013-01-24 Clustering gene expression data using a diffraction‐inspired framework Dinger, Steven C Van Wyk, Michael A Carmona, Sergio Rubin, David M Biomed Eng Online Research BACKGROUND: The recent developments in microarray technology has allowed for the simultaneous measurement of gene expression levels. The large amount of captured data challenges conventional statistical tools for analysing and finding inherent correlations between genes and samples. The unsupervised clustering approach is often used, resulting in the development of a wide variety of algorithms. Typical clustering algorithms require selecting certain parameters to operate, for instance the number of expected clusters, as well as defining a similarity measure to quantify the distance between data points. The diffraction‐based clustering algorithm however is designed to overcome this necessity for user‐defined parameters, as it is able to automatically search the data for any underlying structure. METHODS: The diffraction‐based clustering algorithm presented in this paper is tested using five well‐known expression datasets pertaining to cancerous tissue samples. The clustering results are then compared to those results obtained from conventional algorithms such as the k‐means, fuzzy c‐means, self‐organising map, hierarchical clustering algorithm, Gaussian mixture model and density‐based spatial clustering of applications with noise (DBSCAN). The performance of each algorithm is measured using an average external criterion and an average validity index. RESULTS: The diffraction‐based clustering algorithm is shown to be independent of the number of clusters as the algorithm searches the feature space and requires no form of parameter selection. The results show that the diffraction‐based clustering algorithm performs significantly better on the real biological datasets compared to the other existing algorithms. CONCLUSION: The results of the diffraction‐based clustering algorithm presented in this paper suggest that the method can provide researchers with a new tool for successfully analysing microarray data. BioMed Central 2012-11-19 /pmc/articles/PMC3549897/ /pubmed/23164195 http://dx.doi.org/10.1186/1475-925X-11-85 Text en Copyright ©2012 Dinger et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Dinger, Steven C
Van Wyk, Michael A
Carmona, Sergio
Rubin, David M
Clustering gene expression data using a diffraction‐inspired framework
title Clustering gene expression data using a diffraction‐inspired framework
title_full Clustering gene expression data using a diffraction‐inspired framework
title_fullStr Clustering gene expression data using a diffraction‐inspired framework
title_full_unstemmed Clustering gene expression data using a diffraction‐inspired framework
title_short Clustering gene expression data using a diffraction‐inspired framework
title_sort clustering gene expression data using a diffraction‐inspired framework
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3549897/
https://www.ncbi.nlm.nih.gov/pubmed/23164195
http://dx.doi.org/10.1186/1475-925X-11-85
work_keys_str_mv AT dingerstevenc clusteringgeneexpressiondatausingadiffractioninspiredframework
AT vanwykmichaela clusteringgeneexpressiondatausingadiffractioninspiredframework
AT carmonasergio clusteringgeneexpressiondatausingadiffractioninspiredframework
AT rubindavidm clusteringgeneexpressiondatausingadiffractioninspiredframework