Cargando…
Clustering gene expression data using a diffraction‐inspired framework
BACKGROUND: The recent developments in microarray technology has allowed for the simultaneous measurement of gene expression levels. The large amount of captured data challenges conventional statistical tools for analysing and finding inherent correlations between genes and samples. The unsupervised...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3549897/ https://www.ncbi.nlm.nih.gov/pubmed/23164195 http://dx.doi.org/10.1186/1475-925X-11-85 |
_version_ | 1782256493636091904 |
---|---|
author | Dinger, Steven C Van Wyk, Michael A Carmona, Sergio Rubin, David M |
author_facet | Dinger, Steven C Van Wyk, Michael A Carmona, Sergio Rubin, David M |
author_sort | Dinger, Steven C |
collection | PubMed |
description | BACKGROUND: The recent developments in microarray technology has allowed for the simultaneous measurement of gene expression levels. The large amount of captured data challenges conventional statistical tools for analysing and finding inherent correlations between genes and samples. The unsupervised clustering approach is often used, resulting in the development of a wide variety of algorithms. Typical clustering algorithms require selecting certain parameters to operate, for instance the number of expected clusters, as well as defining a similarity measure to quantify the distance between data points. The diffraction‐based clustering algorithm however is designed to overcome this necessity for user‐defined parameters, as it is able to automatically search the data for any underlying structure. METHODS: The diffraction‐based clustering algorithm presented in this paper is tested using five well‐known expression datasets pertaining to cancerous tissue samples. The clustering results are then compared to those results obtained from conventional algorithms such as the k‐means, fuzzy c‐means, self‐organising map, hierarchical clustering algorithm, Gaussian mixture model and density‐based spatial clustering of applications with noise (DBSCAN). The performance of each algorithm is measured using an average external criterion and an average validity index. RESULTS: The diffraction‐based clustering algorithm is shown to be independent of the number of clusters as the algorithm searches the feature space and requires no form of parameter selection. The results show that the diffraction‐based clustering algorithm performs significantly better on the real biological datasets compared to the other existing algorithms. CONCLUSION: The results of the diffraction‐based clustering algorithm presented in this paper suggest that the method can provide researchers with a new tool for successfully analysing microarray data. |
format | Online Article Text |
id | pubmed-3549897 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-35498972013-01-24 Clustering gene expression data using a diffraction‐inspired framework Dinger, Steven C Van Wyk, Michael A Carmona, Sergio Rubin, David M Biomed Eng Online Research BACKGROUND: The recent developments in microarray technology has allowed for the simultaneous measurement of gene expression levels. The large amount of captured data challenges conventional statistical tools for analysing and finding inherent correlations between genes and samples. The unsupervised clustering approach is often used, resulting in the development of a wide variety of algorithms. Typical clustering algorithms require selecting certain parameters to operate, for instance the number of expected clusters, as well as defining a similarity measure to quantify the distance between data points. The diffraction‐based clustering algorithm however is designed to overcome this necessity for user‐defined parameters, as it is able to automatically search the data for any underlying structure. METHODS: The diffraction‐based clustering algorithm presented in this paper is tested using five well‐known expression datasets pertaining to cancerous tissue samples. The clustering results are then compared to those results obtained from conventional algorithms such as the k‐means, fuzzy c‐means, self‐organising map, hierarchical clustering algorithm, Gaussian mixture model and density‐based spatial clustering of applications with noise (DBSCAN). The performance of each algorithm is measured using an average external criterion and an average validity index. RESULTS: The diffraction‐based clustering algorithm is shown to be independent of the number of clusters as the algorithm searches the feature space and requires no form of parameter selection. The results show that the diffraction‐based clustering algorithm performs significantly better on the real biological datasets compared to the other existing algorithms. CONCLUSION: The results of the diffraction‐based clustering algorithm presented in this paper suggest that the method can provide researchers with a new tool for successfully analysing microarray data. BioMed Central 2012-11-19 /pmc/articles/PMC3549897/ /pubmed/23164195 http://dx.doi.org/10.1186/1475-925X-11-85 Text en Copyright ©2012 Dinger et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Dinger, Steven C Van Wyk, Michael A Carmona, Sergio Rubin, David M Clustering gene expression data using a diffraction‐inspired framework |
title | Clustering gene expression data using a diffraction‐inspired framework |
title_full | Clustering gene expression data using a diffraction‐inspired framework |
title_fullStr | Clustering gene expression data using a diffraction‐inspired framework |
title_full_unstemmed | Clustering gene expression data using a diffraction‐inspired framework |
title_short | Clustering gene expression data using a diffraction‐inspired framework |
title_sort | clustering gene expression data using a diffraction‐inspired framework |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3549897/ https://www.ncbi.nlm.nih.gov/pubmed/23164195 http://dx.doi.org/10.1186/1475-925X-11-85 |
work_keys_str_mv | AT dingerstevenc clusteringgeneexpressiondatausingadiffractioninspiredframework AT vanwykmichaela clusteringgeneexpressiondatausingadiffractioninspiredframework AT carmonasergio clusteringgeneexpressiondatausingadiffractioninspiredframework AT rubindavidm clusteringgeneexpressiondatausingadiffractioninspiredframework |