Cargando…

ICGE: an R package for detecting relevant clusters and atypical units in gene expression

BACKGROUND: Gene expression technologies have opened up new ways to diagnose and treat cancer and other diseases. Clustering algorithms are a useful approach with which to analyze genome expression data. They attempt to partition the genes into groups exhibiting similar patterns of variation in expr...

Descripción completa

Detalles Bibliográficos
Autores principales: Irigoien, Itziar, Sierra, Basilio, Arenas, Concepcion
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3364157/
https://www.ncbi.nlm.nih.gov/pubmed/22330431
http://dx.doi.org/10.1186/1471-2105-13-30
_version_ 1782234494434017280
author Irigoien, Itziar
Sierra, Basilio
Arenas, Concepcion
author_facet Irigoien, Itziar
Sierra, Basilio
Arenas, Concepcion
author_sort Irigoien, Itziar
collection PubMed
description BACKGROUND: Gene expression technologies have opened up new ways to diagnose and treat cancer and other diseases. Clustering algorithms are a useful approach with which to analyze genome expression data. They attempt to partition the genes into groups exhibiting similar patterns of variation in expression level. An important problem associated with gene classification is to discern whether the clustering process can find a relevant partition as well as the identification of new genes classes. There are two key aspects to classification: the estimation of the number of clusters, and the decision as to whether a new unit (gene, tumor sample...) belongs to one of these previously identified clusters or to a new group. RESULTS: ICGE is a user-friendly R package which provides many functions related to this problem: identify the number of clusters using mixed variables, usually found by applied biomedical researchers; detect whether the data have a cluster structure; identify whether a new unit belongs to one of the pre-identified clusters or to a novel group, and classify new units into the corresponding cluster. The functions in the ICGE package are accompanied by help files and easy examples to facilitate its use. CONCLUSIONS: We demonstrate the utility of ICGE by analyzing simulated and real data sets. The results show that ICGE could be very useful to a broad research community.
format Online
Article
Text
id pubmed-3364157
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33641572012-06-01 ICGE: an R package for detecting relevant clusters and atypical units in gene expression Irigoien, Itziar Sierra, Basilio Arenas, Concepcion BMC Bioinformatics Software BACKGROUND: Gene expression technologies have opened up new ways to diagnose and treat cancer and other diseases. Clustering algorithms are a useful approach with which to analyze genome expression data. They attempt to partition the genes into groups exhibiting similar patterns of variation in expression level. An important problem associated with gene classification is to discern whether the clustering process can find a relevant partition as well as the identification of new genes classes. There are two key aspects to classification: the estimation of the number of clusters, and the decision as to whether a new unit (gene, tumor sample...) belongs to one of these previously identified clusters or to a new group. RESULTS: ICGE is a user-friendly R package which provides many functions related to this problem: identify the number of clusters using mixed variables, usually found by applied biomedical researchers; detect whether the data have a cluster structure; identify whether a new unit belongs to one of the pre-identified clusters or to a novel group, and classify new units into the corresponding cluster. The functions in the ICGE package are accompanied by help files and easy examples to facilitate its use. CONCLUSIONS: We demonstrate the utility of ICGE by analyzing simulated and real data sets. The results show that ICGE could be very useful to a broad research community. BioMed Central 2012-02-13 /pmc/articles/PMC3364157/ /pubmed/22330431 http://dx.doi.org/10.1186/1471-2105-13-30 Text en Copyright ©2012 Irigoien et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Irigoien, Itziar
Sierra, Basilio
Arenas, Concepcion
ICGE: an R package for detecting relevant clusters and atypical units in gene expression
title ICGE: an R package for detecting relevant clusters and atypical units in gene expression
title_full ICGE: an R package for detecting relevant clusters and atypical units in gene expression
title_fullStr ICGE: an R package for detecting relevant clusters and atypical units in gene expression
title_full_unstemmed ICGE: an R package for detecting relevant clusters and atypical units in gene expression
title_short ICGE: an R package for detecting relevant clusters and atypical units in gene expression
title_sort icge: an r package for detecting relevant clusters and atypical units in gene expression
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3364157/
https://www.ncbi.nlm.nih.gov/pubmed/22330431
http://dx.doi.org/10.1186/1471-2105-13-30
work_keys_str_mv AT irigoienitziar icgeanrpackagefordetectingrelevantclustersandatypicalunitsingeneexpression
AT sierrabasilio icgeanrpackagefordetectingrelevantclustersandatypicalunitsingeneexpression
AT arenasconcepcion icgeanrpackagefordetectingrelevantclustersandatypicalunitsingeneexpression