SOM-Based Class Discovery Exploring the ICA-Reduced Features of Microarray Expression Profiles

Bibliographic Details
Main authors: Dragomir, Andrei; Mavroudi, Seferina; Bezerianos, Anastasios
Format: Text
Language: English
Published: Hindawi Publishing Corporation, 2004
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447468/
https://www.ncbi.nlm.nih.gov/pubmed/18629176
http://dx.doi.org/10.1002/cfg.444
Description
Summary: Gene expression datasets are large and complex, with many variables and unknown internal structure. We apply independent component analysis (ICA) to derive a less redundant representation of the expression data. The decomposition produces components with minimal statistical dependence and reveals biologically relevant information. We then apply cluster analysis to the transformed data; clustering is an important and popular tool for obtaining an initial understanding of the data and is usually employed for class discovery. The proposed self-organizing map (SOM)-based clustering algorithm automatically determines the number of ‘natural’ subgroups in the data, aided in this task by the available prior knowledge of the functional categories of genes. An entropy criterion allows each gene to be assigned to multiple classes, which better matches the underlying biology. These features do not come at the cost of simplicity: the map grows on a simple grid structure, and the learning algorithm remains the same as Kohonen’s.
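
For orientation, the summary notes that the learning rule is unchanged from Kohonen's. The following is a minimal sketch of that standard SOM update on a fixed grid, written in plain Python. It is not the authors' implementation: the growing map, the use of prior functional categories, the entropy criterion and the ICA step are all omitted, and every name here (`best_matching_unit`, `som_step`, `train_som`), the decay schedule and the toy data are assumptions made for illustration.

```python
import math


def best_matching_unit(weights, x):
    """Grid coordinate of the node whose weight vector is nearest to x."""
    return min(
        weights,
        key=lambda node: sum((w - v) ** 2 for w, v in zip(weights[node], x)),
    )


def som_step(weights, x, lr, sigma):
    """One Kohonen update, in place: pull every node toward x, scaled by a
    Gaussian neighbourhood centred on the best-matching unit (BMU)."""
    bmu = best_matching_unit(weights, x)
    for node, w in weights.items():
        grid_d2 = (node[0] - bmu[0]) ** 2 + (node[1] - bmu[1]) ** 2
        h = math.exp(-grid_d2 / (2.0 * sigma ** 2))
        for i in range(len(w)):
            w[i] += lr * h * (x[i] - w[i])
    return bmu


def train_som(weights, data, epochs=20, lr0=0.5, sigma0=1.0):
    """Repeated passes with a linearly decaying learning rate and radius
    (an assumed schedule, chosen only for this sketch)."""
    for epoch in range(epochs):
        decay = 1.0 - epoch / epochs
        for x in data:
            som_step(weights, x, lr0 * decay, max(sigma0 * decay, 0.1))
    return weights


# Toy input (made up): each row stands in for one gene described by two
# ICA-reduced features.
profiles = [[0.0, 0.1], [0.1, 0.0], [0.9, 1.0], [1.0, 0.9]]
som = {(0, 0): [0.2, 0.2], (0, 1): [0.8, 0.8]}  # 1 x 2 grid, hand-picked start
train_som(som, profiles)
labels = [best_matching_unit(som, p) for p in profiles]
```

After training, `labels` holds the winning grid node for each profile, which serves as a hard cluster assignment. The multi-class assignment described in the summary would replace this single-winner step with the authors' entropy criterion, which is not reproduced here.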