Cargando…

Non-linear mapping for exploratory data analysis in functional genomics

BACKGROUND: Several supervised and unsupervised learning tools are available to classify functional genomics data. However, relatively less attention has been given to exploratory, visualisation-driven approaches. Such approaches should satisfy the following factors: Support for intuitive cluster vi...

Descripción completa

Detalles Bibliográficos
Autores principales: Azuaje, Francisco, Wang, Haiying, Chesneau, Alban
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC548129/
https://www.ncbi.nlm.nih.gov/pubmed/15661072
http://dx.doi.org/10.1186/1471-2105-6-13
_version_ 1782122320157999104
author Azuaje, Francisco
Wang, Haiying
Chesneau, Alban
author_facet Azuaje, Francisco
Wang, Haiying
Chesneau, Alban
author_sort Azuaje, Francisco
collection PubMed
description BACKGROUND: Several supervised and unsupervised learning tools are available to classify functional genomics data. However, relatively less attention has been given to exploratory, visualisation-driven approaches. Such approaches should satisfy the following factors: Support for intuitive cluster visualisation, user-friendly and robust application, computational efficiency and generation of biologically meaningful outcomes. This research assesses a relaxation method for non-linear mapping that addresses these concerns. Its applications to gene expression and protein-protein interaction data analyses are investigated RESULTS: Publicly available expression data originating from leukaemia, round blue-cell tumours and Parkinson disease studies were analysed. The method distinguished relevant clusters and critical analysis areas. The system does not require assumptions about the inherent class structure of the data, its mapping process is controlled by only one parameter and the resulting transformations offer intuitive, meaningful visual displays. Comparisons with traditional mapping models are presented. As a way of promoting potential, alternative applications of the methodology presented, an example of exploratory data analysis of interactome networks is illustrated. Data from the C. elegans interactome were analysed. Results suggest that this method might represent an effective solution for detecting key network hubs and for clustering biologically meaningful groups of proteins. CONCLUSION: A relaxation method for non-linear mapping provided the basis for visualisation-driven analyses using different types of data. This study indicates that such a system may represent a user-friendly and robust approach to exploratory data analysis. It may allow users to gain better insights into the underlying data structure, detect potential outliers and assess assumptions about the cluster composition of the data.
format Text
id pubmed-548129
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-5481292005-02-05 Non-linear mapping for exploratory data analysis in functional genomics Azuaje, Francisco Wang, Haiying Chesneau, Alban BMC Bioinformatics Methodology Article BACKGROUND: Several supervised and unsupervised learning tools are available to classify functional genomics data. However, relatively less attention has been given to exploratory, visualisation-driven approaches. Such approaches should satisfy the following factors: Support for intuitive cluster visualisation, user-friendly and robust application, computational efficiency and generation of biologically meaningful outcomes. This research assesses a relaxation method for non-linear mapping that addresses these concerns. Its applications to gene expression and protein-protein interaction data analyses are investigated RESULTS: Publicly available expression data originating from leukaemia, round blue-cell tumours and Parkinson disease studies were analysed. The method distinguished relevant clusters and critical analysis areas. The system does not require assumptions about the inherent class structure of the data, its mapping process is controlled by only one parameter and the resulting transformations offer intuitive, meaningful visual displays. Comparisons with traditional mapping models are presented. As a way of promoting potential, alternative applications of the methodology presented, an example of exploratory data analysis of interactome networks is illustrated. Data from the C. elegans interactome were analysed. Results suggest that this method might represent an effective solution for detecting key network hubs and for clustering biologically meaningful groups of proteins. CONCLUSION: A relaxation method for non-linear mapping provided the basis for visualisation-driven analyses using different types of data. This study indicates that such a system may represent a user-friendly and robust approach to exploratory data analysis. It may allow users to gain better insights into the underlying data structure, detect potential outliers and assess assumptions about the cluster composition of the data. BioMed Central 2005-01-20 /pmc/articles/PMC548129/ /pubmed/15661072 http://dx.doi.org/10.1186/1471-2105-6-13 Text en Copyright © 2005 Azuaje et al; licensee BioMed Central Ltd.
spellingShingle Methodology Article
Azuaje, Francisco
Wang, Haiying
Chesneau, Alban
Non-linear mapping for exploratory data analysis in functional genomics
title Non-linear mapping for exploratory data analysis in functional genomics
title_full Non-linear mapping for exploratory data analysis in functional genomics
title_fullStr Non-linear mapping for exploratory data analysis in functional genomics
title_full_unstemmed Non-linear mapping for exploratory data analysis in functional genomics
title_short Non-linear mapping for exploratory data analysis in functional genomics
title_sort non-linear mapping for exploratory data analysis in functional genomics
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC548129/
https://www.ncbi.nlm.nih.gov/pubmed/15661072
http://dx.doi.org/10.1186/1471-2105-6-13
work_keys_str_mv AT azuajefrancisco nonlinearmappingforexploratorydataanalysisinfunctionalgenomics
AT wanghaiying nonlinearmappingforexploratorydataanalysisinfunctionalgenomics
AT chesneaualban nonlinearmappingforexploratorydataanalysisinfunctionalgenomics