Cargando…

Integrated analysis of gene expression by association rules discovery

BACKGROUND: Microarray technology is generating huge amounts of data about the expression level of thousands of genes, or even whole genomes, across different experimental conditions. To extract biological knowledge, and to fully understand such datasets, it is essential to include external biologic...

Descripción completa

Detalles Bibliográficos
Autores principales: Carmona-Saez, Pedro, Chagoyen, Monica, Rodriguez, Andres, Trelles, Oswaldo, Carazo, Jose M, Pascual-Montano, Alberto
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1386712/
https://www.ncbi.nlm.nih.gov/pubmed/16464256
http://dx.doi.org/10.1186/1471-2105-7-54
_version_ 1782126883020734464
author Carmona-Saez, Pedro
Chagoyen, Monica
Rodriguez, Andres
Trelles, Oswaldo
Carazo, Jose M
Pascual-Montano, Alberto
author_facet Carmona-Saez, Pedro
Chagoyen, Monica
Rodriguez, Andres
Trelles, Oswaldo
Carazo, Jose M
Pascual-Montano, Alberto
author_sort Carmona-Saez, Pedro
collection PubMed
description BACKGROUND: Microarray technology is generating huge amounts of data about the expression level of thousands of genes, or even whole genomes, across different experimental conditions. To extract biological knowledge, and to fully understand such datasets, it is essential to include external biological information about genes and gene products to the analysis of expression data. However, most of the current approaches to analyze microarray datasets are mainly focused on the analysis of experimental data, and external biological information is incorporated as a posterior process. RESULTS: In this study we present a method for the integrative analysis of microarray data based on the Association Rules Discovery data mining technique. The approach integrates gene annotations and expression data to discover intrinsic associations among both data sources based on co-occurrence patterns. We applied the proposed methodology to the analysis of gene expression datasets in which genes were annotated with metabolic pathways, transcriptional regulators and Gene Ontology categories. Automatically extracted associations revealed significant relationships among these gene attributes and expression patterns, where many of them are clearly supported by recently reported work. CONCLUSION: The integration of external biological information and gene expression data can provide insights about the biological processes associated to gene expression programs. In this paper we show that the proposed methodology is able to integrate multiple gene annotations and expression data in the same analytic framework and extract meaningful associations among heterogeneous sources of data. An implementation of the method is included in the Engene software package.
format Text
id pubmed-1386712
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-13867122006-04-21 Integrated analysis of gene expression by association rules discovery Carmona-Saez, Pedro Chagoyen, Monica Rodriguez, Andres Trelles, Oswaldo Carazo, Jose M Pascual-Montano, Alberto BMC Bioinformatics Methodology Article BACKGROUND: Microarray technology is generating huge amounts of data about the expression level of thousands of genes, or even whole genomes, across different experimental conditions. To extract biological knowledge, and to fully understand such datasets, it is essential to include external biological information about genes and gene products to the analysis of expression data. However, most of the current approaches to analyze microarray datasets are mainly focused on the analysis of experimental data, and external biological information is incorporated as a posterior process. RESULTS: In this study we present a method for the integrative analysis of microarray data based on the Association Rules Discovery data mining technique. The approach integrates gene annotations and expression data to discover intrinsic associations among both data sources based on co-occurrence patterns. We applied the proposed methodology to the analysis of gene expression datasets in which genes were annotated with metabolic pathways, transcriptional regulators and Gene Ontology categories. Automatically extracted associations revealed significant relationships among these gene attributes and expression patterns, where many of them are clearly supported by recently reported work. CONCLUSION: The integration of external biological information and gene expression data can provide insights about the biological processes associated to gene expression programs. In this paper we show that the proposed methodology is able to integrate multiple gene annotations and expression data in the same analytic framework and extract meaningful associations among heterogeneous sources of data. An implementation of the method is included in the Engene software package. BioMed Central 2006-02-07 /pmc/articles/PMC1386712/ /pubmed/16464256 http://dx.doi.org/10.1186/1471-2105-7-54 Text en Copyright © 2006 Carmona-Saez et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Carmona-Saez, Pedro
Chagoyen, Monica
Rodriguez, Andres
Trelles, Oswaldo
Carazo, Jose M
Pascual-Montano, Alberto
Integrated analysis of gene expression by association rules discovery
title Integrated analysis of gene expression by association rules discovery
title_full Integrated analysis of gene expression by association rules discovery
title_fullStr Integrated analysis of gene expression by association rules discovery
title_full_unstemmed Integrated analysis of gene expression by association rules discovery
title_short Integrated analysis of gene expression by association rules discovery
title_sort integrated analysis of gene expression by association rules discovery
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1386712/
https://www.ncbi.nlm.nih.gov/pubmed/16464256
http://dx.doi.org/10.1186/1471-2105-7-54
work_keys_str_mv AT carmonasaezpedro integratedanalysisofgeneexpressionbyassociationrulesdiscovery
AT chagoyenmonica integratedanalysisofgeneexpressionbyassociationrulesdiscovery
AT rodriguezandres integratedanalysisofgeneexpressionbyassociationrulesdiscovery
AT trellesoswaldo integratedanalysisofgeneexpressionbyassociationrulesdiscovery
AT carazojosem integratedanalysisofgeneexpressionbyassociationrulesdiscovery
AT pascualmontanoalberto integratedanalysisofgeneexpressionbyassociationrulesdiscovery