Cargando…

A factor model to analyze heterogeneity in gene expression

BACKGROUND: Microarray technology allows the simultaneous analysis of thousands of genes within a single experiment. Significance analyses of transcriptomic data ignore the gene dependence structure. This leads to correlation among test statistics which affects a strong control of the false discover...

Descripción completa

Detalles Bibliográficos
Autores principales: Blum, Yuna, Le Mignon, Guillaume, Lagarrigue, Sandrine, Causeur, David
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2911460/
https://www.ncbi.nlm.nih.gov/pubmed/20598132
http://dx.doi.org/10.1186/1471-2105-11-368
_version_ 1782184470197043200
author Blum, Yuna
Le Mignon, Guillaume
Lagarrigue, Sandrine
Causeur, David
author_facet Blum, Yuna
Le Mignon, Guillaume
Lagarrigue, Sandrine
Causeur, David
author_sort Blum, Yuna
collection PubMed
description BACKGROUND: Microarray technology allows the simultaneous analysis of thousands of genes within a single experiment. Significance analyses of transcriptomic data ignore the gene dependence structure. This leads to correlation among test statistics which affects a strong control of the false discovery proportion. A recent method called FAMT allows capturing the gene dependence into factors in order to improve high-dimensional multiple testing procedures. In the subsequent analyses aiming at a functional characterization of the differentially expressed genes, our study shows how these factors can be used both to identify the components of expression heterogeneity and to give more insight into the underlying biological processes. RESULTS: The use of factors to characterize simple patterns of heterogeneity is first demonstrated on illustrative gene expression data sets. An expression data set primarily generated to map QTL for fatness in chickens is then analyzed. Contrarily to the analysis based on the raw data, a relevant functional information about a QTL region is revealed by factor-adjustment of the gene expressions. Additionally, the interpretation of the independent factors regarding known information about both experimental design and genes shows that some factors may have different and complex origins. CONCLUSIONS: As biological information and technological biases are identified in what was before simply considered as statistical noise, analyzing heterogeneity in gene expression yields a new point of view on transcriptomic data.
format Text
id pubmed-2911460
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29114602010-07-29 A factor model to analyze heterogeneity in gene expression Blum, Yuna Le Mignon, Guillaume Lagarrigue, Sandrine Causeur, David BMC Bioinformatics Methodology Article BACKGROUND: Microarray technology allows the simultaneous analysis of thousands of genes within a single experiment. Significance analyses of transcriptomic data ignore the gene dependence structure. This leads to correlation among test statistics which affects a strong control of the false discovery proportion. A recent method called FAMT allows capturing the gene dependence into factors in order to improve high-dimensional multiple testing procedures. In the subsequent analyses aiming at a functional characterization of the differentially expressed genes, our study shows how these factors can be used both to identify the components of expression heterogeneity and to give more insight into the underlying biological processes. RESULTS: The use of factors to characterize simple patterns of heterogeneity is first demonstrated on illustrative gene expression data sets. An expression data set primarily generated to map QTL for fatness in chickens is then analyzed. Contrarily to the analysis based on the raw data, a relevant functional information about a QTL region is revealed by factor-adjustment of the gene expressions. Additionally, the interpretation of the independent factors regarding known information about both experimental design and genes shows that some factors may have different and complex origins. CONCLUSIONS: As biological information and technological biases are identified in what was before simply considered as statistical noise, analyzing heterogeneity in gene expression yields a new point of view on transcriptomic data. BioMed Central 2010-07-02 /pmc/articles/PMC2911460/ /pubmed/20598132 http://dx.doi.org/10.1186/1471-2105-11-368 Text en Copyright ©2010 Blum et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Blum, Yuna
Le Mignon, Guillaume
Lagarrigue, Sandrine
Causeur, David
A factor model to analyze heterogeneity in gene expression
title A factor model to analyze heterogeneity in gene expression
title_full A factor model to analyze heterogeneity in gene expression
title_fullStr A factor model to analyze heterogeneity in gene expression
title_full_unstemmed A factor model to analyze heterogeneity in gene expression
title_short A factor model to analyze heterogeneity in gene expression
title_sort factor model to analyze heterogeneity in gene expression
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2911460/
https://www.ncbi.nlm.nih.gov/pubmed/20598132
http://dx.doi.org/10.1186/1471-2105-11-368
work_keys_str_mv AT blumyuna afactormodeltoanalyzeheterogeneityingeneexpression
AT lemignonguillaume afactormodeltoanalyzeheterogeneityingeneexpression
AT lagarriguesandrine afactormodeltoanalyzeheterogeneityingeneexpression
AT causeurdavid afactormodeltoanalyzeheterogeneityingeneexpression
AT blumyuna factormodeltoanalyzeheterogeneityingeneexpression
AT lemignonguillaume factormodeltoanalyzeheterogeneityingeneexpression
AT lagarriguesandrine factormodeltoanalyzeheterogeneityingeneexpression
AT causeurdavid factormodeltoanalyzeheterogeneityingeneexpression