Cargando…

Integrative analysis of multiple gene expression profiles with quality-adjusted effect size models

BACKGROUND: With the explosion of microarray studies, an enormous amount of data is being produced. Systematic integration of gene expression data from different sources increases statistical power of detecting differentially expressed genes and allows assessment of heterogeneity. The challenge, how...

Descripción completa

Detalles Bibliográficos
Autores principales: Hu, Pingzhao, Greenwood, Celia MT, Beyene, Joseph
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1173085/
https://www.ncbi.nlm.nih.gov/pubmed/15921507
http://dx.doi.org/10.1186/1471-2105-6-128
_version_ 1782124453026594816
author Hu, Pingzhao
Greenwood, Celia MT
Beyene, Joseph
author_facet Hu, Pingzhao
Greenwood, Celia MT
Beyene, Joseph
author_sort Hu, Pingzhao
collection PubMed
description BACKGROUND: With the explosion of microarray studies, an enormous amount of data is being produced. Systematic integration of gene expression data from different sources increases statistical power of detecting differentially expressed genes and allows assessment of heterogeneity. The challenge, however, is in designing and implementing efficient analytic methodologies for combination of data generated by different research groups. RESULTS: We extended traditional effect size models to combine information from different microarray datasets by incorporating a quality measure for each gene in each study into the effect size estimation. We illustrated our method by integrating two datasets generated using different Affymetrix oligonucleotide types. Our results indicate that the proposed quality-adjusted weighting strategy for modelling inter-study variation of gene expression profiles not only increases consistency and decreases heterogeneous results between these two datasets, but also identifies many more differentially expressed genes than methods proposed previously. CONCLUSION: Data integration and synthesis is becoming increasingly important. We live in a high-throughput era where technologies constantly change leaving behind a trail of data with different forms, shapes and sizes. Statistical and computational methodologies are therefore critical for extracting the most out of these related but not identical sources of data.
format Text
id pubmed-1173085
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-11730852005-07-07 Integrative analysis of multiple gene expression profiles with quality-adjusted effect size models Hu, Pingzhao Greenwood, Celia MT Beyene, Joseph BMC Bioinformatics Research Article BACKGROUND: With the explosion of microarray studies, an enormous amount of data is being produced. Systematic integration of gene expression data from different sources increases statistical power of detecting differentially expressed genes and allows assessment of heterogeneity. The challenge, however, is in designing and implementing efficient analytic methodologies for combination of data generated by different research groups. RESULTS: We extended traditional effect size models to combine information from different microarray datasets by incorporating a quality measure for each gene in each study into the effect size estimation. We illustrated our method by integrating two datasets generated using different Affymetrix oligonucleotide types. Our results indicate that the proposed quality-adjusted weighting strategy for modelling inter-study variation of gene expression profiles not only increases consistency and decreases heterogeneous results between these two datasets, but also identifies many more differentially expressed genes than methods proposed previously. CONCLUSION: Data integration and synthesis is becoming increasingly important. We live in a high-throughput era where technologies constantly change leaving behind a trail of data with different forms, shapes and sizes. Statistical and computational methodologies are therefore critical for extracting the most out of these related but not identical sources of data. BioMed Central 2005-05-27 /pmc/articles/PMC1173085/ /pubmed/15921507 http://dx.doi.org/10.1186/1471-2105-6-128 Text en Copyright © 2005 Hu et al; licensee BioMed Central Ltd.
spellingShingle Research Article
Hu, Pingzhao
Greenwood, Celia MT
Beyene, Joseph
Integrative analysis of multiple gene expression profiles with quality-adjusted effect size models
title Integrative analysis of multiple gene expression profiles with quality-adjusted effect size models
title_full Integrative analysis of multiple gene expression profiles with quality-adjusted effect size models
title_fullStr Integrative analysis of multiple gene expression profiles with quality-adjusted effect size models
title_full_unstemmed Integrative analysis of multiple gene expression profiles with quality-adjusted effect size models
title_short Integrative analysis of multiple gene expression profiles with quality-adjusted effect size models
title_sort integrative analysis of multiple gene expression profiles with quality-adjusted effect size models
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1173085/
https://www.ncbi.nlm.nih.gov/pubmed/15921507
http://dx.doi.org/10.1186/1471-2105-6-128
work_keys_str_mv AT hupingzhao integrativeanalysisofmultiplegeneexpressionprofileswithqualityadjustedeffectsizemodels
AT greenwoodceliamt integrativeanalysisofmultiplegeneexpressionprofileswithqualityadjustedeffectsizemodels
AT beyenejoseph integrativeanalysisofmultiplegeneexpressionprofileswithqualityadjustedeffectsizemodels