Cargando…

A statistical framework for integrating two microarray data sets in differential expression analysis

BACKGROUND: Different microarray data sets can be collected for studying the same or similar diseases. We expect to achieve a more efficient analysis of differential expression if an efficient statistical method can be developed for integrating different microarray data sets. Although many statistic...

Descripción completa

Detalles Bibliográficos
Autores principales: Lai, Yinglei, Eckenrode, Sarah E, She, Jin-Xiong
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648727/
https://www.ncbi.nlm.nih.gov/pubmed/19208123
http://dx.doi.org/10.1186/1471-2105-10-S1-S23
_version_ 1782164974047592448
author Lai, Yinglei
Eckenrode, Sarah E
She, Jin-Xiong
author_facet Lai, Yinglei
Eckenrode, Sarah E
She, Jin-Xiong
author_sort Lai, Yinglei
collection PubMed
description BACKGROUND: Different microarray data sets can be collected for studying the same or similar diseases. We expect to achieve a more efficient analysis of differential expression if an efficient statistical method can be developed for integrating different microarray data sets. Although many statistical methods have been proposed for data integration, the genome-wide concordance of different data sets has not been well considered in the analysis. RESULTS: Before considering data integration, it is necessary to evaluate the genome-wide concordance so that misleading results can be avoided. Based on the test results, different subsequent actions are suggested. The evaluation of genome-wide concordance and the data integration can be achieved based on the normal distribution based mixture models. CONCLUSION: The results from our simulation study suggest that misleading results can be generated if the genome-wide concordance issue is not appropriately considered. Our method provides a rigorous parametric solution. The results also show that our method is robust to certain model misspecification and is practically useful for the integrative analysis of differential expression.
format Text
id pubmed-2648727
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26487272009-03-03 A statistical framework for integrating two microarray data sets in differential expression analysis Lai, Yinglei Eckenrode, Sarah E She, Jin-Xiong BMC Bioinformatics Research BACKGROUND: Different microarray data sets can be collected for studying the same or similar diseases. We expect to achieve a more efficient analysis of differential expression if an efficient statistical method can be developed for integrating different microarray data sets. Although many statistical methods have been proposed for data integration, the genome-wide concordance of different data sets has not been well considered in the analysis. RESULTS: Before considering data integration, it is necessary to evaluate the genome-wide concordance so that misleading results can be avoided. Based on the test results, different subsequent actions are suggested. The evaluation of genome-wide concordance and the data integration can be achieved based on the normal distribution based mixture models. CONCLUSION: The results from our simulation study suggest that misleading results can be generated if the genome-wide concordance issue is not appropriately considered. Our method provides a rigorous parametric solution. The results also show that our method is robust to certain model misspecification and is practically useful for the integrative analysis of differential expression. BioMed Central 2009-01-30 /pmc/articles/PMC2648727/ /pubmed/19208123 http://dx.doi.org/10.1186/1471-2105-10-S1-S23 Text en Copyright © 2009 Lai et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Lai, Yinglei
Eckenrode, Sarah E
She, Jin-Xiong
A statistical framework for integrating two microarray data sets in differential expression analysis
title A statistical framework for integrating two microarray data sets in differential expression analysis
title_full A statistical framework for integrating two microarray data sets in differential expression analysis
title_fullStr A statistical framework for integrating two microarray data sets in differential expression analysis
title_full_unstemmed A statistical framework for integrating two microarray data sets in differential expression analysis
title_short A statistical framework for integrating two microarray data sets in differential expression analysis
title_sort statistical framework for integrating two microarray data sets in differential expression analysis
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648727/
https://www.ncbi.nlm.nih.gov/pubmed/19208123
http://dx.doi.org/10.1186/1471-2105-10-S1-S23
work_keys_str_mv AT laiyinglei astatisticalframeworkforintegratingtwomicroarraydatasetsindifferentialexpressionanalysis
AT eckenrodesarahe astatisticalframeworkforintegratingtwomicroarraydatasetsindifferentialexpressionanalysis
AT shejinxiong astatisticalframeworkforintegratingtwomicroarraydatasetsindifferentialexpressionanalysis
AT laiyinglei statisticalframeworkforintegratingtwomicroarraydatasetsindifferentialexpressionanalysis
AT eckenrodesarahe statisticalframeworkforintegratingtwomicroarraydatasetsindifferentialexpressionanalysis
AT shejinxiong statisticalframeworkforintegratingtwomicroarraydatasetsindifferentialexpressionanalysis