Cargando…

Identifying differential expression in multiple SAGE libraries: an overdispersed log-linear model approach

BACKGROUND: In testing for differential gene expression involving multiple serial analysis of gene expression (SAGE) libraries, it is critical to account for both between and within library variation. Several methods have been proposed, including the t test, t(w )test, and an overdispersed logistic...

Descripción completa

Detalles Bibliográficos
Autores principales: Lu, Jun, Tomfohr, John K, Kepler, Thomas B
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1189357/
https://www.ncbi.nlm.nih.gov/pubmed/15987513
http://dx.doi.org/10.1186/1471-2105-6-165
_version_ 1782124791049748480
author Lu, Jun
Tomfohr, John K
Kepler, Thomas B
author_facet Lu, Jun
Tomfohr, John K
Kepler, Thomas B
author_sort Lu, Jun
collection PubMed
description BACKGROUND: In testing for differential gene expression involving multiple serial analysis of gene expression (SAGE) libraries, it is critical to account for both between and within library variation. Several methods have been proposed, including the t test, t(w )test, and an overdispersed logistic regression approach. The merits of these tests, however, have not been fully evaluated. Questions still remain on whether further improvements can be made. RESULTS: In this article, we introduce an overdispersed log-linear model approach to analyzing SAGE; we evaluate and compare its performance with three other tests: the two-sample t test, t(w )test and another based on overdispersed logistic linear regression. Analysis of simulated and real datasets show that both the log-linear and logistic overdispersion methods generally perform better than the t and t(w )tests; the log-linear method is further found to have better performance than the logistic method, showing equal or higher statistical power over a range of parameter values and with different data distributions. CONCLUSION: Overdispersed log-linear models provide an attractive and reliable framework for analyzing SAGE experiments involving multiple libraries. For convenience, the implementation of this method is available through a user-friendly web-interface available at .
format Text
id pubmed-1189357
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-11893572005-08-25 Identifying differential expression in multiple SAGE libraries: an overdispersed log-linear model approach Lu, Jun Tomfohr, John K Kepler, Thomas B BMC Bioinformatics Methodology Article BACKGROUND: In testing for differential gene expression involving multiple serial analysis of gene expression (SAGE) libraries, it is critical to account for both between and within library variation. Several methods have been proposed, including the t test, t(w )test, and an overdispersed logistic regression approach. The merits of these tests, however, have not been fully evaluated. Questions still remain on whether further improvements can be made. RESULTS: In this article, we introduce an overdispersed log-linear model approach to analyzing SAGE; we evaluate and compare its performance with three other tests: the two-sample t test, t(w )test and another based on overdispersed logistic linear regression. Analysis of simulated and real datasets show that both the log-linear and logistic overdispersion methods generally perform better than the t and t(w )tests; the log-linear method is further found to have better performance than the logistic method, showing equal or higher statistical power over a range of parameter values and with different data distributions. CONCLUSION: Overdispersed log-linear models provide an attractive and reliable framework for analyzing SAGE experiments involving multiple libraries. For convenience, the implementation of this method is available through a user-friendly web-interface available at . BioMed Central 2005-06-29 /pmc/articles/PMC1189357/ /pubmed/15987513 http://dx.doi.org/10.1186/1471-2105-6-165 Text en Copyright © 2005 Lu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Lu, Jun
Tomfohr, John K
Kepler, Thomas B
Identifying differential expression in multiple SAGE libraries: an overdispersed log-linear model approach
title Identifying differential expression in multiple SAGE libraries: an overdispersed log-linear model approach
title_full Identifying differential expression in multiple SAGE libraries: an overdispersed log-linear model approach
title_fullStr Identifying differential expression in multiple SAGE libraries: an overdispersed log-linear model approach
title_full_unstemmed Identifying differential expression in multiple SAGE libraries: an overdispersed log-linear model approach
title_short Identifying differential expression in multiple SAGE libraries: an overdispersed log-linear model approach
title_sort identifying differential expression in multiple sage libraries: an overdispersed log-linear model approach
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1189357/
https://www.ncbi.nlm.nih.gov/pubmed/15987513
http://dx.doi.org/10.1186/1471-2105-6-165
work_keys_str_mv AT lujun identifyingdifferentialexpressioninmultiplesagelibrariesanoverdispersedloglinearmodelapproach
AT tomfohrjohnk identifyingdifferentialexpressioninmultiplesagelibrariesanoverdispersedloglinearmodelapproach
AT keplerthomasb identifyingdifferentialexpressioninmultiplesagelibrariesanoverdispersedloglinearmodelapproach