Cargando…

Basic properties and information theory of Audic-Claverie statistic for analyzing cDNA arrays

BACKGROUND: The Audic-Claverie method [1] has been and still continues to be a popular approach for detection of differentially expressed genes in the SAGE framework. The method is based on the assumption that under the null hypothesis tag counts of the same gene in two libraries come from the same...

Descripción completa

Detalles Bibliográficos
Autor principal: Tiňo, Peter
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2761412/
https://www.ncbi.nlm.nih.gov/pubmed/19775462
http://dx.doi.org/10.1186/1471-2105-10-310
_version_ 1782172834950283264
author Tiňo, Peter
author_facet Tiňo, Peter
author_sort Tiňo, Peter
collection PubMed
description BACKGROUND: The Audic-Claverie method [1] has been and still continues to be a popular approach for detection of differentially expressed genes in the SAGE framework. The method is based on the assumption that under the null hypothesis tag counts of the same gene in two libraries come from the same but unknown Poisson distribution. The problem is that each SAGE library represents only a single measurement. We ask: Given that the tag count samples from SAGE libraries are extremely limited, how useful actually is the Audic-Claverie methodology? We rigorously analyze the A-C statistic that forms a backbone of the methodology and represents our knowledge of the underlying tag generating process based on one observation. RESULTS: We show that the A-C statistic and the underlying Poisson distribution of the tag counts share the same mode structure. Moreover, the K-L divergence from the true unknown Poisson distribution to the A-C statistic is minimized when the A-C statistic is conditioned on the mode of the Poisson distribution. Most importantly, the expectation of this K-L divergence never exceeds 1/2 bit. CONCLUSION: A rigorous underpinning of the Audic-Claverie methodology has been missing. Our results constitute a rigorous argument supporting the use of Audic-Claverie method even though the SAGE libraries represent very sparse samples.
format Text
id pubmed-2761412
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27614122009-10-14 Basic properties and information theory of Audic-Claverie statistic for analyzing cDNA arrays Tiňo, Peter BMC Bioinformatics Research Article BACKGROUND: The Audic-Claverie method [1] has been and still continues to be a popular approach for detection of differentially expressed genes in the SAGE framework. The method is based on the assumption that under the null hypothesis tag counts of the same gene in two libraries come from the same but unknown Poisson distribution. The problem is that each SAGE library represents only a single measurement. We ask: Given that the tag count samples from SAGE libraries are extremely limited, how useful actually is the Audic-Claverie methodology? We rigorously analyze the A-C statistic that forms a backbone of the methodology and represents our knowledge of the underlying tag generating process based on one observation. RESULTS: We show that the A-C statistic and the underlying Poisson distribution of the tag counts share the same mode structure. Moreover, the K-L divergence from the true unknown Poisson distribution to the A-C statistic is minimized when the A-C statistic is conditioned on the mode of the Poisson distribution. Most importantly, the expectation of this K-L divergence never exceeds 1/2 bit. CONCLUSION: A rigorous underpinning of the Audic-Claverie methodology has been missing. Our results constitute a rigorous argument supporting the use of Audic-Claverie method even though the SAGE libraries represent very sparse samples. BioMed Central 2009-09-23 /pmc/articles/PMC2761412/ /pubmed/19775462 http://dx.doi.org/10.1186/1471-2105-10-310 Text en Copyright © 2009 Tiňo; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Tiňo, Peter
Basic properties and information theory of Audic-Claverie statistic for analyzing cDNA arrays
title Basic properties and information theory of Audic-Claverie statistic for analyzing cDNA arrays
title_full Basic properties and information theory of Audic-Claverie statistic for analyzing cDNA arrays
title_fullStr Basic properties and information theory of Audic-Claverie statistic for analyzing cDNA arrays
title_full_unstemmed Basic properties and information theory of Audic-Claverie statistic for analyzing cDNA arrays
title_short Basic properties and information theory of Audic-Claverie statistic for analyzing cDNA arrays
title_sort basic properties and information theory of audic-claverie statistic for analyzing cdna arrays
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2761412/
https://www.ncbi.nlm.nih.gov/pubmed/19775462
http://dx.doi.org/10.1186/1471-2105-10-310
work_keys_str_mv AT tinopeter basicpropertiesandinformationtheoryofaudicclaveriestatisticforanalyzingcdnaarrays