Cargando…

A Population Proportion approach for ranking differentially expressed genes

BACKGROUND: DNA microarrays are used to investigate differences in gene expression between two or more classes of samples. Most currently used approaches compare mean expression levels between classes and are not geared to find genes whose expression is significantly different in only a subset of sa...

Descripción completa

Detalles Bibliográficos
Autor principal: Gadgil, Mugdha
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2566584/
https://www.ncbi.nlm.nih.gov/pubmed/18801167
http://dx.doi.org/10.1186/1471-2105-9-380
_version_ 1782159953295835136
author Gadgil, Mugdha
author_facet Gadgil, Mugdha
author_sort Gadgil, Mugdha
collection PubMed
description BACKGROUND: DNA microarrays are used to investigate differences in gene expression between two or more classes of samples. Most currently used approaches compare mean expression levels between classes and are not geared to find genes whose expression is significantly different in only a subset of samples in a class. However, biological variability can lead to situations where key genes are differentially expressed in only a subset of samples. To facilitate the identification of such genes, a new method is reported. METHODS: The key difference between the Population Proportion Ranking Method (PPRM) presented here and almost all other methods currently used is in the quantification of variability. PPRM quantifies variability in terms of inter-sample ratios and can be used to calculate the relative merit of differentially expressed genes with a specified difference in expression level between at least some samples in the two classes, which at the same time have lower than a specified variability within each class. RESULTS: PPRM is tested on simulated data and on three publicly available cancer data sets. It is compared to the t test, PPST, COPA, OS, ORT and MOST using the simulated data. Under the conditions tested, it performs as well or better than the other methods tested under low intra-class variability and better than t test, PPST, COPA and OS when a gene is differentially expressed in only a subset of samples. It performs better than ORT and MOST in recognizing non differentially expressed genes with high variability in expression levels across all samples. For biological data, the success of predictor genes identified in appropriately classifying an independent sample is reported.
format Text
id pubmed-2566584
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-25665842008-10-14 A Population Proportion approach for ranking differentially expressed genes Gadgil, Mugdha BMC Bioinformatics Methodology Article BACKGROUND: DNA microarrays are used to investigate differences in gene expression between two or more classes of samples. Most currently used approaches compare mean expression levels between classes and are not geared to find genes whose expression is significantly different in only a subset of samples in a class. However, biological variability can lead to situations where key genes are differentially expressed in only a subset of samples. To facilitate the identification of such genes, a new method is reported. METHODS: The key difference between the Population Proportion Ranking Method (PPRM) presented here and almost all other methods currently used is in the quantification of variability. PPRM quantifies variability in terms of inter-sample ratios and can be used to calculate the relative merit of differentially expressed genes with a specified difference in expression level between at least some samples in the two classes, which at the same time have lower than a specified variability within each class. RESULTS: PPRM is tested on simulated data and on three publicly available cancer data sets. It is compared to the t test, PPST, COPA, OS, ORT and MOST using the simulated data. Under the conditions tested, it performs as well or better than the other methods tested under low intra-class variability and better than t test, PPST, COPA and OS when a gene is differentially expressed in only a subset of samples. It performs better than ORT and MOST in recognizing non differentially expressed genes with high variability in expression levels across all samples. For biological data, the success of predictor genes identified in appropriately classifying an independent sample is reported. BioMed Central 2008-09-18 /pmc/articles/PMC2566584/ /pubmed/18801167 http://dx.doi.org/10.1186/1471-2105-9-380 Text en Copyright © 2008 Gadgil; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Gadgil, Mugdha
A Population Proportion approach for ranking differentially expressed genes
title A Population Proportion approach for ranking differentially expressed genes
title_full A Population Proportion approach for ranking differentially expressed genes
title_fullStr A Population Proportion approach for ranking differentially expressed genes
title_full_unstemmed A Population Proportion approach for ranking differentially expressed genes
title_short A Population Proportion approach for ranking differentially expressed genes
title_sort population proportion approach for ranking differentially expressed genes
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2566584/
https://www.ncbi.nlm.nih.gov/pubmed/18801167
http://dx.doi.org/10.1186/1471-2105-9-380
work_keys_str_mv AT gadgilmugdha apopulationproportionapproachforrankingdifferentiallyexpressedgenes
AT gadgilmugdha populationproportionapproachforrankingdifferentiallyexpressedgenes