Cargando…

A weighted average difference method for detecting differentially expressed genes from microarray data

BACKGROUND: Identification of differentially expressed genes (DEGs) under different experimental conditions is an important task in many microarray studies. However, choosing which method to use for a particular application is problematic because its performance depends on the evaluation metric, the...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kadota, Koji, Nakai, Yuji, Shimizu, Kentaro
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2008
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2464587/ https://www.ncbi.nlm.nih.gov/pubmed/18578891 http://dx.doi.org/10.1186/1748-7188-3-8

_version_	1782157424934780928
author	Kadota, Koji Nakai, Yuji Shimizu, Kentaro
author_facet	Kadota, Koji Nakai, Yuji Shimizu, Kentaro
author_sort	Kadota, Koji
collection	PubMed
description	BACKGROUND: Identification of differentially expressed genes (DEGs) under different experimental conditions is an important task in many microarray studies. However, choosing which method to use for a particular application is problematic because its performance depends on the evaluation metric, the dataset, and so on. In addition, when using the Affymetrix GeneChip(® )system, researchers must select a preprocessing algorithm from a number of competing algorithms such as MAS, RMA, and DFW, for obtaining expression-level measurements. To achieve optimal performance for detecting DEGs, a suitable combination of gene selection method and preprocessing algorithm needs to be selected for a given probe-level dataset. RESULTS: We introduce a new fold-change (FC)-based method, the weighted average difference method (WAD), for ranking DEGs. It uses the average difference and relative average signal intensity so that highly expressed genes are highly ranked on the average for the different conditions. The idea is based on our observation that known or potential marker genes (or proteins) tend to have high expression levels. We compared WAD with seven other methods; average difference (AD), FC, rank products (RP), moderated t statistic (modT), significance analysis of microarrays (samT), shrinkage t statistic (shrinkT), and intensity-based moderated t statistic (ibmT). The evaluation was performed using a total of 38 different binary (two-class) probe-level datasets: two artificial "spike-in" datasets and 36 real experimental datasets. The results indicate that WAD outperforms the other methods when sensitivity and specificity are considered simultaneously: the area under the receiver operating characteristic curve for WAD was the highest on average for the 38 datasets. The gene ranking for WAD was also the most consistent when subsets of top-ranked genes produced from three different preprocessed data (MAS, RMA, and DFW) were compared. Overall, WAD performed the best for MAS-preprocessed data and the FC-based methods (AD, WAD, FC, or RP) performed well for RMA and DFW-preprocessed data. CONCLUSION: WAD is a promising alternative to existing methods for ranking DEGs with two classes. Its high performance should increase researchers' confidence in microarray analyses.
format	Text
id	pubmed-2464587
institution	National Center for Biotechnology Information
language	English
publishDate	2008
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-24645872008-07-15 A weighted average difference method for detecting differentially expressed genes from microarray data Kadota, Koji Nakai, Yuji Shimizu, Kentaro Algorithms Mol Biol Research BACKGROUND: Identification of differentially expressed genes (DEGs) under different experimental conditions is an important task in many microarray studies. However, choosing which method to use for a particular application is problematic because its performance depends on the evaluation metric, the dataset, and so on. In addition, when using the Affymetrix GeneChip(® )system, researchers must select a preprocessing algorithm from a number of competing algorithms such as MAS, RMA, and DFW, for obtaining expression-level measurements. To achieve optimal performance for detecting DEGs, a suitable combination of gene selection method and preprocessing algorithm needs to be selected for a given probe-level dataset. RESULTS: We introduce a new fold-change (FC)-based method, the weighted average difference method (WAD), for ranking DEGs. It uses the average difference and relative average signal intensity so that highly expressed genes are highly ranked on the average for the different conditions. The idea is based on our observation that known or potential marker genes (or proteins) tend to have high expression levels. We compared WAD with seven other methods; average difference (AD), FC, rank products (RP), moderated t statistic (modT), significance analysis of microarrays (samT), shrinkage t statistic (shrinkT), and intensity-based moderated t statistic (ibmT). The evaluation was performed using a total of 38 different binary (two-class) probe-level datasets: two artificial "spike-in" datasets and 36 real experimental datasets. The results indicate that WAD outperforms the other methods when sensitivity and specificity are considered simultaneously: the area under the receiver operating characteristic curve for WAD was the highest on average for the 38 datasets. The gene ranking for WAD was also the most consistent when subsets of top-ranked genes produced from three different preprocessed data (MAS, RMA, and DFW) were compared. Overall, WAD performed the best for MAS-preprocessed data and the FC-based methods (AD, WAD, FC, or RP) performed well for RMA and DFW-preprocessed data. CONCLUSION: WAD is a promising alternative to existing methods for ranking DEGs with two classes. Its high performance should increase researchers' confidence in microarray analyses. BioMed Central 2008-06-26 /pmc/articles/PMC2464587/ /pubmed/18578891 http://dx.doi.org/10.1186/1748-7188-3-8 Text en Copyright © 2008 Kadota et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Kadota, Koji Nakai, Yuji Shimizu, Kentaro A weighted average difference method for detecting differentially expressed genes from microarray data
title	A weighted average difference method for detecting differentially expressed genes from microarray data
title_full	A weighted average difference method for detecting differentially expressed genes from microarray data
title_fullStr	A weighted average difference method for detecting differentially expressed genes from microarray data
title_full_unstemmed	A weighted average difference method for detecting differentially expressed genes from microarray data
title_short	A weighted average difference method for detecting differentially expressed genes from microarray data
title_sort	weighted average difference method for detecting differentially expressed genes from microarray data
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2464587/ https://www.ncbi.nlm.nih.gov/pubmed/18578891 http://dx.doi.org/10.1186/1748-7188-3-8
work_keys_str_mv	AT kadotakoji aweightedaveragedifferencemethodfordetectingdifferentiallyexpressedgenesfrommicroarraydata AT nakaiyuji aweightedaveragedifferencemethodfordetectingdifferentiallyexpressedgenesfrommicroarraydata AT shimizukentaro aweightedaveragedifferencemethodfordetectingdifferentiallyexpressedgenesfrommicroarraydata AT kadotakoji weightedaveragedifferencemethodfordetectingdifferentiallyexpressedgenesfrommicroarraydata AT nakaiyuji weightedaveragedifferencemethodfordetectingdifferentiallyexpressedgenesfrommicroarraydata AT shimizukentaro weightedaveragedifferencemethodfordetectingdifferentiallyexpressedgenesfrommicroarraydata

A weighted average difference method for detecting differentially expressed genes from microarray data

Ejemplares similares