Cargando…

Estimating the False Discovery Rate Using Mixed Normal Distribution for Identifying Differentially Expressed Genes in Microarray Data Analysis

The recent development of DNA microarray technology allows us to measure simultaneously the expression levels of thousands of genes and to identify truly correlated genes with anticancer drug response (differentially expressed genes) from many candidate genes. Significance Analysis of Microarray (SA...

Descripción completa

Detalles Bibliográficos
Autores principales: Hirakawa, Akihiro, Sato, Yasunori, Sozu, Takashi, Hamada, Chikuma, Yoshimura, Isao
Formato: Texto
Lenguaje:English
Publicado: Libertas Academica 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2675830/
https://www.ncbi.nlm.nih.gov/pubmed/19455258
_version_ 1782166716538683392
author Hirakawa, Akihiro
Sato, Yasunori
Sozu, Takashi
Hamada, Chikuma
Yoshimura, Isao
author_facet Hirakawa, Akihiro
Sato, Yasunori
Sozu, Takashi
Hamada, Chikuma
Yoshimura, Isao
author_sort Hirakawa, Akihiro
collection PubMed
description The recent development of DNA microarray technology allows us to measure simultaneously the expression levels of thousands of genes and to identify truly correlated genes with anticancer drug response (differentially expressed genes) from many candidate genes. Significance Analysis of Microarray (SAM) is often used to estimate the false discovery rate (FDR), which is an index for optimizing the identifiability of differentially expressed genes, while the accuracy of the estimated FDR by SAM is not necessarily confirmed. We propose a new method for estimating the FDR assuming a mixed normal distribution on the test statistic and examine the performance of the proposed method and SAM using simulated data. The simulation results indicate that the accuracy of the estimated FDR by the proposed method and SAM, varied depending on the experimental conditions. We applied both methods to actual data comprised of expression levels of 12,625 genes of 10 responders and 14 non-responders to docetaxel for breast cancer. The proposed method identified 280 differentially expressed genes correlated with docetaxel response using a cut-off value for achieving FDR <0.01 to prevent false-positive genes, although 92 genes were previously thought to be correlated with docetaxel response ones.
format Text
id pubmed-2675830
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-26758302009-05-19 Estimating the False Discovery Rate Using Mixed Normal Distribution for Identifying Differentially Expressed Genes in Microarray Data Analysis Hirakawa, Akihiro Sato, Yasunori Sozu, Takashi Hamada, Chikuma Yoshimura, Isao Cancer Inform Original Research The recent development of DNA microarray technology allows us to measure simultaneously the expression levels of thousands of genes and to identify truly correlated genes with anticancer drug response (differentially expressed genes) from many candidate genes. Significance Analysis of Microarray (SAM) is often used to estimate the false discovery rate (FDR), which is an index for optimizing the identifiability of differentially expressed genes, while the accuracy of the estimated FDR by SAM is not necessarily confirmed. We propose a new method for estimating the FDR assuming a mixed normal distribution on the test statistic and examine the performance of the proposed method and SAM using simulated data. The simulation results indicate that the accuracy of the estimated FDR by the proposed method and SAM, varied depending on the experimental conditions. We applied both methods to actual data comprised of expression levels of 12,625 genes of 10 responders and 14 non-responders to docetaxel for breast cancer. The proposed method identified 280 differentially expressed genes correlated with docetaxel response using a cut-off value for achieving FDR <0.01 to prevent false-positive genes, although 92 genes were previously thought to be correlated with docetaxel response ones. Libertas Academica 2008-01-22 /pmc/articles/PMC2675830/ /pubmed/19455258 Text en © 2007 The authors. http://creativecommons.org/licenses/by/3.0 This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Original Research
Hirakawa, Akihiro
Sato, Yasunori
Sozu, Takashi
Hamada, Chikuma
Yoshimura, Isao
Estimating the False Discovery Rate Using Mixed Normal Distribution for Identifying Differentially Expressed Genes in Microarray Data Analysis
title Estimating the False Discovery Rate Using Mixed Normal Distribution for Identifying Differentially Expressed Genes in Microarray Data Analysis
title_full Estimating the False Discovery Rate Using Mixed Normal Distribution for Identifying Differentially Expressed Genes in Microarray Data Analysis
title_fullStr Estimating the False Discovery Rate Using Mixed Normal Distribution for Identifying Differentially Expressed Genes in Microarray Data Analysis
title_full_unstemmed Estimating the False Discovery Rate Using Mixed Normal Distribution for Identifying Differentially Expressed Genes in Microarray Data Analysis
title_short Estimating the False Discovery Rate Using Mixed Normal Distribution for Identifying Differentially Expressed Genes in Microarray Data Analysis
title_sort estimating the false discovery rate using mixed normal distribution for identifying differentially expressed genes in microarray data analysis
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2675830/
https://www.ncbi.nlm.nih.gov/pubmed/19455258
work_keys_str_mv AT hirakawaakihiro estimatingthefalsediscoveryrateusingmixednormaldistributionforidentifyingdifferentiallyexpressedgenesinmicroarraydataanalysis
AT satoyasunori estimatingthefalsediscoveryrateusingmixednormaldistributionforidentifyingdifferentiallyexpressedgenesinmicroarraydataanalysis
AT sozutakashi estimatingthefalsediscoveryrateusingmixednormaldistributionforidentifyingdifferentiallyexpressedgenesinmicroarraydataanalysis
AT hamadachikuma estimatingthefalsediscoveryrateusingmixednormaldistributionforidentifyingdifferentiallyexpressedgenesinmicroarraydataanalysis
AT yoshimuraisao estimatingthefalsediscoveryrateusingmixednormaldistributionforidentifyingdifferentiallyexpressedgenesinmicroarraydataanalysis