Cargando…

Bootstrapping of gene-expression data improves and controls the false discovery rate of differentially expressed genes

The ordinary-, penalized-, and bootstrap t-test, least squares and best linear unbiased prediction were compared for their false discovery rates (FDR), i.e. the fraction of falsely discovered genes, which was empirically estimated in a duplicate of the data set. The bootstrap-t-test yielded up to 80...

Descripción completa

Detalles Bibliográficos
Autores principales: Meuwissen, Theo HE, Goddard, Mike E
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2004
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2697185/
https://www.ncbi.nlm.nih.gov/pubmed/15040898
http://dx.doi.org/10.1186/1297-9686-36-2-191
_version_ 1782168300132761600
author Meuwissen, Theo HE
Goddard, Mike E
author_facet Meuwissen, Theo HE
Goddard, Mike E
author_sort Meuwissen, Theo HE
collection PubMed
description The ordinary-, penalized-, and bootstrap t-test, least squares and best linear unbiased prediction were compared for their false discovery rates (FDR), i.e. the fraction of falsely discovered genes, which was empirically estimated in a duplicate of the data set. The bootstrap-t-test yielded up to 80% lower FDRs than the alternative statistics, and its FDR was always as good as or better than any of the alternatives. Generally, the predicted FDR from the bootstrapped P-values agreed well with their empirical estimates, except when the number of mRNA samples is smaller than 16. In a cancer data set, the bootstrap-t-test discovered 200 differentially regulated genes at a FDR of 2.6%, and in a knock-out gene expression experiment 10 genes were discovered at a FDR of 3.2%. It is argued that, in the case of microarray data, control of the FDR takes sufficient account of the multiple testing, whilst being less stringent than Bonferoni-type multiple testing corrections. Extensions of the bootstrap simulations to more complicated test-statistics are discussed.
format Text
id pubmed-2697185
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26971852009-06-16 Bootstrapping of gene-expression data improves and controls the false discovery rate of differentially expressed genes Meuwissen, Theo HE Goddard, Mike E Genet Sel Evol Research The ordinary-, penalized-, and bootstrap t-test, least squares and best linear unbiased prediction were compared for their false discovery rates (FDR), i.e. the fraction of falsely discovered genes, which was empirically estimated in a duplicate of the data set. The bootstrap-t-test yielded up to 80% lower FDRs than the alternative statistics, and its FDR was always as good as or better than any of the alternatives. Generally, the predicted FDR from the bootstrapped P-values agreed well with their empirical estimates, except when the number of mRNA samples is smaller than 16. In a cancer data set, the bootstrap-t-test discovered 200 differentially regulated genes at a FDR of 2.6%, and in a knock-out gene expression experiment 10 genes were discovered at a FDR of 3.2%. It is argued that, in the case of microarray data, control of the FDR takes sufficient account of the multiple testing, whilst being less stringent than Bonferoni-type multiple testing corrections. Extensions of the bootstrap simulations to more complicated test-statistics are discussed. BioMed Central 2004-03-15 /pmc/articles/PMC2697185/ /pubmed/15040898 http://dx.doi.org/10.1186/1297-9686-36-2-191 Text en Copyright © 2004 INRA, EDP Sciences
spellingShingle Research
Meuwissen, Theo HE
Goddard, Mike E
Bootstrapping of gene-expression data improves and controls the false discovery rate of differentially expressed genes
title Bootstrapping of gene-expression data improves and controls the false discovery rate of differentially expressed genes
title_full Bootstrapping of gene-expression data improves and controls the false discovery rate of differentially expressed genes
title_fullStr Bootstrapping of gene-expression data improves and controls the false discovery rate of differentially expressed genes
title_full_unstemmed Bootstrapping of gene-expression data improves and controls the false discovery rate of differentially expressed genes
title_short Bootstrapping of gene-expression data improves and controls the false discovery rate of differentially expressed genes
title_sort bootstrapping of gene-expression data improves and controls the false discovery rate of differentially expressed genes
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2697185/
https://www.ncbi.nlm.nih.gov/pubmed/15040898
http://dx.doi.org/10.1186/1297-9686-36-2-191
work_keys_str_mv AT meuwissentheohe bootstrappingofgeneexpressiondataimprovesandcontrolsthefalsediscoveryrateofdifferentiallyexpressedgenes
AT goddardmikee bootstrappingofgeneexpressiondataimprovesandcontrolsthefalsediscoveryrateofdifferentiallyexpressedgenes