Cargando…

Kerfdr: a semi-parametric kernel-based approach to local false discovery rate estimation

BACKGROUND: The use of current high-throughput genetic, genomic and post-genomic data leads to the simultaneous evaluation of a large number of statistical hypothesis and, at the same time, to the multiple-testing problem. As an alternative to the too conservative Family-Wise Error-Rate (FWER), the...

Descripción completa

Detalles Bibliográficos
Autores principales: Guedj, Mickael, Robin, Stephane, Celisse, Alain, Nuel, Gregory
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2679733/
https://www.ncbi.nlm.nih.gov/pubmed/19291295
http://dx.doi.org/10.1186/1471-2105-10-84
_version_ 1782166916453892096
author Guedj, Mickael
Robin, Stephane
Celisse, Alain
Nuel, Gregory
author_facet Guedj, Mickael
Robin, Stephane
Celisse, Alain
Nuel, Gregory
author_sort Guedj, Mickael
collection PubMed
description BACKGROUND: The use of current high-throughput genetic, genomic and post-genomic data leads to the simultaneous evaluation of a large number of statistical hypothesis and, at the same time, to the multiple-testing problem. As an alternative to the too conservative Family-Wise Error-Rate (FWER), the False Discovery Rate (FDR) has appeared for the last ten years as more appropriate to handle this problem. However one drawback of FDR is related to a given rejection region for the considered statistics, attributing the same value to those that are close to the boundary and those that are not. As a result, the local FDR has been recently proposed to quantify the specific probability for a given null hypothesis to be true. RESULTS: In this context we present a semi-parametric approach based on kernel estimators which is applied to different high-throughput biological data such as patterns in DNA sequences, genes expression and genome-wide association studies. CONCLUSION: The proposed method has the practical advantages, over existing approaches, to consider complex heterogeneities in the alternative hypothesis, to take into account prior information (from an expert judgment or previous studies) by allowing a semi-supervised mode, and to deal with truncated distributions such as those obtained in Monte-Carlo simulations. This method has been implemented and is available through the R package kerfdr via the CRAN or at .
format Text
id pubmed-2679733
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26797332009-05-11 Kerfdr: a semi-parametric kernel-based approach to local false discovery rate estimation Guedj, Mickael Robin, Stephane Celisse, Alain Nuel, Gregory BMC Bioinformatics Methodology Article BACKGROUND: The use of current high-throughput genetic, genomic and post-genomic data leads to the simultaneous evaluation of a large number of statistical hypothesis and, at the same time, to the multiple-testing problem. As an alternative to the too conservative Family-Wise Error-Rate (FWER), the False Discovery Rate (FDR) has appeared for the last ten years as more appropriate to handle this problem. However one drawback of FDR is related to a given rejection region for the considered statistics, attributing the same value to those that are close to the boundary and those that are not. As a result, the local FDR has been recently proposed to quantify the specific probability for a given null hypothesis to be true. RESULTS: In this context we present a semi-parametric approach based on kernel estimators which is applied to different high-throughput biological data such as patterns in DNA sequences, genes expression and genome-wide association studies. CONCLUSION: The proposed method has the practical advantages, over existing approaches, to consider complex heterogeneities in the alternative hypothesis, to take into account prior information (from an expert judgment or previous studies) by allowing a semi-supervised mode, and to deal with truncated distributions such as those obtained in Monte-Carlo simulations. This method has been implemented and is available through the R package kerfdr via the CRAN or at . BioMed Central 2009-03-16 /pmc/articles/PMC2679733/ /pubmed/19291295 http://dx.doi.org/10.1186/1471-2105-10-84 Text en Copyright © 2009 Guedj et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Guedj, Mickael
Robin, Stephane
Celisse, Alain
Nuel, Gregory
Kerfdr: a semi-parametric kernel-based approach to local false discovery rate estimation
title Kerfdr: a semi-parametric kernel-based approach to local false discovery rate estimation
title_full Kerfdr: a semi-parametric kernel-based approach to local false discovery rate estimation
title_fullStr Kerfdr: a semi-parametric kernel-based approach to local false discovery rate estimation
title_full_unstemmed Kerfdr: a semi-parametric kernel-based approach to local false discovery rate estimation
title_short Kerfdr: a semi-parametric kernel-based approach to local false discovery rate estimation
title_sort kerfdr: a semi-parametric kernel-based approach to local false discovery rate estimation
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2679733/
https://www.ncbi.nlm.nih.gov/pubmed/19291295
http://dx.doi.org/10.1186/1471-2105-10-84
work_keys_str_mv AT guedjmickael kerfdrasemiparametrickernelbasedapproachtolocalfalsediscoveryrateestimation
AT robinstephane kerfdrasemiparametrickernelbasedapproachtolocalfalsediscoveryrateestimation
AT celissealain kerfdrasemiparametrickernelbasedapproachtolocalfalsediscoveryrateestimation
AT nuelgregory kerfdrasemiparametrickernelbasedapproachtolocalfalsediscoveryrateestimation