Cargando…
Kerfdr: a semi-parametric kernel-based approach to local false discovery rate estimation
BACKGROUND: The use of current high-throughput genetic, genomic and post-genomic data leads to the simultaneous evaluation of a large number of statistical hypothesis and, at the same time, to the multiple-testing problem. As an alternative to the too conservative Family-Wise Error-Rate (FWER), the...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2679733/ https://www.ncbi.nlm.nih.gov/pubmed/19291295 http://dx.doi.org/10.1186/1471-2105-10-84 |
_version_ | 1782166916453892096 |
---|---|
author | Guedj, Mickael Robin, Stephane Celisse, Alain Nuel, Gregory |
author_facet | Guedj, Mickael Robin, Stephane Celisse, Alain Nuel, Gregory |
author_sort | Guedj, Mickael |
collection | PubMed |
description | BACKGROUND: The use of current high-throughput genetic, genomic and post-genomic data leads to the simultaneous evaluation of a large number of statistical hypothesis and, at the same time, to the multiple-testing problem. As an alternative to the too conservative Family-Wise Error-Rate (FWER), the False Discovery Rate (FDR) has appeared for the last ten years as more appropriate to handle this problem. However one drawback of FDR is related to a given rejection region for the considered statistics, attributing the same value to those that are close to the boundary and those that are not. As a result, the local FDR has been recently proposed to quantify the specific probability for a given null hypothesis to be true. RESULTS: In this context we present a semi-parametric approach based on kernel estimators which is applied to different high-throughput biological data such as patterns in DNA sequences, genes expression and genome-wide association studies. CONCLUSION: The proposed method has the practical advantages, over existing approaches, to consider complex heterogeneities in the alternative hypothesis, to take into account prior information (from an expert judgment or previous studies) by allowing a semi-supervised mode, and to deal with truncated distributions such as those obtained in Monte-Carlo simulations. This method has been implemented and is available through the R package kerfdr via the CRAN or at . |
format | Text |
id | pubmed-2679733 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-26797332009-05-11 Kerfdr: a semi-parametric kernel-based approach to local false discovery rate estimation Guedj, Mickael Robin, Stephane Celisse, Alain Nuel, Gregory BMC Bioinformatics Methodology Article BACKGROUND: The use of current high-throughput genetic, genomic and post-genomic data leads to the simultaneous evaluation of a large number of statistical hypothesis and, at the same time, to the multiple-testing problem. As an alternative to the too conservative Family-Wise Error-Rate (FWER), the False Discovery Rate (FDR) has appeared for the last ten years as more appropriate to handle this problem. However one drawback of FDR is related to a given rejection region for the considered statistics, attributing the same value to those that are close to the boundary and those that are not. As a result, the local FDR has been recently proposed to quantify the specific probability for a given null hypothesis to be true. RESULTS: In this context we present a semi-parametric approach based on kernel estimators which is applied to different high-throughput biological data such as patterns in DNA sequences, genes expression and genome-wide association studies. CONCLUSION: The proposed method has the practical advantages, over existing approaches, to consider complex heterogeneities in the alternative hypothesis, to take into account prior information (from an expert judgment or previous studies) by allowing a semi-supervised mode, and to deal with truncated distributions such as those obtained in Monte-Carlo simulations. This method has been implemented and is available through the R package kerfdr via the CRAN or at . BioMed Central 2009-03-16 /pmc/articles/PMC2679733/ /pubmed/19291295 http://dx.doi.org/10.1186/1471-2105-10-84 Text en Copyright © 2009 Guedj et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Guedj, Mickael Robin, Stephane Celisse, Alain Nuel, Gregory Kerfdr: a semi-parametric kernel-based approach to local false discovery rate estimation |
title | Kerfdr: a semi-parametric kernel-based approach to local false discovery rate estimation |
title_full | Kerfdr: a semi-parametric kernel-based approach to local false discovery rate estimation |
title_fullStr | Kerfdr: a semi-parametric kernel-based approach to local false discovery rate estimation |
title_full_unstemmed | Kerfdr: a semi-parametric kernel-based approach to local false discovery rate estimation |
title_short | Kerfdr: a semi-parametric kernel-based approach to local false discovery rate estimation |
title_sort | kerfdr: a semi-parametric kernel-based approach to local false discovery rate estimation |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2679733/ https://www.ncbi.nlm.nih.gov/pubmed/19291295 http://dx.doi.org/10.1186/1471-2105-10-84 |
work_keys_str_mv | AT guedjmickael kerfdrasemiparametrickernelbasedapproachtolocalfalsediscoveryrateestimation AT robinstephane kerfdrasemiparametrickernelbasedapproachtolocalfalsediscoveryrateestimation AT celissealain kerfdrasemiparametrickernelbasedapproachtolocalfalsediscoveryrateestimation AT nuelgregory kerfdrasemiparametrickernelbasedapproachtolocalfalsediscoveryrateestimation |