Cargando…

Clipper: p-value-free FDR control on high-throughput data from two conditions

High-throughput biological data analysis commonly involves identifying features such as genes, genomic regions, and proteins, whose values differ between two conditions, from numerous features measured simultaneously. The most widely used criterion to ensure the analysis reliability is the false dis...

Descripción completa

Detalles Bibliográficos
Autores principales: Ge, Xinzhou, Chen, Yiling Elaine, Song, Dongyuan, McDermott, MeiLu, Woyshner, Kyla, Manousopoulou, Antigoni, Wang, Ning, Li, Wei, Wang, Leo D., Li, Jingyi Jessica
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8504070/
https://www.ncbi.nlm.nih.gov/pubmed/34635147
http://dx.doi.org/10.1186/s13059-021-02506-9
Descripción
Sumario:High-throughput biological data analysis commonly involves identifying features such as genes, genomic regions, and proteins, whose values differ between two conditions, from numerous features measured simultaneously. The most widely used criterion to ensure the analysis reliability is the false discovery rate (FDR), which is primarily controlled based on p-values. However, obtaining valid p-values relies on either reasonable assumptions of data distribution or large numbers of replicates under both conditions. Clipper is a general statistical framework for FDR control without relying on p-values or specific data distributions. Clipper outperforms existing methods for a broad range of applications in high-throughput data analysis. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s13059-021-02506-9).