Cargando…

Statistical Hypothesis Testing versus Machine Learning Binary Classification: Distinctions and Guidelines

Making binary decisions is a common data analytical task in scientific research and industrial applications. In data sciences, there are two related but distinct strategies: hypothesis testing and binary classification. In practice, how to choose between these two strategies can be unclear and rathe...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Jingyi Jessica, Tong, Xin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7546185/
https://www.ncbi.nlm.nih.gov/pubmed/33073257
http://dx.doi.org/10.1016/j.patter.2020.100115
Descripción
Sumario:Making binary decisions is a common data analytical task in scientific research and industrial applications. In data sciences, there are two related but distinct strategies: hypothesis testing and binary classification. In practice, how to choose between these two strategies can be unclear and rather confusing. Here, we summarize key distinctions between these two strategies in three aspects and list five practical guidelines for data analysts to choose the appropriate strategy for specific analysis needs. We demonstrate the use of those guidelines in a cancer driver gene prediction example.