Cargando…

Statistical Hypothesis Testing versus Machine Learning Binary Classification: Distinctions and Guidelines

Making binary decisions is a common data analytical task in scientific research and industrial applications. In data sciences, there are two related but distinct strategies: hypothesis testing and binary classification. In practice, how to choose between these two strategies can be unclear and rathe...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Jingyi Jessica, Tong, Xin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7546185/
https://www.ncbi.nlm.nih.gov/pubmed/33073257
http://dx.doi.org/10.1016/j.patter.2020.100115
_version_ 1783592178527764480
author Li, Jingyi Jessica
Tong, Xin
author_facet Li, Jingyi Jessica
Tong, Xin
author_sort Li, Jingyi Jessica
collection PubMed
description Making binary decisions is a common data analytical task in scientific research and industrial applications. In data sciences, there are two related but distinct strategies: hypothesis testing and binary classification. In practice, how to choose between these two strategies can be unclear and rather confusing. Here, we summarize key distinctions between these two strategies in three aspects and list five practical guidelines for data analysts to choose the appropriate strategy for specific analysis needs. We demonstrate the use of those guidelines in a cancer driver gene prediction example.
format Online
Article
Text
id pubmed-7546185
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-75461852020-10-13 Statistical Hypothesis Testing versus Machine Learning Binary Classification: Distinctions and Guidelines Li, Jingyi Jessica Tong, Xin Patterns (N Y) Perspective Making binary decisions is a common data analytical task in scientific research and industrial applications. In data sciences, there are two related but distinct strategies: hypothesis testing and binary classification. In practice, how to choose between these two strategies can be unclear and rather confusing. Here, we summarize key distinctions between these two strategies in three aspects and list five practical guidelines for data analysts to choose the appropriate strategy for specific analysis needs. We demonstrate the use of those guidelines in a cancer driver gene prediction example. Elsevier 2020-10-09 /pmc/articles/PMC7546185/ /pubmed/33073257 http://dx.doi.org/10.1016/j.patter.2020.100115 Text en © 2020 The Author(s) http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Perspective
Li, Jingyi Jessica
Tong, Xin
Statistical Hypothesis Testing versus Machine Learning Binary Classification: Distinctions and Guidelines
title Statistical Hypothesis Testing versus Machine Learning Binary Classification: Distinctions and Guidelines
title_full Statistical Hypothesis Testing versus Machine Learning Binary Classification: Distinctions and Guidelines
title_fullStr Statistical Hypothesis Testing versus Machine Learning Binary Classification: Distinctions and Guidelines
title_full_unstemmed Statistical Hypothesis Testing versus Machine Learning Binary Classification: Distinctions and Guidelines
title_short Statistical Hypothesis Testing versus Machine Learning Binary Classification: Distinctions and Guidelines
title_sort statistical hypothesis testing versus machine learning binary classification: distinctions and guidelines
topic Perspective
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7546185/
https://www.ncbi.nlm.nih.gov/pubmed/33073257
http://dx.doi.org/10.1016/j.patter.2020.100115
work_keys_str_mv AT lijingyijessica statisticalhypothesistestingversusmachinelearningbinaryclassificationdistinctionsandguidelines
AT tongxin statisticalhypothesistestingversusmachinelearningbinaryclassificationdistinctionsandguidelines