Cargando…
Statistical Hypothesis Testing versus Machine Learning Binary Classification: Distinctions and Guidelines
Making binary decisions is a common data analytical task in scientific research and industrial applications. In data sciences, there are two related but distinct strategies: hypothesis testing and binary classification. In practice, how to choose between these two strategies can be unclear and rathe...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7546185/ https://www.ncbi.nlm.nih.gov/pubmed/33073257 http://dx.doi.org/10.1016/j.patter.2020.100115 |
_version_ | 1783592178527764480 |
---|---|
author | Li, Jingyi Jessica Tong, Xin |
author_facet | Li, Jingyi Jessica Tong, Xin |
author_sort | Li, Jingyi Jessica |
collection | PubMed |
description | Making binary decisions is a common data analytical task in scientific research and industrial applications. In data sciences, there are two related but distinct strategies: hypothesis testing and binary classification. In practice, how to choose between these two strategies can be unclear and rather confusing. Here, we summarize key distinctions between these two strategies in three aspects and list five practical guidelines for data analysts to choose the appropriate strategy for specific analysis needs. We demonstrate the use of those guidelines in a cancer driver gene prediction example. |
format | Online Article Text |
id | pubmed-7546185 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-75461852020-10-13 Statistical Hypothesis Testing versus Machine Learning Binary Classification: Distinctions and Guidelines Li, Jingyi Jessica Tong, Xin Patterns (N Y) Perspective Making binary decisions is a common data analytical task in scientific research and industrial applications. In data sciences, there are two related but distinct strategies: hypothesis testing and binary classification. In practice, how to choose between these two strategies can be unclear and rather confusing. Here, we summarize key distinctions between these two strategies in three aspects and list five practical guidelines for data analysts to choose the appropriate strategy for specific analysis needs. We demonstrate the use of those guidelines in a cancer driver gene prediction example. Elsevier 2020-10-09 /pmc/articles/PMC7546185/ /pubmed/33073257 http://dx.doi.org/10.1016/j.patter.2020.100115 Text en © 2020 The Author(s) http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Perspective Li, Jingyi Jessica Tong, Xin Statistical Hypothesis Testing versus Machine Learning Binary Classification: Distinctions and Guidelines |
title | Statistical Hypothesis Testing versus Machine Learning Binary Classification: Distinctions and Guidelines |
title_full | Statistical Hypothesis Testing versus Machine Learning Binary Classification: Distinctions and Guidelines |
title_fullStr | Statistical Hypothesis Testing versus Machine Learning Binary Classification: Distinctions and Guidelines |
title_full_unstemmed | Statistical Hypothesis Testing versus Machine Learning Binary Classification: Distinctions and Guidelines |
title_short | Statistical Hypothesis Testing versus Machine Learning Binary Classification: Distinctions and Guidelines |
title_sort | statistical hypothesis testing versus machine learning binary classification: distinctions and guidelines |
topic | Perspective |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7546185/ https://www.ncbi.nlm.nih.gov/pubmed/33073257 http://dx.doi.org/10.1016/j.patter.2020.100115 |
work_keys_str_mv | AT lijingyijessica statisticalhypothesistestingversusmachinelearningbinaryclassificationdistinctionsandguidelines AT tongxin statisticalhypothesistestingversusmachinelearningbinaryclassificationdistinctionsandguidelines |