Cargando…

Hybrid Basketball Game Outcome Prediction Model by Integrating Data Mining Methods for the National Basketball Association

The sports market has grown rapidly over the last several decades. Sports outcomes prediction is an attractive sports analytic challenge as it provides useful information for operations in the sports market. In this study, a hybrid basketball game outcomes prediction scheme is developed for predicti...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Wei-Jen, Jhou, Mao-Jhen, Lee, Tian-Shyug, Lu, Chi-Jie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8073849/
https://www.ncbi.nlm.nih.gov/pubmed/33920720
http://dx.doi.org/10.3390/e23040477
_version_ 1783684224408092672
author Chen, Wei-Jen
Jhou, Mao-Jhen
Lee, Tian-Shyug
Lu, Chi-Jie
author_facet Chen, Wei-Jen
Jhou, Mao-Jhen
Lee, Tian-Shyug
Lu, Chi-Jie
author_sort Chen, Wei-Jen
collection PubMed
description The sports market has grown rapidly over the last several decades. Sports outcomes prediction is an attractive sports analytic challenge as it provides useful information for operations in the sports market. In this study, a hybrid basketball game outcomes prediction scheme is developed for predicting the final score of the National Basketball Association (NBA) games by integrating five data mining techniques, including extreme learning machine, multivariate adaptive regression splines, k-nearest neighbors, eXtreme gradient boosting (XGBoost), and stochastic gradient boosting. Designed features are generated by merging different game-lags information from fundamental basketball statistics and used in the proposed scheme. This study collected data from all the games of the NBA 2018–2019 seasons. There are 30 teams in the NBA and each team play 82 games per season. A total of 2460 NBA game data points were collected. Empirical results illustrated that the proposed hybrid basketball game prediction scheme achieves high prediction performance and identifies suitable game-lag information and relevant game features (statistics). Our findings suggested that a two-stage XGBoost model using four pieces of game-lags information achieves the best prediction performance among all competing models. The six designed features, including averaged defensive rebounds, averaged two-point field goal percentage, averaged free throw percentage, averaged offensive rebounds, averaged assists, and averaged three-point field goal attempts, from four game-lags have a greater effect on the prediction of final scores of NBA games than other game-lags. The findings of this study provide relevant insights and guidance for other team or individual sports outcomes prediction research.
format Online
Article
Text
id pubmed-8073849
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-80738492021-04-27 Hybrid Basketball Game Outcome Prediction Model by Integrating Data Mining Methods for the National Basketball Association Chen, Wei-Jen Jhou, Mao-Jhen Lee, Tian-Shyug Lu, Chi-Jie Entropy (Basel) Article The sports market has grown rapidly over the last several decades. Sports outcomes prediction is an attractive sports analytic challenge as it provides useful information for operations in the sports market. In this study, a hybrid basketball game outcomes prediction scheme is developed for predicting the final score of the National Basketball Association (NBA) games by integrating five data mining techniques, including extreme learning machine, multivariate adaptive regression splines, k-nearest neighbors, eXtreme gradient boosting (XGBoost), and stochastic gradient boosting. Designed features are generated by merging different game-lags information from fundamental basketball statistics and used in the proposed scheme. This study collected data from all the games of the NBA 2018–2019 seasons. There are 30 teams in the NBA and each team play 82 games per season. A total of 2460 NBA game data points were collected. Empirical results illustrated that the proposed hybrid basketball game prediction scheme achieves high prediction performance and identifies suitable game-lag information and relevant game features (statistics). Our findings suggested that a two-stage XGBoost model using four pieces of game-lags information achieves the best prediction performance among all competing models. The six designed features, including averaged defensive rebounds, averaged two-point field goal percentage, averaged free throw percentage, averaged offensive rebounds, averaged assists, and averaged three-point field goal attempts, from four game-lags have a greater effect on the prediction of final scores of NBA games than other game-lags. The findings of this study provide relevant insights and guidance for other team or individual sports outcomes prediction research. MDPI 2021-04-17 /pmc/articles/PMC8073849/ /pubmed/33920720 http://dx.doi.org/10.3390/e23040477 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Chen, Wei-Jen
Jhou, Mao-Jhen
Lee, Tian-Shyug
Lu, Chi-Jie
Hybrid Basketball Game Outcome Prediction Model by Integrating Data Mining Methods for the National Basketball Association
title Hybrid Basketball Game Outcome Prediction Model by Integrating Data Mining Methods for the National Basketball Association
title_full Hybrid Basketball Game Outcome Prediction Model by Integrating Data Mining Methods for the National Basketball Association
title_fullStr Hybrid Basketball Game Outcome Prediction Model by Integrating Data Mining Methods for the National Basketball Association
title_full_unstemmed Hybrid Basketball Game Outcome Prediction Model by Integrating Data Mining Methods for the National Basketball Association
title_short Hybrid Basketball Game Outcome Prediction Model by Integrating Data Mining Methods for the National Basketball Association
title_sort hybrid basketball game outcome prediction model by integrating data mining methods for the national basketball association
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8073849/
https://www.ncbi.nlm.nih.gov/pubmed/33920720
http://dx.doi.org/10.3390/e23040477
work_keys_str_mv AT chenweijen hybridbasketballgameoutcomepredictionmodelbyintegratingdataminingmethodsforthenationalbasketballassociation
AT jhoumaojhen hybridbasketballgameoutcomepredictionmodelbyintegratingdataminingmethodsforthenationalbasketballassociation
AT leetianshyug hybridbasketballgameoutcomepredictionmodelbyintegratingdataminingmethodsforthenationalbasketballassociation
AT luchijie hybridbasketballgameoutcomepredictionmodelbyintegratingdataminingmethodsforthenationalbasketballassociation