Cargando…

A big data analysis of Twitter data during premier league matches: do tweets contain information valuable for in-play forecasting of goals in football?

Data-related analysis in football increasingly benefits from Big Data approaches and machine learning methods. One relevant application of data analysis in football is forecasting, which relies on understanding and accurately modelling the process of a match. The present paper tackles two neglected...

Descripción completa

Detalles Bibliográficos
Autores principales: Wunderlich, Fabian, Memmert, Daniel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Vienna 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8714875/
https://www.ncbi.nlm.nih.gov/pubmed/34976228
http://dx.doi.org/10.1007/s13278-021-00842-z
_version_ 1784624022036676608
author Wunderlich, Fabian
Memmert, Daniel
author_facet Wunderlich, Fabian
Memmert, Daniel
author_sort Wunderlich, Fabian
collection PubMed
description Data-related analysis in football increasingly benefits from Big Data approaches and machine learning methods. One relevant application of data analysis in football is forecasting, which relies on understanding and accurately modelling the process of a match. The present paper tackles two neglected facets of forecasting in football: Forecasts on the total number of goals and in-play forecasting (forecasts based on within-match information). Sentiment analysis techniques were used to extract the information reflected in almost two million tweets from more than 400 Premier League matches. By means of wordclouds and timely analysis of several tweet-based features, the Twitter communication over the full course of matches and shortly before and after goals was visualized and systematically analysed. Moreover, several forecasting models including a random forest model have been used to obtain in-play forecasts. Results suggest that in-play forecasting of goals is highly challenging, and in-play information does not improve forecasting accuracy. An additional analysis of goals from more than 30,000 matches from the main European football leagues supports the notion that the predictive value of in-play information is highly limited compared to pre-game information. This is a relevant result for coaches, match analysts and broadcasters who should not overestimate the value of in-play information. The present study also sheds light on how the perception and behaviour of Twitter users change over the course of a football match. A main result is that the sentiment of Twitter users decreases when the match progresses, which might be caused by an unjustified high expectation of football fans before the match.
format Online
Article
Text
id pubmed-8714875
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer Vienna
record_format MEDLINE/PubMed
spelling pubmed-87148752021-12-29 A big data analysis of Twitter data during premier league matches: do tweets contain information valuable for in-play forecasting of goals in football? Wunderlich, Fabian Memmert, Daniel Soc Netw Anal Min Original Article Data-related analysis in football increasingly benefits from Big Data approaches and machine learning methods. One relevant application of data analysis in football is forecasting, which relies on understanding and accurately modelling the process of a match. The present paper tackles two neglected facets of forecasting in football: Forecasts on the total number of goals and in-play forecasting (forecasts based on within-match information). Sentiment analysis techniques were used to extract the information reflected in almost two million tweets from more than 400 Premier League matches. By means of wordclouds and timely analysis of several tweet-based features, the Twitter communication over the full course of matches and shortly before and after goals was visualized and systematically analysed. Moreover, several forecasting models including a random forest model have been used to obtain in-play forecasts. Results suggest that in-play forecasting of goals is highly challenging, and in-play information does not improve forecasting accuracy. An additional analysis of goals from more than 30,000 matches from the main European football leagues supports the notion that the predictive value of in-play information is highly limited compared to pre-game information. This is a relevant result for coaches, match analysts and broadcasters who should not overestimate the value of in-play information. The present study also sheds light on how the perception and behaviour of Twitter users change over the course of a football match. A main result is that the sentiment of Twitter users decreases when the match progresses, which might be caused by an unjustified high expectation of football fans before the match. Springer Vienna 2021-12-29 2022 /pmc/articles/PMC8714875/ /pubmed/34976228 http://dx.doi.org/10.1007/s13278-021-00842-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Original Article
Wunderlich, Fabian
Memmert, Daniel
A big data analysis of Twitter data during premier league matches: do tweets contain information valuable for in-play forecasting of goals in football?
title A big data analysis of Twitter data during premier league matches: do tweets contain information valuable for in-play forecasting of goals in football?
title_full A big data analysis of Twitter data during premier league matches: do tweets contain information valuable for in-play forecasting of goals in football?
title_fullStr A big data analysis of Twitter data during premier league matches: do tweets contain information valuable for in-play forecasting of goals in football?
title_full_unstemmed A big data analysis of Twitter data during premier league matches: do tweets contain information valuable for in-play forecasting of goals in football?
title_short A big data analysis of Twitter data during premier league matches: do tweets contain information valuable for in-play forecasting of goals in football?
title_sort big data analysis of twitter data during premier league matches: do tweets contain information valuable for in-play forecasting of goals in football?
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8714875/
https://www.ncbi.nlm.nih.gov/pubmed/34976228
http://dx.doi.org/10.1007/s13278-021-00842-z
work_keys_str_mv AT wunderlichfabian abigdataanalysisoftwitterdataduringpremierleaguematchesdotweetscontaininformationvaluableforinplayforecastingofgoalsinfootball
AT memmertdaniel abigdataanalysisoftwitterdataduringpremierleaguematchesdotweetscontaininformationvaluableforinplayforecastingofgoalsinfootball
AT wunderlichfabian bigdataanalysisoftwitterdataduringpremierleaguematchesdotweetscontaininformationvaluableforinplayforecastingofgoalsinfootball
AT memmertdaniel bigdataanalysisoftwitterdataduringpremierleaguematchesdotweetscontaininformationvaluableforinplayforecastingofgoalsinfootball