Predictive models of the performance of professional football players
Autor(a) principal: | |
---|---|
Data de Publicação: | 2021 |
Tipo de documento: | Dissertação |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10400.14/37917 |
Resumo: | The aim of this present study was to predict professional player performance, based on a set of features, including team-related ones and its effect on performance. The predictions were made for two player roles: attackers and midfielders and two distinct independent variables were used: goals total and goals assists. The dataset used corresponded to 4523 players from season 2018-2019, from ten different top European leagues. Some individual performance features were used, like passes accuracy, shots on, duels won were used, as well as some player features like age, height and weight, some club performance features like club market value and club goals total and even popularity features like google search and twitter average likes. The team-related features were calculated by taking the average of a variable for the whole team, excluding the player itself. The results showed that goals_assists_team, goals_total_midfielder_team and market_value_opponents were found to be the most important variables and statistically significant (p-value < 0.05) when predicting goals total. At the same time, goals_assists_team, passes_accuracy_midfielder_team, duels_won_defender_team and market_value_opponents were the most important team-related variables when predicting goals assists and they were all statistically significant (p-value < 0.05). Stochastic Gradient Descent Regressor was the most suitable Machine Learning (ML) model to predict goals total, with RMSE of 1.3543, whereas the Ridge Regression achieved RMSE of 1.054 to predict goals assists. Clubs and players should be aware of these team factors that affect goals and assists, to increase knowledge about the best player-team fit and therefore, improve performance. |
id |
RCAP_4d797be32001ade5d8a1f2db1ba4630d |
---|---|
oai_identifier_str |
oai:repositorio.ucp.pt:10400.14/37917 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Predictive models of the performance of professional football playersFeature importanceFeature selectionFeature significanceHyperparameter tuningPerformance prediction (goals total and goals assists)Supervised machine learning modelsTeam-related variablesDomínio/Área Científica::Ciências Sociais::Economia e GestãoThe aim of this present study was to predict professional player performance, based on a set of features, including team-related ones and its effect on performance. The predictions were made for two player roles: attackers and midfielders and two distinct independent variables were used: goals total and goals assists. The dataset used corresponded to 4523 players from season 2018-2019, from ten different top European leagues. Some individual performance features were used, like passes accuracy, shots on, duels won were used, as well as some player features like age, height and weight, some club performance features like club market value and club goals total and even popularity features like google search and twitter average likes. The team-related features were calculated by taking the average of a variable for the whole team, excluding the player itself. The results showed that goals_assists_team, goals_total_midfielder_team and market_value_opponents were found to be the most important variables and statistically significant (p-value < 0.05) when predicting goals total. At the same time, goals_assists_team, passes_accuracy_midfielder_team, duels_won_defender_team and market_value_opponents were the most important team-related variables when predicting goals assists and they were all statistically significant (p-value < 0.05). Stochastic Gradient Descent Regressor was the most suitable Machine Learning (ML) model to predict goals total, with RMSE of 1.3543, whereas the Ridge Regression achieved RMSE of 1.054 to predict goals assists. Clubs and players should be aware of these team factors that affect goals and assists, to increase knowledge about the best player-team fit and therefore, improve performance.Pekar, ViktorVeritati - Repositório Institucional da Universidade Católica PortuguesaConceição, Mafalda Teixeira Costa da2022-06-21T14:32:56Z2021-10-202021-092021-10-20T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10400.14/37917TID:202962997enginfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-07-12T17:43:26Zoai:repositorio.ucp.pt:10400.14/37917Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T18:30:53.852890Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Predictive models of the performance of professional football players |
title |
Predictive models of the performance of professional football players |
spellingShingle |
Predictive models of the performance of professional football players Conceição, Mafalda Teixeira Costa da Feature importance Feature selection Feature significance Hyperparameter tuning Performance prediction (goals total and goals assists) Supervised machine learning models Team-related variables Domínio/Área Científica::Ciências Sociais::Economia e Gestão |
title_short |
Predictive models of the performance of professional football players |
title_full |
Predictive models of the performance of professional football players |
title_fullStr |
Predictive models of the performance of professional football players |
title_full_unstemmed |
Predictive models of the performance of professional football players |
title_sort |
Predictive models of the performance of professional football players |
author |
Conceição, Mafalda Teixeira Costa da |
author_facet |
Conceição, Mafalda Teixeira Costa da |
author_role |
author |
dc.contributor.none.fl_str_mv |
Pekar, Viktor Veritati - Repositório Institucional da Universidade Católica Portuguesa |
dc.contributor.author.fl_str_mv |
Conceição, Mafalda Teixeira Costa da |
dc.subject.por.fl_str_mv |
Feature importance Feature selection Feature significance Hyperparameter tuning Performance prediction (goals total and goals assists) Supervised machine learning models Team-related variables Domínio/Área Científica::Ciências Sociais::Economia e Gestão |
topic |
Feature importance Feature selection Feature significance Hyperparameter tuning Performance prediction (goals total and goals assists) Supervised machine learning models Team-related variables Domínio/Área Científica::Ciências Sociais::Economia e Gestão |
description |
The aim of this present study was to predict professional player performance, based on a set of features, including team-related ones and its effect on performance. The predictions were made for two player roles: attackers and midfielders and two distinct independent variables were used: goals total and goals assists. The dataset used corresponded to 4523 players from season 2018-2019, from ten different top European leagues. Some individual performance features were used, like passes accuracy, shots on, duels won were used, as well as some player features like age, height and weight, some club performance features like club market value and club goals total and even popularity features like google search and twitter average likes. The team-related features were calculated by taking the average of a variable for the whole team, excluding the player itself. The results showed that goals_assists_team, goals_total_midfielder_team and market_value_opponents were found to be the most important variables and statistically significant (p-value < 0.05) when predicting goals total. At the same time, goals_assists_team, passes_accuracy_midfielder_team, duels_won_defender_team and market_value_opponents were the most important team-related variables when predicting goals assists and they were all statistically significant (p-value < 0.05). Stochastic Gradient Descent Regressor was the most suitable Machine Learning (ML) model to predict goals total, with RMSE of 1.3543, whereas the Ridge Regression achieved RMSE of 1.054 to predict goals assists. Clubs and players should be aware of these team factors that affect goals and assists, to increase knowledge about the best player-team fit and therefore, improve performance. |
publishDate |
2021 |
dc.date.none.fl_str_mv |
2021-10-20 2021-09 2021-10-20T00:00:00Z 2022-06-21T14:32:56Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10400.14/37917 TID:202962997 |
url |
http://hdl.handle.net/10400.14/37917 |
identifier_str_mv |
TID:202962997 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799132031296208896 |