Robotic soccer: behaviors for simulated humanoid robots
Autor(a) principal: | |
---|---|
Data de Publicação: | 2020 |
Tipo de documento: | Dissertação |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10773/30084 |
Resumo: | This dissertation was focused on the improvement of the goalkeeper of the FC Portugal 3D team, participant in the RoboCup 3D Soccer Simulation league. All the experiments during this work were undertaken within the league environment. This competition consists on a virtual soccer match between simulated humanoid robots. A soccer goalkeeper has as its primary mission keeping his goal clear. To accomplish that goal, he needs to save every shot targeted to its goal and that can be a very difficult challenge. The challenge of saving a shot can be divided into two different tasks: predicting the ball movement and intersecting the ball trajectory. During this work, the primary objective is the application of machine learning and reinforcement learning techniques to solve these problems. Predicting the ball movement is a matter of replicating the environment physics. So, a ball position estimator was developed following a supervised machine learning approach. Although this estimator was initially developed thinking mainly on the goalkeeper, it can also be very useful for the field players. A precise ball estimator opens the door for behaviors capable of interacting with the ball while it is moving, which is rare at the current state of the league. Intersecting the ball trajectory and saving the shots was accomplished by a generic behavior developed with a Deep Reinforcement Learning approach. This generic behavior is capable of saving different types of shots: shots coming from different positions of the field and different 3D initial velocities. The new behavior was tested and evaluated in comparison with the previous version of the goalkeeper and achieved indisputable better results. This was a huge achievement because the previous version of the goalkeeper was composed by thirteen different behaviors and a selection algorithm and it was outperformed and replaced by only one behavior. |
id |
RCAP_9f627f3575834890332636fc621630b1 |
---|---|
oai_identifier_str |
oai:ria.ua.pt:10773/30084 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Robotic soccer: behaviors for simulated humanoid robotsArtificial intelligenceMachine learningReinforcement learningDeep reinforcement learningRobotic soccerRoboCupHumanoid robotsThis dissertation was focused on the improvement of the goalkeeper of the FC Portugal 3D team, participant in the RoboCup 3D Soccer Simulation league. All the experiments during this work were undertaken within the league environment. This competition consists on a virtual soccer match between simulated humanoid robots. A soccer goalkeeper has as its primary mission keeping his goal clear. To accomplish that goal, he needs to save every shot targeted to its goal and that can be a very difficult challenge. The challenge of saving a shot can be divided into two different tasks: predicting the ball movement and intersecting the ball trajectory. During this work, the primary objective is the application of machine learning and reinforcement learning techniques to solve these problems. Predicting the ball movement is a matter of replicating the environment physics. So, a ball position estimator was developed following a supervised machine learning approach. Although this estimator was initially developed thinking mainly on the goalkeeper, it can also be very useful for the field players. A precise ball estimator opens the door for behaviors capable of interacting with the ball while it is moving, which is rare at the current state of the league. Intersecting the ball trajectory and saving the shots was accomplished by a generic behavior developed with a Deep Reinforcement Learning approach. This generic behavior is capable of saving different types of shots: shots coming from different positions of the field and different 3D initial velocities. The new behavior was tested and evaluated in comparison with the previous version of the goalkeeper and achieved indisputable better results. This was a huge achievement because the previous version of the goalkeeper was composed by thirteen different behaviors and a selection algorithm and it was outperformed and replaced by only one behavior.Esta dissertação foi dedicada à melhoria do guarda-redes da equipa FC Portugal 3D, participante da liga RoboCup 3D Soccer Simulation. Por esta razão, todas as experiências feitas durante este trabalho foram desenvolvidas no ambiente da competição. Esta competição consiste num jogo virtual de futebol entre robôs humanoides simulados. Um guarda-redes de futebol tem como principal missão não sofrer golos. Para atingir esse objetivo, precisa de ser capaz de defender qualquer remate à baliza e isso é bastante difícil. Defender um remate pode ser dividido em duas partes: prever a trajetória da bola e interseção dessa trajetória. Durante este trabalho, o principal objetivo é a utilização de técnicas de aprendizagem automática e aprendizagem por reforço para resolução destes desafios. Para prever a trajetória da bola é necessário replicar as físicas do ambiente. Por essa razão, um estimador da posição da bola foi desenvolvido seguindo uma abordagem de aprendizagem automática supervisionada. Embora este estimador tenha sido inicialmente pensado principalmente para o guarda-redes, também pode ser bastante útil para os jogadores de campo. Um estimador da posição da bola preciso, permite o desenvolvimento de comportamentos capazes de interagir com a bola em movimento, algo que é raro no atual estado da liga. Intersetar a trajetória da bola e defender os remates foi conseguido por um comportamento genérico desenvolvido com uma abordagem de Aprendizagem por Reforço Profunda. Este comportamento genérico é capaz de defender diferentes tipos de remates: remates provenientes de diferentes regiões do campo e com diferentes velocidades 3D. O novo comportamento foi testado e comparado à versão anterior do guarda-redes e obteve melhores resultados. Esta foi uma grande conquista já que a versão anterior do guarda-redes era composta por treze comportamentos diferentes e um algoritmo de seleção e foi substituída por apenas um comportamento com melhores resultados.2020-12-15T17:09:12Z2020-10-02T00:00:00Z2020-10-02info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10773/30084engResende, Francisco Manuel Gonçalves deinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-02-22T11:58:11Zoai:ria.ua.pt:10773/30084Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T03:02:17.141948Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Robotic soccer: behaviors for simulated humanoid robots |
title |
Robotic soccer: behaviors for simulated humanoid robots |
spellingShingle |
Robotic soccer: behaviors for simulated humanoid robots Resende, Francisco Manuel Gonçalves de Artificial intelligence Machine learning Reinforcement learning Deep reinforcement learning Robotic soccer RoboCup Humanoid robots |
title_short |
Robotic soccer: behaviors for simulated humanoid robots |
title_full |
Robotic soccer: behaviors for simulated humanoid robots |
title_fullStr |
Robotic soccer: behaviors for simulated humanoid robots |
title_full_unstemmed |
Robotic soccer: behaviors for simulated humanoid robots |
title_sort |
Robotic soccer: behaviors for simulated humanoid robots |
author |
Resende, Francisco Manuel Gonçalves de |
author_facet |
Resende, Francisco Manuel Gonçalves de |
author_role |
author |
dc.contributor.author.fl_str_mv |
Resende, Francisco Manuel Gonçalves de |
dc.subject.por.fl_str_mv |
Artificial intelligence Machine learning Reinforcement learning Deep reinforcement learning Robotic soccer RoboCup Humanoid robots |
topic |
Artificial intelligence Machine learning Reinforcement learning Deep reinforcement learning Robotic soccer RoboCup Humanoid robots |
description |
This dissertation was focused on the improvement of the goalkeeper of the FC Portugal 3D team, participant in the RoboCup 3D Soccer Simulation league. All the experiments during this work were undertaken within the league environment. This competition consists on a virtual soccer match between simulated humanoid robots. A soccer goalkeeper has as its primary mission keeping his goal clear. To accomplish that goal, he needs to save every shot targeted to its goal and that can be a very difficult challenge. The challenge of saving a shot can be divided into two different tasks: predicting the ball movement and intersecting the ball trajectory. During this work, the primary objective is the application of machine learning and reinforcement learning techniques to solve these problems. Predicting the ball movement is a matter of replicating the environment physics. So, a ball position estimator was developed following a supervised machine learning approach. Although this estimator was initially developed thinking mainly on the goalkeeper, it can also be very useful for the field players. A precise ball estimator opens the door for behaviors capable of interacting with the ball while it is moving, which is rare at the current state of the league. Intersecting the ball trajectory and saving the shots was accomplished by a generic behavior developed with a Deep Reinforcement Learning approach. This generic behavior is capable of saving different types of shots: shots coming from different positions of the field and different 3D initial velocities. The new behavior was tested and evaluated in comparison with the previous version of the goalkeeper and achieved indisputable better results. This was a huge achievement because the previous version of the goalkeeper was composed by thirteen different behaviors and a selection algorithm and it was outperformed and replaced by only one behavior. |
publishDate |
2020 |
dc.date.none.fl_str_mv |
2020-12-15T17:09:12Z 2020-10-02T00:00:00Z 2020-10-02 |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10773/30084 |
url |
http://hdl.handle.net/10773/30084 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799137678391771136 |