Reinforcement Learning in the Navigation of Mobile Robots

Alves, Diogo António Ferreira Temporão

Reinforcement Learning in the Navigation of Mobile Robots

Detalhes bibliográficos
Autor(a) principal:	Alves, Diogo António Ferreira Temporão
Data de Publicação:	2019
Tipo de documento:	Dissertação
Idioma:	eng
Título da fonte:	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo:	http://hdl.handle.net/10316/87924
Resumo:	Dissertação de Mestrado Integrado em Engenharia Electrotécnica e de Computadores apresentada à Faculdade de Ciências e Tecnologia

Metadados do item

id	RCAP_b2909ad9b1b0dd639a5f41b92aa2632f
oai_identifier_str	oai:estudogeral.uc.pt:10316/87924
network_acronym_str	RCAP
network_name_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str	7160
spelling	Reinforcement Learning in the Navigation of Mobile RobotsAprendizagem por reforço na navegação de robôs móveisNavegaçãoPlaneamentoReinforcement LearningRecompensasAçõesNavigationPlanningReinforcement LearningRewardsActionsDissertação de Mestrado Integrado em Engenharia Electrotécnica e de Computadores apresentada à Faculdade de Ciências e TecnologiaCom o passar do tempo, a ideia de que os robôs desempenham unicamente papeis ligados ao sector industrial tem vindo a desaparecer. Atualmente, na sociedade, existe uma forte integração de robôs com o objetivo de auxiliar/melhorar a execução de determinadas tarefas.Desta forma, os robôs podem ser vistos como ferramentas essenciais no nosso quotidiano, em diversas áreas como medicina, educação, ou a nível pessoal.Esta dissertação de Mestrado tem como objetivo principal desenvolver e implementar um novo método de navegação local para robôs móveis tendo por base aprendizagem por reforço Reinforcement Learning. Este método permite que plataformas móveis virtuais ou reais como InterBot-Social Robot, desenvolvida no Instituto de Sistemas e Robótica (ISR), siga um caminho de forma a navegar de um local A para B. O método consiste em dois estágios: estágio de treino e estágio online. O estágio de treino consiste em o robô aprender a seguir um caminho previamente definido. Este estágio é realizado num ambiente de simulação, permitindo uma total liberdade no desenvolvimento e aperfeiçoamento do método. Através do treino é obtido um modelo que é utilizado no estágio online permitindo que uma plataforma móvel, num ambiente de simulação, se mova ao longo de um caminho evitando obstáculos. Uma conjunto de testes e experiências foram feitos em diferentes cenários. Diferentes testes como limitar o número de ações disponivéis, alterar o tipo de representação do caminho (definido por segmentos de reta ou splines cúbicos) e introduzir obstáculos perto do caminho. O método desenvolvido apresenta resultados promissores para caminhos com e sem obstáculos. Quando há limitação no número das ações o comportamento do robô é bastante instável embora consiga comprir o ojetivo pretendido.Over time, the idea that robots only carry out roles related to the industrial sector has been disappearing. Today, in society, there is a strong integration of robots in order to help/improve the execution of certain tasks.As a result, robots can be seen as essential tools in our daily lives, in many areas such as medicine, education, or at a personal level.The main objective of this Master's dissertation is to develop and implement a new local navigation method for mobile robots based on Reinforcement Learning. This method enables virtual or real mobile platforms such as InterBot-Social Robot, developed at the Institute of Systems and Robotics (ISR), to follow a path to navigate from location A to B. The method consists of two stages: training stage and online stage. The training stage consists in the robot learning to follow a previously defined path. This stage is performed in a simulation environment, providing total freedom in the development and improvement of the method. Through the training, a model is obtained and is used in the online stage enabling a mobile platform, in a virtual environment, to move along a path avoiding obstacles. A set of tests and experiments were performed in different scenarios. Different tests such as limiting the number of available actions, changing the type of path representation (defined by line segments or cubic cubic splines) and introducing obstacles near the path. The method developed presents promising results for paths with and without obstacles. When there is a limitation in the number of actions, the robot's behavior is quite unstable, although it can accomplish the desired objective.2019-09-25info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesishttp://hdl.handle.net/10316/87924http://hdl.handle.net/10316/87924TID:202306100engAlves, Diogo António Ferreira Temporãoinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2021-06-01T11:47:06Zoai:estudogeral.uc.pt:10316/87924Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T21:08:44.482789Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv	Reinforcement Learning in the Navigation of Mobile Robots Aprendizagem por reforço na navegação de robôs móveis
title	Reinforcement Learning in the Navigation of Mobile Robots
spellingShingle	Reinforcement Learning in the Navigation of Mobile Robots Alves, Diogo António Ferreira Temporão Navegação Planeamento Reinforcement Learning Recompensas Ações Navigation Planning Reinforcement Learning Rewards Actions
title_short	Reinforcement Learning in the Navigation of Mobile Robots
title_full	Reinforcement Learning in the Navigation of Mobile Robots
title_fullStr	Reinforcement Learning in the Navigation of Mobile Robots
title_full_unstemmed	Reinforcement Learning in the Navigation of Mobile Robots
title_sort	Reinforcement Learning in the Navigation of Mobile Robots
author	Alves, Diogo António Ferreira Temporão
author_facet	Alves, Diogo António Ferreira Temporão
author_role	author
dc.contributor.author.fl_str_mv	Alves, Diogo António Ferreira Temporão
dc.subject.por.fl_str_mv	Navegação Planeamento Reinforcement Learning Recompensas Ações Navigation Planning Reinforcement Learning Rewards Actions
topic	Navegação Planeamento Reinforcement Learning Recompensas Ações Navigation Planning Reinforcement Learning Rewards Actions
description	Dissertação de Mestrado Integrado em Engenharia Electrotécnica e de Computadores apresentada à Faculdade de Ciências e Tecnologia
publishDate	2019
dc.date.none.fl_str_mv	2019-09-25
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://hdl.handle.net/10316/87924 http://hdl.handle.net/10316/87924 TID:202306100
url	http://hdl.handle.net/10316/87924
identifier_str_mv	TID:202306100
dc.language.iso.fl_str_mv	eng
language	eng
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.source.none.fl_str_mv	reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP
instname_str	Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str	RCAAP
institution	RCAAP
reponame_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_	1799133980798222336

Reinforcement Learning in the Navigation of Mobile Robots

Registros relacionados