Compulsory Flow Q-Learning: an RL algorithm for robot navigation based on partial-policy and macro-states

Silva,Valdinei Freire da; Costa,Anna Helena Reali

Compulsory Flow Q-Learning: an RL algorithm for robot navigation based on partial-policy and macro-states

Detalhes bibliográficos
Autor(a) principal:	Silva,Valdinei Freire da
Data de Publicação:	2009
Outros Autores:	Costa,Anna Helena Reali
Tipo de documento:	Artigo
Idioma:	eng
Título da fonte:	Journal of the Brazilian Computer Society
Texto Completo:	http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0104-65002009000300007
Resumo:	Reinforcement Learning is carried out on-line, through trial-and-error interactions of the agent with the environment, which can be very time consuming when considering robots. In this paper we contribute a new learning algorithm, CFQ-Learning, which uses macro-states, a low-resolution discretisation of the state space, and a partial-policy to get around obstacles, both of them based on the complexity of the environment structure. The use of macro-states avoids convergence of algorithms, but can accelerate the learning process. In the other hand, partial-policies can guarantee that an agent fulfils its task, even through macro-state. Experiments show that the CFQ-Learning performs a good balance between policy quality and learning rate.

Metadados do item

id	UFRGS-28_a6c5a4118006b6821211906a8a1f0699
oai_identifier_str	oai:scielo:S0104-65002009000300007
network_acronym_str	UFRGS-28
network_name_str	Journal of the Brazilian Computer Society
repository_id_str
spelling	Compulsory Flow Q-Learning: an RL algorithm for robot navigation based on partial-policy and macro-statesmachine learningreinforcement learningabstractionpartial-policymacro-statesReinforcement Learning is carried out on-line, through trial-and-error interactions of the agent with the environment, which can be very time consuming when considering robots. In this paper we contribute a new learning algorithm, CFQ-Learning, which uses macro-states, a low-resolution discretisation of the state space, and a partial-policy to get around obstacles, both of them based on the complexity of the environment structure. The use of macro-states avoids convergence of algorithms, but can accelerate the learning process. In the other hand, partial-policies can guarantee that an agent fulfils its task, even through macro-state. Experiments show that the CFQ-Learning performs a good balance between policy quality and learning rate.Sociedade Brasileira de Computação2009-09-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersiontext/htmlhttp://old.scielo.br/scielo.php?script=sci_arttext&pid=S0104-65002009000300007Journal of the Brazilian Computer Society v.15 n.3 2009reponame:Journal of the Brazilian Computer Societyinstname:Sociedade Brasileira de Computação (SBC)instacron:UFRGS10.1007/BF03194507info:eu-repo/semantics/openAccessSilva,Valdinei Freire daCosta,Anna Helena Realieng2009-12-17T00:00:00Zoai:scielo:S0104-65002009000300007Revistahttps://journal-bcs.springeropen.com/PUBhttps://old.scielo.br/oai/scielo-oai.phpjbcs@icmc.sc.usp.br1678-48040104-6500opendoar:2009-12-17T00:00Journal of the Brazilian Computer Society - Sociedade Brasileira de Computação (SBC)false
dc.title.none.fl_str_mv	Compulsory Flow Q-Learning: an RL algorithm for robot navigation based on partial-policy and macro-states
title	Compulsory Flow Q-Learning: an RL algorithm for robot navigation based on partial-policy and macro-states
spellingShingle	Compulsory Flow Q-Learning: an RL algorithm for robot navigation based on partial-policy and macro-states Silva,Valdinei Freire da machine learning reinforcement learning abstraction partial-policy macro-states
title_short	Compulsory Flow Q-Learning: an RL algorithm for robot navigation based on partial-policy and macro-states
title_full	Compulsory Flow Q-Learning: an RL algorithm for robot navigation based on partial-policy and macro-states
title_fullStr	Compulsory Flow Q-Learning: an RL algorithm for robot navigation based on partial-policy and macro-states
title_full_unstemmed	Compulsory Flow Q-Learning: an RL algorithm for robot navigation based on partial-policy and macro-states
title_sort	Compulsory Flow Q-Learning: an RL algorithm for robot navigation based on partial-policy and macro-states
author	Silva,Valdinei Freire da
author_facet	Silva,Valdinei Freire da Costa,Anna Helena Reali
author_role	author
author2	Costa,Anna Helena Reali
author2_role	author
dc.contributor.author.fl_str_mv	Silva,Valdinei Freire da Costa,Anna Helena Reali
dc.subject.por.fl_str_mv	machine learning reinforcement learning abstraction partial-policy macro-states
topic	machine learning reinforcement learning abstraction partial-policy macro-states
description	Reinforcement Learning is carried out on-line, through trial-and-error interactions of the agent with the environment, which can be very time consuming when considering robots. In this paper we contribute a new learning algorithm, CFQ-Learning, which uses macro-states, a low-resolution discretisation of the state space, and a partial-policy to get around obstacles, both of them based on the complexity of the environment structure. The use of macro-states avoids convergence of algorithms, but can accelerate the learning process. In the other hand, partial-policies can guarantee that an agent fulfils its task, even through macro-state. Experiments show that the CFQ-Learning performs a good balance between policy quality and learning rate.
publishDate	2009
dc.date.none.fl_str_mv	2009-09-01
dc.type.driver.fl_str_mv	info:eu-repo/semantics/article
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
format	article
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0104-65002009000300007
url	http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0104-65002009000300007
dc.language.iso.fl_str_mv	eng
language	eng
dc.relation.none.fl_str_mv	10.1007/BF03194507
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	text/html
dc.publisher.none.fl_str_mv	Sociedade Brasileira de Computação
publisher.none.fl_str_mv	Sociedade Brasileira de Computação
dc.source.none.fl_str_mv	Journal of the Brazilian Computer Society v.15 n.3 2009 reponame:Journal of the Brazilian Computer Society instname:Sociedade Brasileira de Computação (SBC) instacron:UFRGS
instname_str	Sociedade Brasileira de Computação (SBC)
instacron_str	UFRGS
institution	UFRGS
reponame_str	Journal of the Brazilian Computer Society
collection	Journal of the Brazilian Computer Society
repository.name.fl_str_mv	Journal of the Brazilian Computer Society - Sociedade Brasileira de Computação (SBC)
repository.mail.fl_str_mv	jbcs@icmc.sc.usp.br
_version_	1754734670005665792

Compulsory Flow Q-Learning: an RL algorithm for robot navigation based on partial-policy and macro-states

Registros relacionados