Go-Ahead: melhorando heurísticas prior-knowledge através de informações extraídas das simulações play-out

Santos, Gabriel Machado

Go-Ahead: melhorando heurísticas prior-knowledge através de informações extraídas das simulações play-out

Detalhes bibliográficos
Autor(a) principal:	Santos, Gabriel Machado
Data de Publicação:	2015
Tipo de documento:	Dissertação
Idioma:	por
Título da fonte:	Repositório Institucional da UFU
Texto Completo:	https://repositorio.ufu.br/handle/123456789/38758 http://doi.org/10.14393/ufu.di.2023.7078
Resumo:	Despite being a very ancient game, probably originated in China about 2000 BCE, the game of Go is one of the greatest challenges in the Ąeld of ArtiĄcial Intelligence. In this thesis is described the agent Go-Ahead: an automatic Go player that uses a new technique to improve the accuracy of the pre-estimated values of the moves which are candidate to be introduced into the classical Monte Carlo Tree Search (MCTS) algorithm used by many current top agents for Go. Go-Ahead is built upon the framework of one of these agents: the well known open- source automatic player Fuego, in which these pre-estimated values are obtained by means of a heuristic called prior-knowledge. Go-Ahead copes with the task of reĄning the calculus of these values through a new technique that performs a balanced combination between the prior-knowledge heuristic and some relevant information retrieved from the numerous play-out simulation phases that are repeatedly executed throughout the Monte Carlo search. With such a strategy, Go-Ahead provides two distinct contributions: Ąrst, it enables the agent to enhance the process of choosing appropriate moves. Second, the balancing in the combination of the prior-knowledge and the play-out information - which is obtained by means of an adjustable parameter - represents an interesting alternative to attenuate the supervised character of the calculus of the node evaluations in MCTS based agents, since it allows to reduce the impact of the prior-knowledge heuristic by strengthening the impact of this information. The results obtained in tournaments against Fuego conĄrm the beneĄts and the con- tributions provided by this approach.

Metadados do item

id	UFU_6f42ce83f8d13e692f116cc84dbe0f0e
oai_identifier_str	oai:repositorio.ufu.br:123456789/38758
network_acronym_str	UFU
network_name_str	Repositório Institucional da UFU
repository_id_str
spelling	Go-Ahead: melhorando heurísticas prior-knowledge através de informações extraídas das simulações play-outGo-Ahead: improving prior-knowledge heuristics through Information extracted from play-out simulationsComputer-GoMCTS GoPrior-KnowledgePlay-OutsGoMCTSCNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAOComputaçãoSimulação (Computadores digitais)Jogos eletrônicosInteligência artificialDespite being a very ancient game, probably originated in China about 2000 BCE, the game of Go is one of the greatest challenges in the Ąeld of ArtiĄcial Intelligence. In this thesis is described the agent Go-Ahead: an automatic Go player that uses a new technique to improve the accuracy of the pre-estimated values of the moves which are candidate to be introduced into the classical Monte Carlo Tree Search (MCTS) algorithm used by many current top agents for Go. Go-Ahead is built upon the framework of one of these agents: the well known open- source automatic player Fuego, in which these pre-estimated values are obtained by means of a heuristic called prior-knowledge. Go-Ahead copes with the task of reĄning the calculus of these values through a new technique that performs a balanced combination between the prior-knowledge heuristic and some relevant information retrieved from the numerous play-out simulation phases that are repeatedly executed throughout the Monte Carlo search. With such a strategy, Go-Ahead provides two distinct contributions: Ąrst, it enables the agent to enhance the process of choosing appropriate moves. Second, the balancing in the combination of the prior-knowledge and the play-out information - which is obtained by means of an adjustable parameter - represents an interesting alternative to attenuate the supervised character of the calculus of the node evaluations in MCTS based agents, since it allows to reduce the impact of the prior-knowledge heuristic by strengthening the impact of this information. The results obtained in tournaments against Fuego conĄrm the beneĄts and the con- tributions provided by this approach.Dissertação (Mestrado)Apesar de muito antigo, originado provavelmente na China há 4000 anos atrás, o jogo de Go é um dos maiores desaĄos na área de Inteligência ArtiĄcial. Neste trabalho de Mestrado é descrito o agente Go-Ahead: um jogador automático para o jogo de Go que utiliza uma técnica inovadora a Ąm de melhorar a acuidade dos valores pré-estimados para movimentos candidatos a serem introduzidos na clássica árvore de busca Monte Carlo (MCTS) utilizada por vários agentes de ponta na cena de Computer-Go. Go-Ahead foi desenvolvido sobre o conjunto de bibliotecas de um desses agentes: o conhecido jogador automático de código aberto Fuego, no qual tais valores pré-estimados são obtidos através de uma heurística chamada Prior-Knowledge. Go-Ahead contribui para a redeĄnição da função que gera tais estimativas através de uma técnica que realiza uma combinação balanceada entre a heurística Prior-Knowledge e conhecimento relevante extraído das inúmeras simulações Play-Out que são repetidamente executadas durante o processo de busca do algoritmo MCTS. Com tal estratégia, Go-Ahead provê duas contribuições distintas: primeiramente, apri- mora a habilidade do agente no processo de escolha de movimentos apropriados; como segunda contribuição, através do balanço efetuado na combinação do Prior-Knowledge com o conhecimento extraído dos Play-Outs, - o qual é obtido através de um parâmetro ajustável - provê uma alternativa interessante para a atenuação do caráter supervisionado do processo de pré-avaliação de nós inerente aos agentes baseados no MCTS. Tal ganho é resultado da redução do impacto das heurísticas de Prior-Knowledge possibilitado pela inserção de novos conhecimentos recuperados durante a busca. Os resultados obtidos em torneios contra o agente Fuego conĄrmam os benefícios e as contribuições oferecidas através desta abordagem.Universidade Federal de UberlândiaBrasilPrograma de Pós-graduação em Ciência da ComputaçãoJulia, Rita Maria da Silvahttp://lattes.cnpq.br/8032993126633250Oliveira, Gina Maira Barbosa dehttp://lattes.cnpq.br/7119433066704111Chaimowicz, Luizhttp://lattes.cnpq.br/4499928813481251Santos, Gabriel Machado2023-07-26T13:05:14Z2023-07-26T13:05:14Z2015-04-24info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfSANTOS, Gabriel Machado. Go-Ahead: melhorando heurísticas prior-knowledge através de informações extraídas das simulações play-out. 2015. 104 f. Dissertação (Mestrado em Ciência da Computação) - Universidade Federal de Uberlândia, Uberlândia, 2015. DOI http://doi.org/10.14393/ufu.di.2023.7078.https://repositorio.ufu.br/handle/123456789/38758http://doi.org/10.14393/ufu.di.2023.7078porinfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da UFUinstname:Universidade Federal de Uberlândia (UFU)instacron:UFU2023-10-17T18:10:41Zoai:repositorio.ufu.br:123456789/38758Repositório InstitucionalONGhttp://repositorio.ufu.br/oai/requestdiinf@dirbi.ufu.bropendoar:2023-10-17T18:10:41Repositório Institucional da UFU - Universidade Federal de Uberlândia (UFU)false
dc.title.none.fl_str_mv	Go-Ahead: melhorando heurísticas prior-knowledge através de informações extraídas das simulações play-out Go-Ahead: improving prior-knowledge heuristics through Information extracted from play-out simulations
title	Go-Ahead: melhorando heurísticas prior-knowledge através de informações extraídas das simulações play-out
spellingShingle	Go-Ahead: melhorando heurísticas prior-knowledge através de informações extraídas das simulações play-out Santos, Gabriel Machado Computer-Go MCTS Go Prior-Knowledge Play-Outs Go MCTS CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO Computação Simulação (Computadores digitais) Jogos eletrônicos Inteligência artificial
title_short	Go-Ahead: melhorando heurísticas prior-knowledge através de informações extraídas das simulações play-out
title_full	Go-Ahead: melhorando heurísticas prior-knowledge através de informações extraídas das simulações play-out
title_fullStr	Go-Ahead: melhorando heurísticas prior-knowledge através de informações extraídas das simulações play-out
title_full_unstemmed	Go-Ahead: melhorando heurísticas prior-knowledge através de informações extraídas das simulações play-out
title_sort	Go-Ahead: melhorando heurísticas prior-knowledge através de informações extraídas das simulações play-out
author	Santos, Gabriel Machado
author_facet	Santos, Gabriel Machado
author_role	author
dc.contributor.none.fl_str_mv	Julia, Rita Maria da Silva http://lattes.cnpq.br/8032993126633250 Oliveira, Gina Maira Barbosa de http://lattes.cnpq.br/7119433066704111 Chaimowicz, Luiz http://lattes.cnpq.br/4499928813481251
dc.contributor.author.fl_str_mv	Santos, Gabriel Machado
dc.subject.por.fl_str_mv	Computer-Go MCTS Go Prior-Knowledge Play-Outs Go MCTS CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO Computação Simulação (Computadores digitais) Jogos eletrônicos Inteligência artificial
topic	Computer-Go MCTS Go Prior-Knowledge Play-Outs Go MCTS CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO Computação Simulação (Computadores digitais) Jogos eletrônicos Inteligência artificial
description	Despite being a very ancient game, probably originated in China about 2000 BCE, the game of Go is one of the greatest challenges in the Ąeld of ArtiĄcial Intelligence. In this thesis is described the agent Go-Ahead: an automatic Go player that uses a new technique to improve the accuracy of the pre-estimated values of the moves which are candidate to be introduced into the classical Monte Carlo Tree Search (MCTS) algorithm used by many current top agents for Go. Go-Ahead is built upon the framework of one of these agents: the well known open- source automatic player Fuego, in which these pre-estimated values are obtained by means of a heuristic called prior-knowledge. Go-Ahead copes with the task of reĄning the calculus of these values through a new technique that performs a balanced combination between the prior-knowledge heuristic and some relevant information retrieved from the numerous play-out simulation phases that are repeatedly executed throughout the Monte Carlo search. With such a strategy, Go-Ahead provides two distinct contributions: Ąrst, it enables the agent to enhance the process of choosing appropriate moves. Second, the balancing in the combination of the prior-knowledge and the play-out information - which is obtained by means of an adjustable parameter - represents an interesting alternative to attenuate the supervised character of the calculus of the node evaluations in MCTS based agents, since it allows to reduce the impact of the prior-knowledge heuristic by strengthening the impact of this information. The results obtained in tournaments against Fuego conĄrm the beneĄts and the con- tributions provided by this approach.
publishDate	2015
dc.date.none.fl_str_mv	2015-04-24 2023-07-26T13:05:14Z 2023-07-26T13:05:14Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	SANTOS, Gabriel Machado. Go-Ahead: melhorando heurísticas prior-knowledge através de informações extraídas das simulações play-out. 2015. 104 f. Dissertação (Mestrado em Ciência da Computação) - Universidade Federal de Uberlândia, Uberlândia, 2015. DOI http://doi.org/10.14393/ufu.di.2023.7078. https://repositorio.ufu.br/handle/123456789/38758 http://doi.org/10.14393/ufu.di.2023.7078
identifier_str_mv	SANTOS, Gabriel Machado. Go-Ahead: melhorando heurísticas prior-knowledge através de informações extraídas das simulações play-out. 2015. 104 f. Dissertação (Mestrado em Ciência da Computação) - Universidade Federal de Uberlândia, Uberlândia, 2015. DOI http://doi.org/10.14393/ufu.di.2023.7078.
url	https://repositorio.ufu.br/handle/123456789/38758 http://doi.org/10.14393/ufu.di.2023.7078
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Universidade Federal de Uberlândia Brasil Programa de Pós-graduação em Ciência da Computação
publisher.none.fl_str_mv	Universidade Federal de Uberlândia Brasil Programa de Pós-graduação em Ciência da Computação
dc.source.none.fl_str_mv	reponame:Repositório Institucional da UFU instname:Universidade Federal de Uberlândia (UFU) instacron:UFU
instname_str	Universidade Federal de Uberlândia (UFU)
instacron_str	UFU
institution	UFU
reponame_str	Repositório Institucional da UFU
collection	Repositório Institucional da UFU
repository.name.fl_str_mv	Repositório Institucional da UFU - Universidade Federal de Uberlândia (UFU)
repository.mail.fl_str_mv	diinf@dirbi.ufu.br
_version_	1813711565317210112

Go-Ahead: melhorando heurísticas prior-knowledge através de informações extraídas das simulações play-out

Registros relacionados