An algorithm for cooperative probabilistic control design

Barão, Miguel

Bibliographic details
Main author:	Barão, Miguel
Publication date:	2012
Document type:	Article
Language:	eng
Source title:	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Full text:	http://hdl.handle.net/10174/8089 https://doi.org/10.1109/MED.2012.6265795
Abstract:	This paper deals with decentralized closed-loop control in a purely probabilistic framework. In this framework, a system is a controlled Markov chain whose transition probabilities depend on the actions of the agents. The agents are also described probabilistically. The objective is to drive the system so that the joint state and the agents' actions are close to a set of given target probability distributions. The Kullback-Leibler divergence is used as the performance measure. The resulting algorithm uses dynamic programming interleaved with an iterative process that computes the behavior of each agent.
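
As a rough illustration of the performance measure named in the abstract, the sketch below evaluates the Kullback-Leibler divergence between a closed-loop joint distribution and a target one for a toy single-agent, single-step problem. It is not the paper's algorithm: the function names, the two-state/two-action model, and every number are assumptions made up for this example, and the dynamic programming and per-agent iteration are not shown.

```python
import math


def kl_divergence(p, q):
    """Return D(p || q) = sum_i p_i * log(p_i / q_i) for discrete distributions."""
    total = 0.0
    for p_i, q_i in zip(p, q):
        if p_i > 0.0:
            if q_i <= 0.0:
                return math.inf  # p puts mass where q has none
            total += p_i * math.log(p_i / q_i)
    return total


def joint_distribution(state_dist, policy, transition):
    """Flatten p(x) * c(u | x) * s(x' | x, u) into a list over (x, u, x')."""
    joint = []
    for x, p_x in enumerate(state_dist):
        for u, c_u in enumerate(policy[x]):
            for s_next in transition[x][u]:
                joint.append(p_x * c_u * s_next)
    return joint


# Hypothetical toy model: 2 states, 2 actions (all values are assumptions).
state_dist = [0.5, 0.5]                      # p(x)
policy = [[0.9, 0.1], [0.2, 0.8]]            # c(u | x), a randomized agent
transition = [                               # s(x' | x, u), the controlled chain
    [[0.7, 0.3], [0.4, 0.6]],
    [[0.1, 0.9], [0.5, 0.5]],
]

# Hypothetical targets: no preference over actions, next state mostly 0.
target_policy = [[0.5, 0.5], [0.5, 0.5]]
target_transition = [
    [[0.9, 0.1], [0.9, 0.1]],
    [[0.9, 0.1], [0.9, 0.1]],
]

actual = joint_distribution(state_dist, policy, transition)
target = joint_distribution(state_dist, target_policy, target_transition)
cost = kl_divergence(actual, target)
print(f"KL divergence from target: {cost:.4f}")
```

A design procedure in this spirit would search over `policy` to make `cost` small; the divergence is zero only when the closed-loop joint distribution matches the target exactly.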

Item metadata

id	RCAP_4ed55ec12bf0d4edaa908f5b13616040
oai_identifier_str	oai:dspace.uevora.pt:10174/8089
network_acronym_str	RCAP
network_name_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str	7160
dc.title.none.fl_str_mv	An algorithm for cooperative probabilistic control design
author	Barão, Miguel
author_facet	Barão, Miguel
author_role	author
dc.contributor.author.fl_str_mv	Barão, Miguel
publishDate	2012
dc.date.none.fl_str_mv	2012-07-01T00:00:00Z 2013-01-30T16:47:45Z 2013-01-30
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/article
format	article
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://hdl.handle.net/10174/8089 https://doi.org/10.1109/MED.2012.6265795
url	http://hdl.handle.net/10174/8089 https://doi.org/10.1109/MED.2012.6265795
dc.language.iso.fl_str_mv	eng
language	eng
dc.relation.none.fl_str_mv	M. Barão, "An algorithm for cooperative probabilistic control design", in Proceedings of the 20th Mediterranean Conference on Control and Automation, pp. 1161-1164, Barcelona, Spain, July 2012. mjsb@uevora.pt 493
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.source.none.fl_str_mv	reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP
instname_str	Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str	RCAAP
institution	RCAAP
reponame_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_	1799136508343484416
