Development and generalization of a reinforcement learning model for the pump scheduling problem
Main author: | Borges, Guilherme Simões |
---|---|
Publication date: | 2023 |
Document type: | Dissertation |
Language: | eng |
Source title: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) |
Full text: | http://hdl.handle.net/10773/40879 |
Abstract: | Optimization techniques are used as efficient strategies for the operation of Water Supply Systems (WSS). However, as the complexity of a WSS network increases, the computational resources required by optimization methods also increase, potentially compromising optimal operation in real time. This work presents a Reinforcement Learning (RL) model as an alternative to traditional optimization techniques for solving the Pump Scheduling Problem (PSP). The model consists of a Deep Q-Learning Network (DQN) agent and an environment in which EPANET 2.0 simulates the hydraulic behaviour of water networks. This study analyses the RL agent's ability to reach a near-optimal solution to the PSP, testing several energy tariffs and consumption demand patterns. While the computational cost of training the agent is significantly higher than that of finding optimal strategies with traditional optimization algorithms, once trained the model can provide near-optimal solutions almost immediately. A state-feature analysis is performed in which different configurations of state features and rewards are tested. The RL model performs satisfactorily when given information on both demand and tariff patterns, while limiting the number of pump starts within the time window. A generalization analysis is also performed. When trained with multiple tariff and consumption patterns for 60,000 seconds, the RL model achieved an average accuracy of 94% compared with the results of a non-linear programming optimizer, while using only 1% of the CPU time after training. |
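The abstract describes a DQN agent trained against an EPANET 2.0 environment, with state features covering tariff and demand patterns and a reward that penalizes constraint violations. A minimal sketch of that loop is below, with two loud substitutions: EPANET is replaced by a toy one-tank mass balance, and the DQN by tabular Q-learning. All tariff, demand, and tank numbers are invented for illustration and are not from the thesis.

```python
import random

# Toy stand-in for the thesis setup: EPANET 2.0 is replaced by a one-tank
# mass balance, and the DQN agent by tabular Q-learning. Tariff/demand
# patterns and tank limits below are hypothetical.
HOURS = 24
TARIFF = [0.05] * 7 + [0.20] * 13 + [0.05] * 4   # energy price per hour, cheap at night
DEMAND = [1] * 6 + [3] * 12 + [2] * 6            # tank units drawn per hour
PUMP_IN, L_MIN, L_MAX, L_START = 4, 2, 20, 10    # pump inflow and tank bounds

def step(hour, level, pump_on):
    """Advance one hour: pump inflow minus demand; penalize tank-limit violations."""
    level += (PUMP_IN if pump_on else 0) - DEMAND[hour]
    reward = -(TARIFF[hour] * PUMP_IN) if pump_on else 0.0
    if level < L_MIN or level > L_MAX:
        reward -= 5.0                            # constraint-violation penalty
    return max(0, min(L_MAX, level)), reward

def train(episodes=3000, alpha=0.3, gamma=0.99, eps=0.2, seed=0):
    """Epsilon-greedy Q-learning over (hour, tank level) states."""
    rng = random.Random(seed)
    Q = {}                                       # (hour, level) -> [Q_off, Q_on]
    for _ in range(episodes):
        level = L_START
        for h in range(HOURS):
            q = Q.setdefault((h, level), [0.0, 0.0])
            a = rng.randrange(2) if rng.random() < eps else q.index(max(q))
            level, r = step(h, level, a)
            future = max(Q.setdefault((h + 1, level), [0.0, 0.0])) if h + 1 < HOURS else 0.0
            q[a] += alpha * (r + gamma * future - q[a])
    return Q

def greedy_schedule(Q):
    """Roll out the learned policy once; return the 24-h pump plan and its cost."""
    level, plan, cost = L_START, [], 0.0
    for h in range(HOURS):
        a = max((0, 1), key=lambda i: Q.get((h, level), [0.0, 0.0])[i])
        plan.append(a)
        level, r = step(h, level, a)
        cost -= r                                # reward is negative cost
    return plan, cost

plan, cost = greedy_schedule(train())
print(plan, round(cost, 2))
```

This mirrors the trade-off the abstract highlights: the `train` call is the expensive part, while `greedy_schedule` produces a full 24-hour pump plan almost instantly once the value table exists.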
id |
RCAP_fb6bf3353e046a6305bbab51425e4f09 |
---|---|
oai_identifier_str |
oai:ria.ua.pt:10773/40879 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) |
repository_id_str |
7160 |
spelling |
Development and generalization of a reinforcement learning model for the pump scheduling problem | WSS; DRL; DQN; Pump scheduling problem; EPANET; Optimization | Optimization techniques are used as efficient strategies for the operation of Water Supply Systems (WSS). However, as the complexity of a WSS network increases, the computational resources required by optimization methods also increase, potentially compromising optimal operation in real time. This work presents a Reinforcement Learning (RL) model as an alternative to traditional optimization techniques for solving the Pump Scheduling Problem (PSP). The model consists of a Deep Q-Learning Network (DQN) agent and an environment in which EPANET 2.0 simulates the hydraulic behaviour of water networks. This study analyses the RL agent's ability to reach a near-optimal solution to the PSP, testing several energy tariffs and consumption demand patterns. While the computational cost of training the agent is significantly higher than that of finding optimal strategies with traditional optimization algorithms, once trained the model can provide near-optimal solutions almost immediately. A state-feature analysis is performed in which different configurations of state features and rewards are tested. The RL model performs satisfactorily when given information on both demand and tariff patterns, while limiting the number of pump starts within the time window. A generalization analysis is also performed. When trained with multiple tariff and consumption patterns for 60,000 seconds, the RL model achieved an average accuracy of 94% compared with the results of a non-linear programming optimizer, while using only 1% of the CPU time after training. | 2024-02-27T12:06:52Z 2023-12-04T00:00:00Z 2023-12-04 info:eu-repo/semantics/publishedVersion info:eu-repo/semantics/masterThesis application/pdf http://hdl.handle.net/10773/40879 eng Borges, Guilherme Simões info:eu-repo/semantics/openAccess reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP 2024-03-04T01:45:58Z oai:ria.ua.pt:10773/40879 Portal Agregador ONG https://www.rcaap.pt/oai/openaire opendoar:7160 2024-03-20T03:12:36.067063 Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação false |
dc.title.none.fl_str_mv |
Development and generalization of a reinforcement learning model for the pump scheduling problem |
title |
Development and generalization of a reinforcement learning model for the pump scheduling problem |
spellingShingle |
Development and generalization of a reinforcement learning model for the pump scheduling problem | Borges, Guilherme Simões | WSS; DRL; DQN; Pump scheduling problem; EPANET; Optimization |
title_short |
Development and generalization of a reinforcement learning model for the pump scheduling problem |
title_full |
Development and generalization of a reinforcement learning model for the pump scheduling problem |
title_fullStr |
Development and generalization of a reinforcement learning model for the pump scheduling problem |
title_full_unstemmed |
Development and generalization of a reinforcement learning model for the pump scheduling problem |
title_sort |
Development and generalization of a reinforcement learning model for the pump scheduling problem |
author |
Borges, Guilherme Simões |
author_facet |
Borges, Guilherme Simões |
author_role |
author |
dc.contributor.author.fl_str_mv |
Borges, Guilherme Simões |
dc.subject.por.fl_str_mv |
WSS; DRL; DQN; Pump scheduling problem; EPANET; Optimization |
topic |
WSS; DRL; DQN; Pump scheduling problem; EPANET; Optimization |
description |
Optimization techniques are used as efficient strategies for the operation of Water Supply Systems (WSS). However, as the complexity of a WSS network increases, the computational resources required by optimization methods also increase, potentially compromising optimal operation in real time. This work presents a Reinforcement Learning (RL) model as an alternative to traditional optimization techniques for solving the Pump Scheduling Problem (PSP). The model consists of a Deep Q-Learning Network (DQN) agent and an environment in which EPANET 2.0 simulates the hydraulic behaviour of water networks. This study analyses the RL agent's ability to reach a near-optimal solution to the PSP, testing several energy tariffs and consumption demand patterns. While the computational cost of training the agent is significantly higher than that of finding optimal strategies with traditional optimization algorithms, once trained the model can provide near-optimal solutions almost immediately. A state-feature analysis is performed in which different configurations of state features and rewards are tested. The RL model performs satisfactorily when given information on both demand and tariff patterns, while limiting the number of pump starts within the time window. A generalization analysis is also performed. When trained with multiple tariff and consumption patterns for 60,000 seconds, the RL model achieved an average accuracy of 94% compared with the results of a non-linear programming optimizer, while using only 1% of the CPU time after training. |
publishDate |
2023 |
dc.date.none.fl_str_mv |
2023-12-04T00:00:00Z 2023-12-04 2024-02-27T12:06:52Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10773/40879 |
url |
http://hdl.handle.net/10773/40879 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799137775073624064 |