Exploring Deep Reinforcement Learning- Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT

Detalhes bibliográficos
Autor(a) principal: Zheng, Jingjing
Data de Publicação: 2022
Outros Autores: Li, Kai, Ni, Wei, Tovar, Eduardo, Guizani, Mohsen, Mhaisen, Naram
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10400.22/21615
Resumo: Federated learning (FL) has been increasingly considered to preserve data training privacy from eavesdropping attacks in mobile edge computing-based Internet of Thing (EdgeIoT). On the one hand, the learning accuracy of FL can be improved by selecting the IoT devices with large datasets for training, which gives rise to a higher energy consumption. On the other hand, the energy consumption can be reduced by selecting the IoT devices with small datasets for FL, resulting in a falling learning accuracy. In this paper, we formulate a new resource allocation problem for privacy-preserving EdgeIoT to balance the learning accuracy of FL and the energy consumption of the IoT device. We propose a new federated learning-enabled twin-delayed deep deterministic policy gradient (FLDLT3) framework to achieve the optimal accuracy and energy balance in a continuous domain. Furthermore, long short term memory (LSTM) is leveraged in FL-DLT3 to predict the time-varying network state while FL-DLT3 is trained to select the IoT devices and allocate the transmit power. Numerical results demonstrate that the proposed FL-DLT3 achieves fast convergence (less than 100 iterations) while the FL accuracy-to-energy consumption ratio is improved by 51.8% compared to existing state-of-the-art benchmark.
id RCAP_79ee3a4450dccc3a35c1c8cc73c188d1
oai_identifier_str oai:recipp.ipp.pt:10400.22/21615
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Exploring Deep Reinforcement Learning- Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT220503Federated learningOnline resource allocationDeep reinforcement learningMobile edge computingInternet of ThingsFederated learning (FL) has been increasingly considered to preserve data training privacy from eavesdropping attacks in mobile edge computing-based Internet of Thing (EdgeIoT). On the one hand, the learning accuracy of FL can be improved by selecting the IoT devices with large datasets for training, which gives rise to a higher energy consumption. On the other hand, the energy consumption can be reduced by selecting the IoT devices with small datasets for FL, resulting in a falling learning accuracy. In this paper, we formulate a new resource allocation problem for privacy-preserving EdgeIoT to balance the learning accuracy of FL and the energy consumption of the IoT device. We propose a new federated learning-enabled twin-delayed deep deterministic policy gradient (FLDLT3) framework to achieve the optimal accuracy and energy balance in a continuous domain. Furthermore, long short term memory (LSTM) is leveraged in FL-DLT3 to predict the time-varying network state while FL-DLT3 is trained to select the IoT devices and allocate the transmit power. Numerical results demonstrate that the proposed FL-DLT3 achieves fast convergence (less than 100 iterations) while the FL accuracy-to-energy consumption ratio is improved by 51.8% compared to existing state-of-the-art benchmark.This work was supported in part by the National Funds through FCT/MCTES (Portuguese Foundation for Science and Technology), within the CISTER Research Unit under Grant UIDP/UIDB/04234/2020, and in part by national funds through FCT, within project PTDC/EEICOM/3362/2021 (ADANET).IEEERepositório Científico do Instituto Politécnico do PortoZheng, JingjingLi, KaiNi, WeiTovar, EduardoGuizani, MohsenMhaisen, Naram2023-01-17T16:31:24Z2022-05-132022-05-13T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.22/21615eng10.1109/JIOT.2022.3176739info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-03-13T13:17:29Zoai:recipp.ipp.pt:10400.22/21615Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T17:41:38.471894Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Exploring Deep Reinforcement Learning- Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT
220503
title Exploring Deep Reinforcement Learning- Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT
spellingShingle Exploring Deep Reinforcement Learning- Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT
Zheng, Jingjing
Federated learning
Online resource allocation
Deep reinforcement learning
Mobile edge computing
Internet of Things
title_short Exploring Deep Reinforcement Learning- Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT
title_full Exploring Deep Reinforcement Learning- Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT
title_fullStr Exploring Deep Reinforcement Learning- Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT
title_full_unstemmed Exploring Deep Reinforcement Learning- Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT
title_sort Exploring Deep Reinforcement Learning- Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT
author Zheng, Jingjing
author_facet Zheng, Jingjing
Li, Kai
Ni, Wei
Tovar, Eduardo
Guizani, Mohsen
Mhaisen, Naram
author_role author
author2 Li, Kai
Ni, Wei
Tovar, Eduardo
Guizani, Mohsen
Mhaisen, Naram
author2_role author
author
author
author
author
dc.contributor.none.fl_str_mv Repositório Científico do Instituto Politécnico do Porto
dc.contributor.author.fl_str_mv Zheng, Jingjing
Li, Kai
Ni, Wei
Tovar, Eduardo
Guizani, Mohsen
Mhaisen, Naram
dc.subject.por.fl_str_mv Federated learning
Online resource allocation
Deep reinforcement learning
Mobile edge computing
Internet of Things
topic Federated learning
Online resource allocation
Deep reinforcement learning
Mobile edge computing
Internet of Things
description Federated learning (FL) has been increasingly considered to preserve data training privacy from eavesdropping attacks in mobile edge computing-based Internet of Thing (EdgeIoT). On the one hand, the learning accuracy of FL can be improved by selecting the IoT devices with large datasets for training, which gives rise to a higher energy consumption. On the other hand, the energy consumption can be reduced by selecting the IoT devices with small datasets for FL, resulting in a falling learning accuracy. In this paper, we formulate a new resource allocation problem for privacy-preserving EdgeIoT to balance the learning accuracy of FL and the energy consumption of the IoT device. We propose a new federated learning-enabled twin-delayed deep deterministic policy gradient (FLDLT3) framework to achieve the optimal accuracy and energy balance in a continuous domain. Furthermore, long short term memory (LSTM) is leveraged in FL-DLT3 to predict the time-varying network state while FL-DLT3 is trained to select the IoT devices and allocate the transmit power. Numerical results demonstrate that the proposed FL-DLT3 achieves fast convergence (less than 100 iterations) while the FL accuracy-to-energy consumption ratio is improved by 51.8% compared to existing state-of-the-art benchmark.
publishDate 2022
dc.date.none.fl_str_mv 2022-05-13
2022-05-13T00:00:00Z
2023-01-17T16:31:24Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10400.22/21615
url http://hdl.handle.net/10400.22/21615
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 10.1109/JIOT.2022.3176739
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv IEEE
publisher.none.fl_str_mv IEEE
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799131503834169344