Exploring Deep Reinforcement Learning- Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT
Autor(a) principal: | |
---|---|
Data de Publicação: | 2022 |
Outros Autores: | , , , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10400.22/21615 |
Resumo: | Federated learning (FL) has been increasingly considered to preserve data training privacy from eavesdropping attacks in mobile edge computing-based Internet of Thing (EdgeIoT). On the one hand, the learning accuracy of FL can be improved by selecting the IoT devices with large datasets for training, which gives rise to a higher energy consumption. On the other hand, the energy consumption can be reduced by selecting the IoT devices with small datasets for FL, resulting in a falling learning accuracy. In this paper, we formulate a new resource allocation problem for privacy-preserving EdgeIoT to balance the learning accuracy of FL and the energy consumption of the IoT device. We propose a new federated learning-enabled twin-delayed deep deterministic policy gradient (FLDLT3) framework to achieve the optimal accuracy and energy balance in a continuous domain. Furthermore, long short term memory (LSTM) is leveraged in FL-DLT3 to predict the time-varying network state while FL-DLT3 is trained to select the IoT devices and allocate the transmit power. Numerical results demonstrate that the proposed FL-DLT3 achieves fast convergence (less than 100 iterations) while the FL accuracy-to-energy consumption ratio is improved by 51.8% compared to existing state-of-the-art benchmark. |
id |
RCAP_79ee3a4450dccc3a35c1c8cc73c188d1 |
---|---|
oai_identifier_str |
oai:recipp.ipp.pt:10400.22/21615 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Exploring Deep Reinforcement Learning- Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT220503Federated learningOnline resource allocationDeep reinforcement learningMobile edge computingInternet of ThingsFederated learning (FL) has been increasingly considered to preserve data training privacy from eavesdropping attacks in mobile edge computing-based Internet of Thing (EdgeIoT). On the one hand, the learning accuracy of FL can be improved by selecting the IoT devices with large datasets for training, which gives rise to a higher energy consumption. On the other hand, the energy consumption can be reduced by selecting the IoT devices with small datasets for FL, resulting in a falling learning accuracy. In this paper, we formulate a new resource allocation problem for privacy-preserving EdgeIoT to balance the learning accuracy of FL and the energy consumption of the IoT device. We propose a new federated learning-enabled twin-delayed deep deterministic policy gradient (FLDLT3) framework to achieve the optimal accuracy and energy balance in a continuous domain. Furthermore, long short term memory (LSTM) is leveraged in FL-DLT3 to predict the time-varying network state while FL-DLT3 is trained to select the IoT devices and allocate the transmit power. Numerical results demonstrate that the proposed FL-DLT3 achieves fast convergence (less than 100 iterations) while the FL accuracy-to-energy consumption ratio is improved by 51.8% compared to existing state-of-the-art benchmark.This work was supported in part by the National Funds through FCT/MCTES (Portuguese Foundation for Science and Technology), within the CISTER Research Unit under Grant UIDP/UIDB/04234/2020, and in part by national funds through FCT, within project PTDC/EEICOM/3362/2021 (ADANET).IEEERepositório Científico do Instituto Politécnico do PortoZheng, JingjingLi, KaiNi, WeiTovar, EduardoGuizani, MohsenMhaisen, Naram2023-01-17T16:31:24Z2022-05-132022-05-13T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.22/21615eng10.1109/JIOT.2022.3176739info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-03-13T13:17:29Zoai:recipp.ipp.pt:10400.22/21615Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T17:41:38.471894Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Exploring Deep Reinforcement Learning- Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT 220503 |
title |
Exploring Deep Reinforcement Learning- Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT |
spellingShingle |
Exploring Deep Reinforcement Learning- Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT Zheng, Jingjing Federated learning Online resource allocation Deep reinforcement learning Mobile edge computing Internet of Things |
title_short |
Exploring Deep Reinforcement Learning- Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT |
title_full |
Exploring Deep Reinforcement Learning- Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT |
title_fullStr |
Exploring Deep Reinforcement Learning- Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT |
title_full_unstemmed |
Exploring Deep Reinforcement Learning- Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT |
title_sort |
Exploring Deep Reinforcement Learning- Assisted Federated Learning for Online Resource Allocation in Privacy-Preserving EdgeIoT |
author |
Zheng, Jingjing |
author_facet |
Zheng, Jingjing Li, Kai Ni, Wei Tovar, Eduardo Guizani, Mohsen Mhaisen, Naram |
author_role |
author |
author2 |
Li, Kai Ni, Wei Tovar, Eduardo Guizani, Mohsen Mhaisen, Naram |
author2_role |
author author author author author |
dc.contributor.none.fl_str_mv |
Repositório Científico do Instituto Politécnico do Porto |
dc.contributor.author.fl_str_mv |
Zheng, Jingjing Li, Kai Ni, Wei Tovar, Eduardo Guizani, Mohsen Mhaisen, Naram |
dc.subject.por.fl_str_mv |
Federated learning Online resource allocation Deep reinforcement learning Mobile edge computing Internet of Things |
topic |
Federated learning Online resource allocation Deep reinforcement learning Mobile edge computing Internet of Things |
description |
Federated learning (FL) has been increasingly considered to preserve data training privacy from eavesdropping attacks in mobile edge computing-based Internet of Thing (EdgeIoT). On the one hand, the learning accuracy of FL can be improved by selecting the IoT devices with large datasets for training, which gives rise to a higher energy consumption. On the other hand, the energy consumption can be reduced by selecting the IoT devices with small datasets for FL, resulting in a falling learning accuracy. In this paper, we formulate a new resource allocation problem for privacy-preserving EdgeIoT to balance the learning accuracy of FL and the energy consumption of the IoT device. We propose a new federated learning-enabled twin-delayed deep deterministic policy gradient (FLDLT3) framework to achieve the optimal accuracy and energy balance in a continuous domain. Furthermore, long short term memory (LSTM) is leveraged in FL-DLT3 to predict the time-varying network state while FL-DLT3 is trained to select the IoT devices and allocate the transmit power. Numerical results demonstrate that the proposed FL-DLT3 achieves fast convergence (less than 100 iterations) while the FL accuracy-to-energy consumption ratio is improved by 51.8% compared to existing state-of-the-art benchmark. |
publishDate |
2022 |
dc.date.none.fl_str_mv |
2022-05-13 2022-05-13T00:00:00Z 2023-01-17T16:31:24Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10400.22/21615 |
url |
http://hdl.handle.net/10400.22/21615 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
10.1109/JIOT.2022.3176739 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
IEEE |
publisher.none.fl_str_mv |
IEEE |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799131503834169344 |