Reinforcement learning for collaborative robots pick-and-place applications: a case study

Gomes, Natanael Magno; Martins, Filipe; Lima, José; Wörtche, Heinrich

Reinforcement learning for collaborative robots pick-and-place applications: a case study

Detalhes bibliográficos
Autor(a) principal:	Gomes, Natanael Magno
Data de Publicação:	2022
Outros Autores:	Martins, Filipe, Lima, José, Wörtche, Heinrich
Tipo de documento:	Artigo
Idioma:	eng
Título da fonte:	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo:	http://hdl.handle.net/10198/27551
Resumo:	The number of applications in which industrial robots share their working environment with people is increasing. Robots appropriate for such applications are equipped with safety systems according to ISO/TS 15066:2016 and are often referred to as collaborative robots (cobots). Due to the nature of human-robot collaboration, the working environment of cobots is subjected to unforeseeable modifications caused by people. Vision systems are often used to increase the adaptability of cobots, but they usually require knowledge of the objects to be manipulated. The application of machine learning techniques can increase the flexibility by enabling the control system of a cobot to continuously learn and adapt to unexpected changes in the working environment. In this paper we address this issue by investigating the use of Reinforcement Learning (RL) to control a cobot to perform pick-and-place tasks. We present the implementation of a control system that can adapt to changes in position and enables a cobot to grasp objects which were not part of the training. Our proposed system uses deep Q-learning to process color and depth images and generates an ϵ -greedy policy to define robot actions. The Q-values are estimated using Convolution Neural Networks (CNNs) based on pre-trained models for feature extraction. To reduce training time, we implement a simulation environment to first train the RL agent, then we apply the resulting system on a real cobot. System performance is compared when using the pre-trained CNN models ResNext, DenseNet, MobileNet, and MNASNet. Simulation and experimental results validate the proposed approach and show that our system reaches a grasping success rate of 89.9% when manipulating a never-seen object operating with the pre-trained CNN model MobileNet.

Metadados do item

id	RCAP_9a6d3619c78584ea0e3bb7f17586338e
oai_identifier_str	oai:bibliotecadigital.ipb.pt:10198/27551
network_acronym_str	RCAP
network_name_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str	7160
spelling	Reinforcement learning for collaborative robots pick-and-place applications: a case studyReinforcement learningDeep neural networksComputer visionIndustrial robotsCollaborative robotsPick-and-placeGraspingThe number of applications in which industrial robots share their working environment with people is increasing. Robots appropriate for such applications are equipped with safety systems according to ISO/TS 15066:2016 and are often referred to as collaborative robots (cobots). Due to the nature of human-robot collaboration, the working environment of cobots is subjected to unforeseeable modifications caused by people. Vision systems are often used to increase the adaptability of cobots, but they usually require knowledge of the objects to be manipulated. The application of machine learning techniques can increase the flexibility by enabling the control system of a cobot to continuously learn and adapt to unexpected changes in the working environment. In this paper we address this issue by investigating the use of Reinforcement Learning (RL) to control a cobot to perform pick-and-place tasks. We present the implementation of a control system that can adapt to changes in position and enables a cobot to grasp objects which were not part of the training. Our proposed system uses deep Q-learning to process color and depth images and generates an ϵ -greedy policy to define robot actions. The Q-values are estimated using Convolution Neural Networks (CNNs) based on pre-trained models for feature extraction. To reduce training time, we implement a simulation environment to first train the RL agent, then we apply the resulting system on a real cobot. System performance is compared when using the pre-trained CNN models ResNext, DenseNet, MobileNet, and MNASNet. Simulation and experimental results validate the proposed approach and show that our system reaches a grasping success rate of 89.9% when manipulating a never-seen object operating with the pre-trained CNN model MobileNet.This work has been supported by FCT-Fundação para a Ciência e Tecnologia (Portugal) within the Project Scope: UIDB/05757/2020 and by the Innovation Cluster Dracten-ICD (The Netherlands), project Collaborative Connected Robots (Cobots) 2.0.MDPIBiblioteca Digital do IPBGomes, Natanael MagnoMartins, FilipeLima, JoséWörtche, Heinrich2023-03-08T10:08:25Z20222022-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10198/27551engGomes, Natano; Martins, Filipe; Lima, José; Wörtche, Heinrich (2022). Reinforcement learning for collaborative robots pick-and-place applications: a case study. Automationinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-21T11:00:36Zoai:bibliotecadigital.ipb.pt:10198/27551Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T23:17:39.249101Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv	Reinforcement learning for collaborative robots pick-and-place applications: a case study
title	Reinforcement learning for collaborative robots pick-and-place applications: a case study
spellingShingle	Reinforcement learning for collaborative robots pick-and-place applications: a case study Gomes, Natanael Magno Reinforcement learning Deep neural networks Computer vision Industrial robots Collaborative robots Pick-and-place Grasping
title_short	Reinforcement learning for collaborative robots pick-and-place applications: a case study
title_full	Reinforcement learning for collaborative robots pick-and-place applications: a case study
title_fullStr	Reinforcement learning for collaborative robots pick-and-place applications: a case study
title_full_unstemmed	Reinforcement learning for collaborative robots pick-and-place applications: a case study
title_sort	Reinforcement learning for collaborative robots pick-and-place applications: a case study
author	Gomes, Natanael Magno
author_facet	Gomes, Natanael Magno Martins, Filipe Lima, José Wörtche, Heinrich
author_role	author
author2	Martins, Filipe Lima, José Wörtche, Heinrich
author2_role	author author author
dc.contributor.none.fl_str_mv	Biblioteca Digital do IPB
dc.contributor.author.fl_str_mv	Gomes, Natanael Magno Martins, Filipe Lima, José Wörtche, Heinrich
dc.subject.por.fl_str_mv	Reinforcement learning Deep neural networks Computer vision Industrial robots Collaborative robots Pick-and-place Grasping
topic	Reinforcement learning Deep neural networks Computer vision Industrial robots Collaborative robots Pick-and-place Grasping
description	The number of applications in which industrial robots share their working environment with people is increasing. Robots appropriate for such applications are equipped with safety systems according to ISO/TS 15066:2016 and are often referred to as collaborative robots (cobots). Due to the nature of human-robot collaboration, the working environment of cobots is subjected to unforeseeable modifications caused by people. Vision systems are often used to increase the adaptability of cobots, but they usually require knowledge of the objects to be manipulated. The application of machine learning techniques can increase the flexibility by enabling the control system of a cobot to continuously learn and adapt to unexpected changes in the working environment. In this paper we address this issue by investigating the use of Reinforcement Learning (RL) to control a cobot to perform pick-and-place tasks. We present the implementation of a control system that can adapt to changes in position and enables a cobot to grasp objects which were not part of the training. Our proposed system uses deep Q-learning to process color and depth images and generates an ϵ -greedy policy to define robot actions. The Q-values are estimated using Convolution Neural Networks (CNNs) based on pre-trained models for feature extraction. To reduce training time, we implement a simulation environment to first train the RL agent, then we apply the resulting system on a real cobot. System performance is compared when using the pre-trained CNN models ResNext, DenseNet, MobileNet, and MNASNet. Simulation and experimental results validate the proposed approach and show that our system reaches a grasping success rate of 89.9% when manipulating a never-seen object operating with the pre-trained CNN model MobileNet.
publishDate	2022
dc.date.none.fl_str_mv	2022 2022-01-01T00:00:00Z 2023-03-08T10:08:25Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/article
format	article
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://hdl.handle.net/10198/27551
url	http://hdl.handle.net/10198/27551
dc.language.iso.fl_str_mv	eng
language	eng
dc.relation.none.fl_str_mv	Gomes, Natano; Martins, Filipe; Lima, José; Wörtche, Heinrich (2022). Reinforcement learning for collaborative robots pick-and-place applications: a case study. Automation
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	MDPI
publisher.none.fl_str_mv	MDPI
dc.source.none.fl_str_mv	reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP
instname_str	Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str	RCAAP
institution	RCAAP
reponame_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_	1799135467537432576

Reinforcement learning for collaborative robots pick-and-place applications: a case study

Registros relacionados