Object identification for autonomous vehicles based on machine learning

Guedes, Diogo Alexandre Amaral Conde

Object identification for autonomous vehicles based on machine learning

Detalhes bibliográficos
Autor(a) principal:	Guedes, Diogo Alexandre Amaral Conde
Data de Publicação:	2021
Tipo de documento:	Dissertação
Idioma:	eng
Título da fonte:	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo:	http://hdl.handle.net/10773/31213
Resumo:	Autonomous driving is one of the most actively researched fields in artificial intelligence. The autonomous vehicles are expected to significantly reduce the road accidents and casualties one day when they become sufficiently mature transport option. Currently much effort is focused to prove the concept of autonomous vehicles that is based on a suit of sensors to observe their surroundings. In particular, camera and LiDAR are researched as an efficient combination of sensors for on-line object identification on the road. 2D object identification is an already established field in Computer Vision. The successful application of Deep Learning techniques has led to 2D vision with Human-level accuracy. However, for a matter of improved safety more advanced approaches suggest that the vehicle should not rely on a single class of sensors. LiDAR has been proposed as an additional sensor, particularly due to its 3D vision capability. 3D vision relies on LiDAR captured data to recognize objects in 3D. However, in contrast to the 2D object identifi- cation, 3D object detection is a relatively immature field and still has many challenges to overcome. In addition, LiDARs are expensive sensors, which makes the acquisition of data required for training 3D object recognition techniques expensive tasks as well. In this context, this Master's thesis has the major goal to further facilitate the 3D object identification for autonomous vehicles based on Deep Learning (DL). The specific contributions of the present work are the following. First, a comprehensive overview of the state of the art Deep Learning architectures for 3D object identification based on Point Clouds. The purpose of this overview is to understand how to better approach such a problem in the context of autonomous driving. Second, synthetic but realistic Lidar captured data was generated in the GTA V virtual environment. Tools were developed to convert the generated data into the KITTI dataset format, which has become standard in 3D object detection techniques for autonomous driving. Third, some of the overviewed 3D object identification DL architectures were evaluated with the generated data. Though their performance with the generated data was worse than with the original KITTI data, the models were still able to correctly process the synthetic data without being retrained. The future benefit of this work is that the models can be further trained with home-made data and varying testing scenarios. The implemented GTA V mod has proved to be capable of providing rich, well-structured and compatible datasets with the state of the art 3D object identification architectures. The developed tool is publicly available and we hope it will be useful in advancing 3D object identification for autonomous driving, as it removes the dependency from datasets provided by a third party.

Metadados do item

id	RCAP_f8c6e6834483543478682d93e096d87c
oai_identifier_str	oai:ria.ua.pt:10773/31213
network_acronym_str	RCAP
network_name_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str	7160
spelling	Object identification for autonomous vehicles based on machine learningAutonomous drivingLiDARSimulatorPoint cloudDeep learningObject identificationAutonomous driving is one of the most actively researched fields in artificial intelligence. The autonomous vehicles are expected to significantly reduce the road accidents and casualties one day when they become sufficiently mature transport option. Currently much effort is focused to prove the concept of autonomous vehicles that is based on a suit of sensors to observe their surroundings. In particular, camera and LiDAR are researched as an efficient combination of sensors for on-line object identification on the road. 2D object identification is an already established field in Computer Vision. The successful application of Deep Learning techniques has led to 2D vision with Human-level accuracy. However, for a matter of improved safety more advanced approaches suggest that the vehicle should not rely on a single class of sensors. LiDAR has been proposed as an additional sensor, particularly due to its 3D vision capability. 3D vision relies on LiDAR captured data to recognize objects in 3D. However, in contrast to the 2D object identifi- cation, 3D object detection is a relatively immature field and still has many challenges to overcome. In addition, LiDARs are expensive sensors, which makes the acquisition of data required for training 3D object recognition techniques expensive tasks as well. In this context, this Master's thesis has the major goal to further facilitate the 3D object identification for autonomous vehicles based on Deep Learning (DL). The specific contributions of the present work are the following. First, a comprehensive overview of the state of the art Deep Learning architectures for 3D object identification based on Point Clouds. The purpose of this overview is to understand how to better approach such a problem in the context of autonomous driving. Second, synthetic but realistic Lidar captured data was generated in the GTA V virtual environment. Tools were developed to convert the generated data into the KITTI dataset format, which has become standard in 3D object detection techniques for autonomous driving. Third, some of the overviewed 3D object identification DL architectures were evaluated with the generated data. Though their performance with the generated data was worse than with the original KITTI data, the models were still able to correctly process the synthetic data without being retrained. The future benefit of this work is that the models can be further trained with home-made data and varying testing scenarios. The implemented GTA V mod has proved to be capable of providing rich, well-structured and compatible datasets with the state of the art 3D object identification architectures. The developed tool is publicly available and we hope it will be useful in advancing 3D object identification for autonomous driving, as it removes the dependency from datasets provided by a third party.Condução autónoma é uma das áreas mais ativamente estudadas em inteligência artificial. É esperado que os veículos autónomos reduzam significativamente os acidentes rodoviários e vitimas mortais quando se tornarem suficientemente maturos como opção de transporte. Atualmente, muitos dos esforços estão focados na prova de conceito de veículos autónomos serem baseados num conjunto de sensores que observam o ambiente em redor. Em particular, a camara e o LiDAR são estudados como sendo uma combinação eficiente de sensores para realização de identificação de objectos on-line nas estradas. Identificação de objetos 2D é uma área de estudo já estabelecida no campo de Computação Visual. O sucesso na aplicação de técnicas de Deep Learning levou a que a visão 2D atingisse uma precisão ao nível Humano. No entanto, de forma a melhorar a segurança, abordagens mais avançadas sugerem que o veículo não deve depender de uma única classe de sensores. O LiDAR foi proposto como sendo um sensor adicional, particularmente devido à sua capacidade de visão 3D. Visão 3D depende dos dados capturados pelo LiDAR para reconhecer objetos em 3D. No entanto, em contraste com a identificação de objetos 2D, a identificação de objetos 3D é um campo de estudos relativamente imaturo e ainda possui muitos desafios para ultrapassar. Adicionalmente, LiDARs são sensores dispendiosos, o que também torna a aquisição de dados necessários para o treino de técnicas de reconhecimento de objetos 3D mais cara. Neste contexto, esta tese de Mestrado tem como objetivo principal facilitar a identificação de objetos 3D, baseada em Deep Learning (DL), para veículos autónomos. As contribuições especificas deste trabalho são as seguintes. Primeiro, uma visão global compreensiva do estado de arte relativo _as arquiteturas Deep Learning para identificação de objetos 3D baseadas em point clouds. O propósito desta visão global é para perceber como melhor abordar este tipo de problema no contexto de condução autónoma. Segundo, foi gerado um dataset sintético, mas realista, com dados capturados por um LiDAR no ambiente virtual do GTA V. Foram desenvolvidas ferramentas para converter os dados gerados no formato do dataset do KITTI, que se tornou num standard para avaliação de técnicas de deteção de objetos 3D para condução autónoma. Terceiro, algumas das arquiteturas DL de identificação de objetos 3D revistas foram avaliadas com o dataset gerado. Apesar da sua performance com o dataset gerado ter sido pior que os resultados no dataset original do KITTI, os models chegaram a conseguir processar corretamente os dados sintéticos sem serem retreinados. O benefício futuro deste trabalho consiste nos modelos poderem ser adicionalmente treinados com dados produzidos localmente e testados em cenários variados. O mod do GTA V implementado provou ser capaz de fornecer datasets ricos, bem estruturados e compatíveis com o estado de arte em arquiteturas de identificação de objetos 3D. A ferramenta desenvolvida está disponível publicamente e esperamos que seja útil para o avanço da identificação de objetos 3D para condução autónoma, já que remove a dependência de datasets fornecidos por terceiros.2021-04-20T14:33:13Z2021-01-25T00:00:00Z2021-01-25info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10773/31213engGuedes, Diogo Alexandre Amaral Condeinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-02-22T12:00:16Zoai:ria.ua.pt:10773/31213Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T03:03:09.248449Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv	Object identification for autonomous vehicles based on machine learning
title	Object identification for autonomous vehicles based on machine learning
spellingShingle	Object identification for autonomous vehicles based on machine learning Guedes, Diogo Alexandre Amaral Conde Autonomous driving LiDAR Simulator Point cloud Deep learning Object identification
title_short	Object identification for autonomous vehicles based on machine learning
title_full	Object identification for autonomous vehicles based on machine learning
title_fullStr	Object identification for autonomous vehicles based on machine learning
title_full_unstemmed	Object identification for autonomous vehicles based on machine learning
title_sort	Object identification for autonomous vehicles based on machine learning
author	Guedes, Diogo Alexandre Amaral Conde
author_facet	Guedes, Diogo Alexandre Amaral Conde
author_role	author
dc.contributor.author.fl_str_mv	Guedes, Diogo Alexandre Amaral Conde
dc.subject.por.fl_str_mv	Autonomous driving LiDAR Simulator Point cloud Deep learning Object identification
topic	Autonomous driving LiDAR Simulator Point cloud Deep learning Object identification
description	Autonomous driving is one of the most actively researched fields in artificial intelligence. The autonomous vehicles are expected to significantly reduce the road accidents and casualties one day when they become sufficiently mature transport option. Currently much effort is focused to prove the concept of autonomous vehicles that is based on a suit of sensors to observe their surroundings. In particular, camera and LiDAR are researched as an efficient combination of sensors for on-line object identification on the road. 2D object identification is an already established field in Computer Vision. The successful application of Deep Learning techniques has led to 2D vision with Human-level accuracy. However, for a matter of improved safety more advanced approaches suggest that the vehicle should not rely on a single class of sensors. LiDAR has been proposed as an additional sensor, particularly due to its 3D vision capability. 3D vision relies on LiDAR captured data to recognize objects in 3D. However, in contrast to the 2D object identifi- cation, 3D object detection is a relatively immature field and still has many challenges to overcome. In addition, LiDARs are expensive sensors, which makes the acquisition of data required for training 3D object recognition techniques expensive tasks as well. In this context, this Master's thesis has the major goal to further facilitate the 3D object identification for autonomous vehicles based on Deep Learning (DL). The specific contributions of the present work are the following. First, a comprehensive overview of the state of the art Deep Learning architectures for 3D object identification based on Point Clouds. The purpose of this overview is to understand how to better approach such a problem in the context of autonomous driving. Second, synthetic but realistic Lidar captured data was generated in the GTA V virtual environment. Tools were developed to convert the generated data into the KITTI dataset format, which has become standard in 3D object detection techniques for autonomous driving. Third, some of the overviewed 3D object identification DL architectures were evaluated with the generated data. Though their performance with the generated data was worse than with the original KITTI data, the models were still able to correctly process the synthetic data without being retrained. The future benefit of this work is that the models can be further trained with home-made data and varying testing scenarios. The implemented GTA V mod has proved to be capable of providing rich, well-structured and compatible datasets with the state of the art 3D object identification architectures. The developed tool is publicly available and we hope it will be useful in advancing 3D object identification for autonomous driving, as it removes the dependency from datasets provided by a third party.
publishDate	2021
dc.date.none.fl_str_mv	2021-04-20T14:33:13Z 2021-01-25T00:00:00Z 2021-01-25
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://hdl.handle.net/10773/31213
url	http://hdl.handle.net/10773/31213
dc.language.iso.fl_str_mv	eng
language	eng
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.source.none.fl_str_mv	reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP
instname_str	Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str	RCAAP
institution	RCAAP
reponame_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_	1799137686941859840

Object identification for autonomous vehicles based on machine learning

Registros relacionados