Contagem de Pedestres e Deteção de Multidões Numa Cena Através de Aprendizagem Profunda e Visão Computacional

Rafael Veiga Carvalho


Bibliographic details
Main author:	Rafael Veiga Carvalho
Publication date:	2021
Document type:	Dissertation
Language:	por
Source title:	Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos)
Full text:	https://hdl.handle.net/10216/136030
Abstract:	In recent years, technologies based on artificial intelligence have been evolving at an unprecedented pace. Among them are computer vision and Deep Learning, whose use has spread to a wide range of sectors, including industrial applications, healthcare, safety, marketing and personal-use applications. The main focus is on optimizing and automating processes that would otherwise be very time-consuming or even impossible to carry out. With the aim of detecting and counting individuals, as well as quantifying crowds in indoor and public spaces through video surveillance cameras, this Dissertation, carried out within the scope of the Safe Cities project (a partnership between Universidade do Porto and Bosch Security Systems, S.A.), proposes a solution based on computer vision technologies and Deep Learning methods. The review of the state of the art made it possible to identify the main characteristics of computer-vision object detectors, which supported the choice of the pre-trained YOLOv4 model, designed for generic object detection. This detector stands out for its high inference speed, suitable for processing video streams in real time, and for its precision in object detection. The model uses a neural network to extract features from images or video frames and then classifies and detects the objects present in them. The detections produced by the algorithm were filtered so that only pedestrian detections were returned, together with their count.
The developed algorithm also implements a user-adjustable criterion that sets the level from which a group of people is considered a crowd, since this parameter can vary with the angle and installation position of the video surveillance camera and with the scenario being monitored. To validate the algorithm, it was applied directly to counting and detecting people in sets of images (i.e., datasets), in video excerpts and in real-time video captured by Bosch video surveillance cameras. A qualitative analysis of its performance was carried out by varying several parameters of the input data, such as brightness, image quality and color scheme, among others.
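
The filtering, counting and crowd criterion described in the abstract can be illustrated with a minimal Python sketch. It is not the dissertation's code: the `Detection` type, the function names, the `"person"` class label, the confidence cut-off and the sample values are all assumptions made for illustration, and the detector output is mocked rather than produced by YOLOv4.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Detection:
    """One detector output (hypothetical structure, not from the dissertation)."""
    label: str                       # class name predicted by the detector
    confidence: float                # detection score in [0, 1]
    box: Tuple[int, int, int, int]   # (x, y, width, height) in pixels


def count_pedestrians(
    detections: List[Detection],
    min_confidence: float = 0.5,     # assumed cut-off, not stated in the abstract
    person_label: str = "person",    # assumed class name for pedestrians
) -> Tuple[List[Detection], int]:
    """Keep only pedestrian detections and return them with their count."""
    people = [
        d for d in detections
        if d.label == person_label and d.confidence >= min_confidence
    ]
    return people, len(people)


def is_crowd(count: int, crowd_threshold: int) -> bool:
    """User-adjustable criterion: a crowd is reported at or above the threshold."""
    return count >= crowd_threshold


# Mocked detections for a single frame (illustrative values only).
sample_frame = [
    Detection("person", 0.91, (10, 20, 40, 90)),
    Detection("person", 0.62, (80, 25, 38, 88)),
    Detection("car", 0.88, (150, 60, 120, 70)),
    Detection("person", 0.30, (300, 30, 35, 85)),  # dropped: below the cut-off
]

people, total = count_pedestrians(sample_frame)
crowd = is_crowd(total, crowd_threshold=2)
```

Keeping the crowd threshold as a plain argument reflects the abstract's point that the appropriate level depends on the camera angle, its installation position and the monitored scenario, so it would be set per installation rather than fixed in the code.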

Item metadata

id	RCAP_a6408e0fab89ad6e0c66f8da6e37e7f3
oai_identifier_str	oai:repositorio-aberto.up.pt:10216/136030
network_acronym_str	RCAP
network_name_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos)
repository_id_str	7160
dc.title.none.fl_str_mv	Contagem de Pedestres e Deteção de Multidões Numa Cena Através de Aprendizagem Profunda e Visão Computacional
author	Rafael Veiga Carvalho
author_facet	Rafael Veiga Carvalho
author_role	author
dc.contributor.author.fl_str_mv	Rafael Veiga Carvalho
dc.subject.por.fl_str_mv	Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering
topic	Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering
publishDate	2021
dc.date.none.fl_str_mv	2021-07-19 2021-07-19T00:00:00Z 2024-07-18T00:00:00Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	https://hdl.handle.net/10216/136030 TID:202824810
url	https://hdl.handle.net/10216/136030
identifier_str_mv	TID:202824810
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/embargoedAccess
eu_rights_str_mv	embargoedAccess
dc.format.none.fl_str_mv	application/pdf
dc.source.none.fl_str_mv	reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP
instname_str	Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str	RCAAP
institution	RCAAP
reponame_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos)
collection	Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos)
repository.name.fl_str_mv	Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_	1799135903629705216
