Contagem de Pedestres e Deteção de Multidões Numa Cena Através de Aprendizagem Profunda e Visão Computacional

Detalhes bibliográficos
Autor(a) principal: Rafael Veiga Carvalho
Data de Publicação: 2021
Tipo de documento: Dissertação
Idioma: por
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: https://hdl.handle.net/10216/136030
Resumo: In recent years, the development of technologies based on artificial intelligence has been evolving at an unprecedented pace. Some of these technologies are computer vision and Deep Learning, whose utilization has been extended to the most varied sectors, such as in industrial applications, in healthcare, safety, marketing and individual use applications. The main focus is on the optimization and automation of processes that would otherwise prove to be quite time consuming or even impossible to execute. With the objective of detecting and counting the number of individuals, as well as quantifying crowds in indoor and public spaces through video surveillance cameras, the present Dissertation, carried out in line with the Safe Cities project, - a partnership between Universidade do Porto and Bosch Security Systems, S.A. - proposes a solution that takes advantage of computer vision technologies and Deep Learning methods. The bibliographic review of the state of the art allowed to identify the main characteristics of object detectors by computer vision, further enhancing the choice of the pre-trained model YOLOv4, which was designed to perform generic object detection. This object detector stands out for its high inference speed, suitable for processing video streams in real time, and for its precision in object detection. In addition, the model is based on the use of a neural network to perform the extraction of various features in images or videos, and then performs the classification and detection of the objects present in these contents. The detections produced by the algorithm were filtered so that only pedestrian detections were returned, as well as their respective numerical count. The developed algorithm also implements an adjustable criterion by the user to determine from which level a crowd will be considered, since this parameter can vary depending on the angle, the installation position of the video surveillance camera and the scenario being monitored. In order to validate the developed algorithm, it was implemented directly in the counting and detection of people in sets of images (i.e., datasets), in video excerpts and by capturing video in real time through video surveillance cameras from Bosch, having a qualitative analysis of the algorithm performance been performed according to the variation of several parameters in the input data, such as brightness, image quality, color scheme, among others.
id RCAP_a6408e0fab89ad6e0c66f8da6e37e7f3
oai_identifier_str oai:repositorio-aberto.up.pt:10216/136030
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Contagem de Pedestres e Deteção de Multidões Numa Cena Através de Aprendizagem Profunda e Visão ComputacionalEngenharia electrotécnica, electrónica e informáticaElectrical engineering, Electronic engineering, Information engineeringIn recent years, the development of technologies based on artificial intelligence has been evolving at an unprecedented pace. Some of these technologies are computer vision and Deep Learning, whose utilization has been extended to the most varied sectors, such as in industrial applications, in healthcare, safety, marketing and individual use applications. The main focus is on the optimization and automation of processes that would otherwise prove to be quite time consuming or even impossible to execute. With the objective of detecting and counting the number of individuals, as well as quantifying crowds in indoor and public spaces through video surveillance cameras, the present Dissertation, carried out in line with the Safe Cities project, - a partnership between Universidade do Porto and Bosch Security Systems, S.A. - proposes a solution that takes advantage of computer vision technologies and Deep Learning methods. The bibliographic review of the state of the art allowed to identify the main characteristics of object detectors by computer vision, further enhancing the choice of the pre-trained model YOLOv4, which was designed to perform generic object detection. This object detector stands out for its high inference speed, suitable for processing video streams in real time, and for its precision in object detection. In addition, the model is based on the use of a neural network to perform the extraction of various features in images or videos, and then performs the classification and detection of the objects present in these contents. The detections produced by the algorithm were filtered so that only pedestrian detections were returned, as well as their respective numerical count. The developed algorithm also implements an adjustable criterion by the user to determine from which level a crowd will be considered, since this parameter can vary depending on the angle, the installation position of the video surveillance camera and the scenario being monitored. In order to validate the developed algorithm, it was implemented directly in the counting and detection of people in sets of images (i.e., datasets), in video excerpts and by capturing video in real time through video surveillance cameras from Bosch, having a qualitative analysis of the algorithm performance been performed according to the variation of several parameters in the input data, such as brightness, image quality, color scheme, among others.2021-07-192021-07-19T00:00:00Z2024-07-18T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://hdl.handle.net/10216/136030TID:202824810porRafael Veiga Carvalhoinfo:eu-repo/semantics/embargoedAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-29T14:17:20Zoai:repositorio-aberto.up.pt:10216/136030Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T23:58:11.178323Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Contagem de Pedestres e Deteção de Multidões Numa Cena Através de Aprendizagem Profunda e Visão Computacional
title Contagem de Pedestres e Deteção de Multidões Numa Cena Através de Aprendizagem Profunda e Visão Computacional
spellingShingle Contagem de Pedestres e Deteção de Multidões Numa Cena Através de Aprendizagem Profunda e Visão Computacional
Rafael Veiga Carvalho
Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
title_short Contagem de Pedestres e Deteção de Multidões Numa Cena Através de Aprendizagem Profunda e Visão Computacional
title_full Contagem de Pedestres e Deteção de Multidões Numa Cena Através de Aprendizagem Profunda e Visão Computacional
title_fullStr Contagem de Pedestres e Deteção de Multidões Numa Cena Através de Aprendizagem Profunda e Visão Computacional
title_full_unstemmed Contagem de Pedestres e Deteção de Multidões Numa Cena Através de Aprendizagem Profunda e Visão Computacional
title_sort Contagem de Pedestres e Deteção de Multidões Numa Cena Através de Aprendizagem Profunda e Visão Computacional
author Rafael Veiga Carvalho
author_facet Rafael Veiga Carvalho
author_role author
dc.contributor.author.fl_str_mv Rafael Veiga Carvalho
dc.subject.por.fl_str_mv Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
topic Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
description In recent years, the development of technologies based on artificial intelligence has been evolving at an unprecedented pace. Some of these technologies are computer vision and Deep Learning, whose utilization has been extended to the most varied sectors, such as in industrial applications, in healthcare, safety, marketing and individual use applications. The main focus is on the optimization and automation of processes that would otherwise prove to be quite time consuming or even impossible to execute. With the objective of detecting and counting the number of individuals, as well as quantifying crowds in indoor and public spaces through video surveillance cameras, the present Dissertation, carried out in line with the Safe Cities project, - a partnership between Universidade do Porto and Bosch Security Systems, S.A. - proposes a solution that takes advantage of computer vision technologies and Deep Learning methods. The bibliographic review of the state of the art allowed to identify the main characteristics of object detectors by computer vision, further enhancing the choice of the pre-trained model YOLOv4, which was designed to perform generic object detection. This object detector stands out for its high inference speed, suitable for processing video streams in real time, and for its precision in object detection. In addition, the model is based on the use of a neural network to perform the extraction of various features in images or videos, and then performs the classification and detection of the objects present in these contents. The detections produced by the algorithm were filtered so that only pedestrian detections were returned, as well as their respective numerical count. The developed algorithm also implements an adjustable criterion by the user to determine from which level a crowd will be considered, since this parameter can vary depending on the angle, the installation position of the video surveillance camera and the scenario being monitored. In order to validate the developed algorithm, it was implemented directly in the counting and detection of people in sets of images (i.e., datasets), in video excerpts and by capturing video in real time through video surveillance cameras from Bosch, having a qualitative analysis of the algorithm performance been performed according to the variation of several parameters in the input data, such as brightness, image quality, color scheme, among others.
publishDate 2021
dc.date.none.fl_str_mv 2021-07-19
2021-07-19T00:00:00Z
2024-07-18T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://hdl.handle.net/10216/136030
TID:202824810
url https://hdl.handle.net/10216/136030
identifier_str_mv TID:202824810
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv info:eu-repo/semantics/embargoedAccess
eu_rights_str_mv embargoedAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799135903629705216