Contagem de Pedestres e Deteção de Multidões Numa Cena Através de Aprendizagem Profunda e Visão Computacional
Main author: | Rafael Veiga Carvalho |
---|---|
Publication date: | 2021 |
Document type: | Dissertation |
Language: | Portuguese (por) |
Source title: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) |
Full text: | https://hdl.handle.net/10216/136030 |
Abstract: | In recent years, technologies based on artificial intelligence have evolved at an unprecedented pace. Among them are computer vision and deep learning, which are now used in a wide range of sectors, including industry, healthcare, safety, marketing and personal applications. Their main purpose is to optimize and automate processes that would otherwise be very time-consuming or even impossible to carry out. This dissertation, carried out within the Safe Cities project (a partnership between Universidade do Porto and Bosch Security Systems, S.A.), proposes a solution based on computer vision and deep learning for detecting and counting individuals and for quantifying crowds in indoor and public spaces using video surveillance cameras. A review of the state of the art identified the main characteristics of computer vision object detectors and supported the choice of the pre-trained YOLOv4 model, which was designed for generic object detection. This detector stands out for its high inference speed, which makes it suitable for processing video streams in real time, and for its detection precision. The model uses a neural network to extract features from images or videos and then classifies and detects the objects they contain. The detections produced by the algorithm were filtered so that only pedestrian detections were returned, together with their count. The algorithm also provides a user-adjustable criterion that sets the level from which a group of people is considered a crowd, since this parameter can vary with the angle and installation position of the video surveillance camera and with the scene being monitored. To validate the algorithm, it was applied to counting and detecting people in image datasets, in video excerpts and in real-time video captured by Bosch video surveillance cameras. Its performance was analysed qualitatively under variations of several input parameters, such as brightness, image quality and color scheme, among others. |
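The filtering, counting and crowd criterion described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the dissertation's implementation: the `Detection` type, the function names, the `min_confidence` cut-off and the reading of the crowd criterion as a simple head-count threshold are all hypothetical. The only external fact relied on is that detectors pre-trained on COCO label pedestrians as `person`; the detector itself is not run here, and its output is represented by plain Python objects.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Detection:
    """One detector output (hypothetical structure, for illustration only)."""
    label: str                       # class name, e.g. "person" in COCO
    confidence: float                # detector score in [0, 1]
    box: Tuple[int, int, int, int]   # x, y, width, height in pixels


def filter_pedestrians(detections: List[Detection],
                       min_confidence: float = 0.5) -> List[Detection]:
    """Keep only 'person' detections at or above the confidence cut-off."""
    return [d for d in detections
            if d.label == "person" and d.confidence >= min_confidence]


def analyse_frame(detections: List[Detection],
                  crowd_threshold: int,
                  min_confidence: float = 0.5) -> Tuple[int, bool]:
    """Return (pedestrian count, crowd flag) for one frame.

    crowd_threshold is the user-adjustable level; treating it as a plain
    head count is an assumption made for this sketch.
    """
    pedestrians = filter_pedestrians(detections, min_confidence)
    count = len(pedestrians)
    return count, count >= crowd_threshold


if __name__ == "__main__":
    # Hand-written detections standing in for real detector output.
    frame = [
        Detection("person", 0.91, (10, 20, 40, 90)),
        Detection("car", 0.88, (100, 60, 120, 70)),
        Detection("person", 0.30, (200, 25, 38, 85)),  # below the cut-off
        Detection("person", 0.75, (260, 30, 42, 95)),
    ]
    count, crowd = analyse_frame(frame, crowd_threshold=2)
    print(f"pedestrians: {count}, crowd: {crowd}")
```

Keeping the threshold as a call argument rather than a constant reflects the point made in the abstract: the appropriate level depends on the camera angle, its installation position and the scene, so it has to be set per deployment.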
id |
RCAP_a6408e0fab89ad6e0c66f8da6e37e7f3 |
---|---|
oai_identifier_str |
oai:repositorio-aberto.up.pt:10216/136030 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) |
repository_id_str |
7160 |
dc.title.none.fl_str_mv |
Contagem de Pedestres e Deteção de Multidões Numa Cena Através de Aprendizagem Profunda e Visão Computacional |
author |
Rafael Veiga Carvalho |
author_facet |
Rafael Veiga Carvalho |
author_role |
author |
dc.contributor.author.fl_str_mv |
Rafael Veiga Carvalho |
dc.subject.por.fl_str_mv |
Engenharia electrotécnica, electrónica e informática; Electrical engineering, Electronic engineering, Information engineering |
topic |
Engenharia electrotécnica, electrónica e informática; Electrical engineering, Electronic engineering, Information engineering |
publishDate |
2021 |
dc.date.none.fl_str_mv |
2021-07-19 2021-07-19T00:00:00Z 2024-07-18T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://hdl.handle.net/10216/136030 TID:202824810 |
url |
https://hdl.handle.net/10216/136030 |
identifier_str_mv |
TID:202824810 |
dc.language.iso.fl_str_mv |
por |
language |
por |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/embargoedAccess |
eu_rights_str_mv |
embargoedAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799135903629705216 |