TARGET TRACKING IN COMPLEX SCENES BASED ON COMPUTER VISION

Detalhes bibliográficos
Autor(a) principal: Shang,Huanan
Data de Publicação: 2022
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Revista brasileira de medicina do esporte (Online)
Texto Completo: http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1517-86922022000500436
Resumo: ABSTRACT Objective: Use the deep learning network model to identify key content in videos. Methodology: After reviewing the literature on computer vision, the feature extraction of the target video from the network using deep learning with the time-series data enhancement method was performed. The preprocessing method for data augmentation and Spatio-temporal feature extraction on the video based on LI3D network was explained. Accuracy rate, precision, and recall were used as indices. Results: The three indicators increased from 0.85, 0.88, and 0.84 to 0.89, 0.90, and 0.88, respectively. This shows that the LI3D network model maintains a high recall rate accompanied by high accuracy after data augmentation. The accuracy and loss function curves of the training phase show that the accuracy of the network is greatly improved compared to I3D. Conclusion: The experiment proves that the LI3D model is more stable and has faster convergence. By comparing the accuracy curve and loss function curve during LI3D, LI3D-LSTM, and LI3D-BiLSTM training, it is found that the LI3D-BiLSTM model converges faster. Level of evidence II; Therapeutic studies - investigation of treatment results.
id SBMEE-1_1746f6b719679fc5a99913f30f20aa05
oai_identifier_str oai:scielo:S1517-86922022000500436
network_acronym_str SBMEE-1
network_name_str Revista brasileira de medicina do esporte (Online)
repository_id_str
spelling TARGET TRACKING IN COMPLEX SCENES BASED ON COMPUTER VISIONComputersComputer Vision SystemsPublic HealthABSTRACT Objective: Use the deep learning network model to identify key content in videos. Methodology: After reviewing the literature on computer vision, the feature extraction of the target video from the network using deep learning with the time-series data enhancement method was performed. The preprocessing method for data augmentation and Spatio-temporal feature extraction on the video based on LI3D network was explained. Accuracy rate, precision, and recall were used as indices. Results: The three indicators increased from 0.85, 0.88, and 0.84 to 0.89, 0.90, and 0.88, respectively. This shows that the LI3D network model maintains a high recall rate accompanied by high accuracy after data augmentation. The accuracy and loss function curves of the training phase show that the accuracy of the network is greatly improved compared to I3D. Conclusion: The experiment proves that the LI3D model is more stable and has faster convergence. By comparing the accuracy curve and loss function curve during LI3D, LI3D-LSTM, and LI3D-BiLSTM training, it is found that the LI3D-BiLSTM model converges faster. Level of evidence II; Therapeutic studies - investigation of treatment results.Sociedade Brasileira de Medicina do Exercício e do Esporte2022-10-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersiontext/htmlhttp://old.scielo.br/scielo.php?script=sci_arttext&pid=S1517-86922022000500436Revista Brasileira de Medicina do Esporte v.28 n.5 2022reponame:Revista brasileira de medicina do esporte (Online)instname:Sociedade Brasileira de Medicina do Exercício e do Esporte (SBMEE)instacron:SBMEE10.1590/1517-8692202228052021_0532info:eu-repo/semantics/openAccessShang,Huananeng2022-05-16T00:00:00Zoai:scielo:S1517-86922022000500436Revistahttp://www.scielo.br/rbmeONGhttps://old.scielo.br/oai/scielo-oai.php||revista@medicinadoesporte.org.br1806-99401517-8692opendoar:2022-05-16T00:00Revista brasileira de medicina do esporte (Online) - Sociedade Brasileira de Medicina do Exercício e do Esporte (SBMEE)false
dc.title.none.fl_str_mv TARGET TRACKING IN COMPLEX SCENES BASED ON COMPUTER VISION
title TARGET TRACKING IN COMPLEX SCENES BASED ON COMPUTER VISION
spellingShingle TARGET TRACKING IN COMPLEX SCENES BASED ON COMPUTER VISION
Shang,Huanan
Computers
Computer Vision Systems
Public Health
title_short TARGET TRACKING IN COMPLEX SCENES BASED ON COMPUTER VISION
title_full TARGET TRACKING IN COMPLEX SCENES BASED ON COMPUTER VISION
title_fullStr TARGET TRACKING IN COMPLEX SCENES BASED ON COMPUTER VISION
title_full_unstemmed TARGET TRACKING IN COMPLEX SCENES BASED ON COMPUTER VISION
title_sort TARGET TRACKING IN COMPLEX SCENES BASED ON COMPUTER VISION
author Shang,Huanan
author_facet Shang,Huanan
author_role author
dc.contributor.author.fl_str_mv Shang,Huanan
dc.subject.por.fl_str_mv Computers
Computer Vision Systems
Public Health
topic Computers
Computer Vision Systems
Public Health
description ABSTRACT Objective: Use the deep learning network model to identify key content in videos. Methodology: After reviewing the literature on computer vision, the feature extraction of the target video from the network using deep learning with the time-series data enhancement method was performed. The preprocessing method for data augmentation and Spatio-temporal feature extraction on the video based on LI3D network was explained. Accuracy rate, precision, and recall were used as indices. Results: The three indicators increased from 0.85, 0.88, and 0.84 to 0.89, 0.90, and 0.88, respectively. This shows that the LI3D network model maintains a high recall rate accompanied by high accuracy after data augmentation. The accuracy and loss function curves of the training phase show that the accuracy of the network is greatly improved compared to I3D. Conclusion: The experiment proves that the LI3D model is more stable and has faster convergence. By comparing the accuracy curve and loss function curve during LI3D, LI3D-LSTM, and LI3D-BiLSTM training, it is found that the LI3D-BiLSTM model converges faster. Level of evidence II; Therapeutic studies - investigation of treatment results.
publishDate 2022
dc.date.none.fl_str_mv 2022-10-01
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1517-86922022000500436
url http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1517-86922022000500436
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 10.1590/1517-8692202228052021_0532
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv text/html
dc.publisher.none.fl_str_mv Sociedade Brasileira de Medicina do Exercício e do Esporte
publisher.none.fl_str_mv Sociedade Brasileira de Medicina do Exercício e do Esporte
dc.source.none.fl_str_mv Revista Brasileira de Medicina do Esporte v.28 n.5 2022
reponame:Revista brasileira de medicina do esporte (Online)
instname:Sociedade Brasileira de Medicina do Exercício e do Esporte (SBMEE)
instacron:SBMEE
instname_str Sociedade Brasileira de Medicina do Exercício e do Esporte (SBMEE)
instacron_str SBMEE
institution SBMEE
reponame_str Revista brasileira de medicina do esporte (Online)
collection Revista brasileira de medicina do esporte (Online)
repository.name.fl_str_mv Revista brasileira de medicina do esporte (Online) - Sociedade Brasileira de Medicina do Exercício e do Esporte (SBMEE)
repository.mail.fl_str_mv ||revista@medicinadoesporte.org.br
_version_ 1752122238929207296