From a Visual Scene to a Virtual Representation: A Cross-Domain Review

Bibliographic details
Main author: Pereira, Américo
Publication date: 2023
Other authors: Carvalho, Pedro; Pereira, Nuno; Viana, Paula; Côrte-Real, Luís
Document type: Article
Language: English
Source title: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Full text: http://hdl.handle.net/10400.22/24734
Abstract: The widespread use of smartphones and other low-cost equipment as recording devices, the massive growth in bandwidth, and the ever-growing demand for new applications with enhanced capabilities have made visual data a must in several scenarios, including surveillance, sports, retail, entertainment, and intelligent vehicles. Despite significant advances in analyzing and extracting data from images and video, there is a lack of solutions able to analyze and semantically describe the information in the visual scene so that it can be efficiently used and repurposed. Scientific contributions have focused on individual aspects or on specific problems and application areas, and no cross-domain solution is available to implement a complete system that enables information passing between cross-cutting algorithms. This paper analyses the problem from an end-to-end perspective, i.e., from the visual scene analysis to the representation of information in a virtual environment, including how the extracted data can be described and stored. A simple processing pipeline is introduced to set up a structure for discussing challenges and opportunities in different steps of the entire process, making it possible to identify current gaps in the literature. The work reviews various technologies specifically from the perspective of their applicability to an end-to-end pipeline for scene analysis and synthesis, along with an extensive analysis of datasets for relevant tasks.
OAI identifier: oai:recipp.ipp.pt:10400.22/24734
Contributor: Repositório Científico do Instituto Politécnico do Porto
Keywords: Computer vision; datasets; scene analysis; scene reconstruction; visual scene understanding
Dates: 2023-06-14; 2024-01-29T08:25:41Z
Version: publishedVersion
Citation: A. Pereira, P. Carvalho, N. Pereira, P. Viana and L. Côrte-Real, "From a Visual Scene to a Virtual Representation: A Cross-Domain Review," in IEEE Access, vol. 11, pp. 57916-57933, 2023, doi: 10.1109/ACCESS.2023.3283495.
Access rights: openAccess
Format: application/pdf
Publisher: IEEE
Institution: Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação (RCAAP)