From a Visual Scene to a Virtual Representation: A Cross-Domain Review
Autor(a) principal: | |
---|---|
Data de Publicação: | 2023 |
Outros Autores: | , , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10400.22/24734 |
Resumo: | The widespread use of smartphones and other low-cost equipment as recording devices, the massive growth in bandwidth, and the ever-growing demand for new applications with enhanced capabilities, made visual data a must in several scenarios, including surveillance, sports, retail, entertainment, and intelligent vehicles. Despite significant advances in analyzing and extracting data from images and video, there is a lack of solutions able to analyze and semantically describe the information in the visual scene so that it can be efficiently used and repurposed. Scientific contributions have focused on individual aspects or addressing specific problems and application areas, and no cross-domain solution is available to implement a complete system that enables information passing between cross-cutting algorithms. This paper analyses the problem from an end-to-end perspective, i.e., from the visual scene analysis to the representation of information in a virtual environment, including how the extracted data can be described and stored. A simple processing pipeline is introduced to set up a structure for discussing challenges and opportunities in different steps of the entire process, allowing to identify current gaps in the literature. The work reviews various technologies specifically from the perspective of their applicability to an endto- end pipeline for scene analysis and synthesis, along with an extensive analysis of datasets for relevant tasks. |
id |
RCAP_662d535d0c3cbe8859865e4e92f7e2a7 |
---|---|
oai_identifier_str |
oai:recipp.ipp.pt:10400.22/24734 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
From a Visual Scene to a Virtual Representation: A Cross-Domain ReviewComputer vision; datasets; scene analysis; scene reconstruction; visual scene understandingThe widespread use of smartphones and other low-cost equipment as recording devices, the massive growth in bandwidth, and the ever-growing demand for new applications with enhanced capabilities, made visual data a must in several scenarios, including surveillance, sports, retail, entertainment, and intelligent vehicles. Despite significant advances in analyzing and extracting data from images and video, there is a lack of solutions able to analyze and semantically describe the information in the visual scene so that it can be efficiently used and repurposed. Scientific contributions have focused on individual aspects or addressing specific problems and application areas, and no cross-domain solution is available to implement a complete system that enables information passing between cross-cutting algorithms. This paper analyses the problem from an end-to-end perspective, i.e., from the visual scene analysis to the representation of information in a virtual environment, including how the extracted data can be described and stored. A simple processing pipeline is introduced to set up a structure for discussing challenges and opportunities in different steps of the entire process, allowing to identify current gaps in the literature. The work reviews various technologies specifically from the perspective of their applicability to an endto- end pipeline for scene analysis and synthesis, along with an extensive analysis of datasets for relevant tasks.IEEERepositório Científico do Instituto Politécnico do PortoPereira, AméricoCarvalho, PedroPereira, NunoViana, PaulaCôrte-Real, Luís2024-01-29T08:25:41Z2023-06-142023-06-14T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.22/24734engA. Pereira, P. Carvalho, N. Pereira, P. Viana and L. Côrte-Real, "From a Visual Scene to a Virtual Representation: A Cross-Domain Review," in IEEE Access, vol. 11, pp. 57916-57933, 2023, doi: 10.1109/ACCESS.2023.3283495.10.1109/ACCESS.2023.3283495info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-01-31T01:50:46Zoai:recipp.ipp.pt:10400.22/24734Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T01:59:06.769325Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
From a Visual Scene to a Virtual Representation: A Cross-Domain Review |
title |
From a Visual Scene to a Virtual Representation: A Cross-Domain Review |
spellingShingle |
From a Visual Scene to a Virtual Representation: A Cross-Domain Review Pereira, Américo Computer vision; datasets; scene analysis; scene reconstruction; visual scene understanding |
title_short |
From a Visual Scene to a Virtual Representation: A Cross-Domain Review |
title_full |
From a Visual Scene to a Virtual Representation: A Cross-Domain Review |
title_fullStr |
From a Visual Scene to a Virtual Representation: A Cross-Domain Review |
title_full_unstemmed |
From a Visual Scene to a Virtual Representation: A Cross-Domain Review |
title_sort |
From a Visual Scene to a Virtual Representation: A Cross-Domain Review |
author |
Pereira, Américo |
author_facet |
Pereira, Américo Carvalho, Pedro Pereira, Nuno Viana, Paula Côrte-Real, Luís |
author_role |
author |
author2 |
Carvalho, Pedro Pereira, Nuno Viana, Paula Côrte-Real, Luís |
author2_role |
author author author author |
dc.contributor.none.fl_str_mv |
Repositório Científico do Instituto Politécnico do Porto |
dc.contributor.author.fl_str_mv |
Pereira, Américo Carvalho, Pedro Pereira, Nuno Viana, Paula Côrte-Real, Luís |
dc.subject.por.fl_str_mv |
Computer vision; datasets; scene analysis; scene reconstruction; visual scene understanding |
topic |
Computer vision; datasets; scene analysis; scene reconstruction; visual scene understanding |
description |
The widespread use of smartphones and other low-cost equipment as recording devices, the massive growth in bandwidth, and the ever-growing demand for new applications with enhanced capabilities, made visual data a must in several scenarios, including surveillance, sports, retail, entertainment, and intelligent vehicles. Despite significant advances in analyzing and extracting data from images and video, there is a lack of solutions able to analyze and semantically describe the information in the visual scene so that it can be efficiently used and repurposed. Scientific contributions have focused on individual aspects or addressing specific problems and application areas, and no cross-domain solution is available to implement a complete system that enables information passing between cross-cutting algorithms. This paper analyses the problem from an end-to-end perspective, i.e., from the visual scene analysis to the representation of information in a virtual environment, including how the extracted data can be described and stored. A simple processing pipeline is introduced to set up a structure for discussing challenges and opportunities in different steps of the entire process, allowing to identify current gaps in the literature. The work reviews various technologies specifically from the perspective of their applicability to an endto- end pipeline for scene analysis and synthesis, along with an extensive analysis of datasets for relevant tasks. |
publishDate |
2023 |
dc.date.none.fl_str_mv |
2023-06-14 2023-06-14T00:00:00Z 2024-01-29T08:25:41Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10400.22/24734 |
url |
http://hdl.handle.net/10400.22/24734 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
A. Pereira, P. Carvalho, N. Pereira, P. Viana and L. Côrte-Real, "From a Visual Scene to a Virtual Representation: A Cross-Domain Review," in IEEE Access, vol. 11, pp. 57916-57933, 2023, doi: 10.1109/ACCESS.2023.3283495. 10.1109/ACCESS.2023.3283495 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
IEEE |
publisher.none.fl_str_mv |
IEEE |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799137074882805760 |