Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications

Costa, Tiago S.; Viana, Paula; Andrade, Maria Teresa

Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications

Detalhes bibliográficos
Autor(a) principal:	Costa, Tiago S.
Data de Publicação:	2023
Outros Autores:	Viana, Paula, Andrade, Maria Teresa
Tipo de documento:	Artigo
Idioma:	eng
Título da fonte:	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo:	http://hdl.handle.net/10400.22/24731
Resumo:	Quality of Experience (QoE) in multi-view streaming systems is known to be severely affected by the latency associated with view-switching procedures. Anticipating the navigation intentions of the viewer on the multi-view scene could provide the means to greatly reduce such latency. The research work presented in this article builds on this premise by proposing a new predictive view-selection mechanism. A VGG16-inspired Convolutional Neural Network (CNN) is used to identify the viewer’s focus of attention and determine which views would be most suited to be presented in the brief term, i.e., the near-term viewing intentions. This way, those views can be locally buffered before they are actually needed. To this aim, two datasets were used to evaluate the prediction performance and impact on latency, in particular when compared to the solution implemented in the previous version of our multi-view streaming system. Results obtained with this work translate into a generalized improvement in perceived QoE. A significant reduction in latency during view-switching procedures was effectively achieved. Moreover, results also demonstrated that the prediction of the user’s visual interest was achieved with a high level of accuracy. An experimental platform was also established on which future predictive models can be integrated and compared with previously implemented models.

Metadados do item

id	RCAP_3518ff8927e06f49c8fd8ff684391d21
oai_identifier_str	oai:recipp.ipp.pt:10400.22/24731
network_acronym_str	RCAP
network_name_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str	7160
spelling	Deep Learning Approach for Seamless Navigation in Multi-View Streaming ApplicationsMultimedia; streaming; multi-view; focus-of-attention; deep learningQuality of Experience (QoE) in multi-view streaming systems is known to be severely affected by the latency associated with view-switching procedures. Anticipating the navigation intentions of the viewer on the multi-view scene could provide the means to greatly reduce such latency. The research work presented in this article builds on this premise by proposing a new predictive view-selection mechanism. A VGG16-inspired Convolutional Neural Network (CNN) is used to identify the viewer’s focus of attention and determine which views would be most suited to be presented in the brief term, i.e., the near-term viewing intentions. This way, those views can be locally buffered before they are actually needed. To this aim, two datasets were used to evaluate the prediction performance and impact on latency, in particular when compared to the solution implemented in the previous version of our multi-view streaming system. Results obtained with this work translate into a generalized improvement in perceived QoE. A significant reduction in latency during view-switching procedures was effectively achieved. Moreover, results also demonstrated that the prediction of the user’s visual interest was achieved with a high level of accuracy. An experimental platform was also established on which future predictive models can be integrated and compared with previously implemented models.IEEERepositório Científico do Instituto Politécnico do PortoCosta, Tiago S.Viana, PaulaAndrade, Maria Teresa2024-01-29T08:24:48Z2023-08-312023-08-31T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.22/24731eng3. T. S. Costa, P. Viana and M. T. Andrade (2023). Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications, in IEEE Access, vol. 11, pp. 93883-93897, 2023, doi: 10.1109/ACCESS.2023.331082210.1109/ACCESS.2023.3310822info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-01-31T01:50:42Zoai:recipp.ipp.pt:10400.22/24731Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T01:59:06.638477Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv	Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications
title	Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications
spellingShingle	Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications Costa, Tiago S. Multimedia; streaming; multi-view; focus-of-attention; deep learning
title_short	Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications
title_full	Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications
title_fullStr	Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications
title_full_unstemmed	Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications
title_sort	Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications
author	Costa, Tiago S.
author_facet	Costa, Tiago S. Viana, Paula Andrade, Maria Teresa
author_role	author
author2	Viana, Paula Andrade, Maria Teresa
author2_role	author author
dc.contributor.none.fl_str_mv	Repositório Científico do Instituto Politécnico do Porto
dc.contributor.author.fl_str_mv	Costa, Tiago S. Viana, Paula Andrade, Maria Teresa
dc.subject.por.fl_str_mv	Multimedia; streaming; multi-view; focus-of-attention; deep learning
topic	Multimedia; streaming; multi-view; focus-of-attention; deep learning
description	Quality of Experience (QoE) in multi-view streaming systems is known to be severely affected by the latency associated with view-switching procedures. Anticipating the navigation intentions of the viewer on the multi-view scene could provide the means to greatly reduce such latency. The research work presented in this article builds on this premise by proposing a new predictive view-selection mechanism. A VGG16-inspired Convolutional Neural Network (CNN) is used to identify the viewer’s focus of attention and determine which views would be most suited to be presented in the brief term, i.e., the near-term viewing intentions. This way, those views can be locally buffered before they are actually needed. To this aim, two datasets were used to evaluate the prediction performance and impact on latency, in particular when compared to the solution implemented in the previous version of our multi-view streaming system. Results obtained with this work translate into a generalized improvement in perceived QoE. A significant reduction in latency during view-switching procedures was effectively achieved. Moreover, results also demonstrated that the prediction of the user’s visual interest was achieved with a high level of accuracy. An experimental platform was also established on which future predictive models can be integrated and compared with previously implemented models.
publishDate	2023
dc.date.none.fl_str_mv	2023-08-31 2023-08-31T00:00:00Z 2024-01-29T08:24:48Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/article
format	article
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://hdl.handle.net/10400.22/24731
url	http://hdl.handle.net/10400.22/24731
dc.language.iso.fl_str_mv	eng
language	eng
dc.relation.none.fl_str_mv	3. T. S. Costa, P. Viana and M. T. Andrade (2023). Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications, in IEEE Access, vol. 11, pp. 93883-93897, 2023, doi: 10.1109/ACCESS.2023.3310822 10.1109/ACCESS.2023.3310822
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	IEEE
publisher.none.fl_str_mv	IEEE
dc.source.none.fl_str_mv	reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP
instname_str	Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str	RCAAP
institution	RCAAP
reponame_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_	1817554215265894400

Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications

Registros relacionados