Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications
Autor(a) principal: | |
---|---|
Data de Publicação: | 2023 |
Outros Autores: | , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10400.22/24731 |
Resumo: | Quality of Experience (QoE) in multi-view streaming systems is known to be severely affected by the latency associated with view-switching procedures. Anticipating the navigation intentions of the viewer on the multi-view scene could provide the means to greatly reduce such latency. The research work presented in this article builds on this premise by proposing a new predictive view-selection mechanism. A VGG16-inspired Convolutional Neural Network (CNN) is used to identify the viewer’s focus of attention and determine which views would be most suited to be presented in the brief term, i.e., the near-term viewing intentions. This way, those views can be locally buffered before they are actually needed. To this aim, two datasets were used to evaluate the prediction performance and impact on latency, in particular when compared to the solution implemented in the previous version of our multi-view streaming system. Results obtained with this work translate into a generalized improvement in perceived QoE. A significant reduction in latency during view-switching procedures was effectively achieved. Moreover, results also demonstrated that the prediction of the user’s visual interest was achieved with a high level of accuracy. An experimental platform was also established on which future predictive models can be integrated and compared with previously implemented models. |
id |
RCAP_3518ff8927e06f49c8fd8ff684391d21 |
---|---|
oai_identifier_str |
oai:recipp.ipp.pt:10400.22/24731 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Deep Learning Approach for Seamless Navigation in Multi-View Streaming ApplicationsMultimedia; streaming; multi-view; focus-of-attention; deep learningQuality of Experience (QoE) in multi-view streaming systems is known to be severely affected by the latency associated with view-switching procedures. Anticipating the navigation intentions of the viewer on the multi-view scene could provide the means to greatly reduce such latency. The research work presented in this article builds on this premise by proposing a new predictive view-selection mechanism. A VGG16-inspired Convolutional Neural Network (CNN) is used to identify the viewer’s focus of attention and determine which views would be most suited to be presented in the brief term, i.e., the near-term viewing intentions. This way, those views can be locally buffered before they are actually needed. To this aim, two datasets were used to evaluate the prediction performance and impact on latency, in particular when compared to the solution implemented in the previous version of our multi-view streaming system. Results obtained with this work translate into a generalized improvement in perceived QoE. A significant reduction in latency during view-switching procedures was effectively achieved. Moreover, results also demonstrated that the prediction of the user’s visual interest was achieved with a high level of accuracy. An experimental platform was also established on which future predictive models can be integrated and compared with previously implemented models.IEEERepositório Científico do Instituto Politécnico do PortoCosta, Tiago S.Viana, PaulaAndrade, Maria Teresa2024-01-29T08:24:48Z2023-08-312023-08-31T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.22/24731eng3. T. S. Costa, P. Viana and M. T. Andrade (2023). Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications, in IEEE Access, vol. 11, pp. 93883-93897, 2023, doi: 10.1109/ACCESS.2023.331082210.1109/ACCESS.2023.3310822info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-01-31T01:50:42Zoai:recipp.ipp.pt:10400.22/24731Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T01:59:06.638477Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications |
title |
Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications |
spellingShingle |
Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications Costa, Tiago S. Multimedia; streaming; multi-view; focus-of-attention; deep learning |
title_short |
Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications |
title_full |
Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications |
title_fullStr |
Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications |
title_full_unstemmed |
Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications |
title_sort |
Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications |
author |
Costa, Tiago S. |
author_facet |
Costa, Tiago S. Viana, Paula Andrade, Maria Teresa |
author_role |
author |
author2 |
Viana, Paula Andrade, Maria Teresa |
author2_role |
author author |
dc.contributor.none.fl_str_mv |
Repositório Científico do Instituto Politécnico do Porto |
dc.contributor.author.fl_str_mv |
Costa, Tiago S. Viana, Paula Andrade, Maria Teresa |
dc.subject.por.fl_str_mv |
Multimedia; streaming; multi-view; focus-of-attention; deep learning |
topic |
Multimedia; streaming; multi-view; focus-of-attention; deep learning |
description |
Quality of Experience (QoE) in multi-view streaming systems is known to be severely affected by the latency associated with view-switching procedures. Anticipating the navigation intentions of the viewer on the multi-view scene could provide the means to greatly reduce such latency. The research work presented in this article builds on this premise by proposing a new predictive view-selection mechanism. A VGG16-inspired Convolutional Neural Network (CNN) is used to identify the viewer’s focus of attention and determine which views would be most suited to be presented in the brief term, i.e., the near-term viewing intentions. This way, those views can be locally buffered before they are actually needed. To this aim, two datasets were used to evaluate the prediction performance and impact on latency, in particular when compared to the solution implemented in the previous version of our multi-view streaming system. Results obtained with this work translate into a generalized improvement in perceived QoE. A significant reduction in latency during view-switching procedures was effectively achieved. Moreover, results also demonstrated that the prediction of the user’s visual interest was achieved with a high level of accuracy. An experimental platform was also established on which future predictive models can be integrated and compared with previously implemented models. |
publishDate |
2023 |
dc.date.none.fl_str_mv |
2023-08-31 2023-08-31T00:00:00Z 2024-01-29T08:24:48Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10400.22/24731 |
url |
http://hdl.handle.net/10400.22/24731 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
3. T. S. Costa, P. Viana and M. T. Andrade (2023). Deep Learning Approach for Seamless Navigation in Multi-View Streaming Applications, in IEEE Access, vol. 11, pp. 93883-93897, 2023, doi: 10.1109/ACCESS.2023.3310822 10.1109/ACCESS.2023.3310822 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
IEEE |
publisher.none.fl_str_mv |
IEEE |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799137074877562880 |