Predição de intensidade sonora percebida (loudness ) para áudio espacial

Leandro da Silva Pires

Predição de intensidade sonora percebida (loudness ) para áudio espacial

Detalhes bibliográficos
Autor(a) principal:	Leandro da Silva Pires
Data de Publicação:	2019
Tipo de documento:	Tese
Idioma:	por
Título da fonte:	Repositório Institucional da UFMG
Texto Completo:	http://hdl.handle.net/1843/30272
Resumo:	Loudness control for brodcasting is a common and legally required practice since the International Telecommunication Union (ITU) Recommendation ITUR BS.1770 for objective measurements in multichannel audio. Recommendations and regulations based on the ITU-R algorithm have been published worldwide, including Brazil. There is scope for improving national regulations in light of recent contributions to the field, and also for adapting the ITU-R model to measurements in advanced audio systems. This work pursues these two goals by testing the parameters of the Brazilian standard with a real-time loudness controller using short-form descriptors and by developing a new objective measurement model adapted to the new spatial audio formats. The proposed method performed well compared to other loudness models, although it was purely signal processing based and its readings were not very close to subject responses. The potential benefits of a more perceptually motivated model led to a PhD placement in the Institute of Sound Recording at the University of Surrey (UK), where listening tests were conducted to assess positional parameters of distance, azimuth and elevation, whose results served as a basis for deriving gain correction curves and a new directional weighting for the ITU-R model. General results point to advancements in the regulatory and standardization fronts, either by the elaboration of a strategy to improve the Brazilian standard of loudness, or by comparing this new prediction method with the critical fortune of loudness models through measurements on audio content for multichannel reproduction systems. The developed model resulted in the best trade-off between prediction errors (RMSE*), correlation between predictions and subject responses, and mean run time.

Metadados do item

id	UFMG_961dda71c6b1d880a4d754991da7d68e
oai_identifier_str	oai:repositorio.ufmg.br:1843/30272
network_acronym_str	UFMG
network_name_str	Repositório Institucional da UFMG
repository_id_str
spelling	Predição de intensidade sonora percebida (loudness ) para áudio espacialLoudnessRadiodifusãoAuralizaçãoÁudio EspacialTestes subjetivosProcessamento de sinaisEngenharia elétricaProcessamento de sinaisRadiodifusãoTelecomunicaçõesLoudness control for brodcasting is a common and legally required practice since the International Telecommunication Union (ITU) Recommendation ITUR BS.1770 for objective measurements in multichannel audio. Recommendations and regulations based on the ITU-R algorithm have been published worldwide, including Brazil. There is scope for improving national regulations in light of recent contributions to the field, and also for adapting the ITU-R model to measurements in advanced audio systems. This work pursues these two goals by testing the parameters of the Brazilian standard with a real-time loudness controller using short-form descriptors and by developing a new objective measurement model adapted to the new spatial audio formats. The proposed method performed well compared to other loudness models, although it was purely signal processing based and its readings were not very close to subject responses. The potential benefits of a more perceptually motivated model led to a PhD placement in the Institute of Sound Recording at the University of Surrey (UK), where listening tests were conducted to assess positional parameters of distance, azimuth and elevation, whose results served as a basis for deriving gain correction curves and a new directional weighting for the ITU-R model. General results point to advancements in the regulatory and standardization fronts, either by the elaboration of a strategy to improve the Brazilian standard of loudness, or by comparing this new prediction method with the critical fortune of loudness models through measurements on audio content for multichannel reproduction systems. The developed model resulted in the best trade-off between prediction errors (RMSE), correlation between predictions and subject responses, and mean run time.O controle da intensidade percebida de áudio (loudness) na radiodifusão é prática comum e legalmente exigida desde a publicação da Recomendação ITUR BS.1770, da União Internacional de Telecomunicações (ITU), para medição objetiva de loudness em áudio multicanal. Recomendações e regulamentos regionais foram publicados com base no algoritmo ITU-R, inclusive no Brasil. Isto posto, há oportunidades tanto de melhoria da regulamentação nacional à luz das contribuições mais recentes na área, quanto de aprimoramento do modelo ITU-R para medidas em sistemas avançados de áudio espacial. Este trabalho persegue estes dois objetivos ao testar os parâmetros da norma nacional com um controlador de intensidade percebida em tempo real usando descritores de loudness voltados para conteúdo de curta duração, além de procurar contribuir com as discussões sobre o tema no âmbito do ITU-R com o desenvolvimento de um modelo de medição objetiva adaptado aos novos formatos de áudio espacial. Este teve um desempenho satisfatório em comparação com outros modelos, embora fosse puramente uma solução de processamento de sinais e suas leituras não se assemelhassem tanto aos resultados subjetivos. Buscando benefícios potenciais de um modelo mais orientado à percepção, realizou-se testes de escuta para avaliação dos parâmetros posicionais de distância, azimute e elevação, cujos resultados serviram de base para a obtenção de curvas de correção de ganho e nova ponderação direcional para o modelo ITU-R. Os resultados gerais apontam para avanços tanto na frente regulatória quanto na de padronização, seja pela elaboração de uma estratégia de melhorias propostas para a norma brasileira de intensidade percebida, seja pela comparação deste novo algoritmo de predição com a fortuna crítica de modelos de loudness por meio de medições realizadas em conteúdo para sistemas de reprodução de áudio espacial multicanal. O modelo desenvolvido obteve a melhor relação de compromisso entre erros de predição (RMSE), correlação das estimações com os resultados dos testes subjetivos, e tempo médio de execução.Universidade Federal de Minas GeraisBrasilENG - DEPARTAMENTO DE ENGENHARIA ELÉTRICAPrograma de Pós-Graduação em Engenharia ElétricaUFMGMaurílio Nunes Vieirahttp://lattes.cnpq.br/1636687509748198Hani Camille YehiaMaurílio Nunes VieiraHani Camille YehiaAdriano Vilela BarbosaWallace do Couto BoaventuraLuiz Wagner Pereira BiscainhoLeandro da Silva Pires2019-10-10T17:42:40Z2019-10-10T17:42:40Z2019-06-27info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesisapplication/pdfhttp://hdl.handle.net/1843/30272porinfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da UFMGinstname:Universidade Federal de Minas Gerais (UFMG)instacron:UFMG2019-11-14T15:46:00Zoai:repositorio.ufmg.br:1843/30272Repositório InstitucionalPUBhttps://repositorio.ufmg.br/oairepositorio@ufmg.bropendoar:2019-11-14T15:46Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)false
dc.title.none.fl_str_mv	Predição de intensidade sonora percebida (loudness ) para áudio espacial
title	Predição de intensidade sonora percebida (loudness ) para áudio espacial
spellingShingle	Predição de intensidade sonora percebida (loudness ) para áudio espacial Leandro da Silva Pires Loudness Radiodifusão Auralização Áudio Espacial Testes subjetivos Processamento de sinais Engenharia elétrica Processamento de sinais Radiodifusão Telecomunicações
title_short	Predição de intensidade sonora percebida (loudness ) para áudio espacial
title_full	Predição de intensidade sonora percebida (loudness ) para áudio espacial
title_fullStr	Predição de intensidade sonora percebida (loudness ) para áudio espacial
title_full_unstemmed	Predição de intensidade sonora percebida (loudness ) para áudio espacial
title_sort	Predição de intensidade sonora percebida (loudness ) para áudio espacial
author	Leandro da Silva Pires
author_facet	Leandro da Silva Pires
author_role	author
dc.contributor.none.fl_str_mv	Maurílio Nunes Vieira http://lattes.cnpq.br/1636687509748198 Hani Camille Yehia Maurílio Nunes Vieira Hani Camille Yehia Adriano Vilela Barbosa Wallace do Couto Boaventura Luiz Wagner Pereira Biscainho
dc.contributor.author.fl_str_mv	Leandro da Silva Pires
dc.subject.por.fl_str_mv	Loudness Radiodifusão Auralização Áudio Espacial Testes subjetivos Processamento de sinais Engenharia elétrica Processamento de sinais Radiodifusão Telecomunicações
topic	Loudness Radiodifusão Auralização Áudio Espacial Testes subjetivos Processamento de sinais Engenharia elétrica Processamento de sinais Radiodifusão Telecomunicações
description	Loudness control for brodcasting is a common and legally required practice since the International Telecommunication Union (ITU) Recommendation ITUR BS.1770 for objective measurements in multichannel audio. Recommendations and regulations based on the ITU-R algorithm have been published worldwide, including Brazil. There is scope for improving national regulations in light of recent contributions to the field, and also for adapting the ITU-R model to measurements in advanced audio systems. This work pursues these two goals by testing the parameters of the Brazilian standard with a real-time loudness controller using short-form descriptors and by developing a new objective measurement model adapted to the new spatial audio formats. The proposed method performed well compared to other loudness models, although it was purely signal processing based and its readings were not very close to subject responses. The potential benefits of a more perceptually motivated model led to a PhD placement in the Institute of Sound Recording at the University of Surrey (UK), where listening tests were conducted to assess positional parameters of distance, azimuth and elevation, whose results served as a basis for deriving gain correction curves and a new directional weighting for the ITU-R model. General results point to advancements in the regulatory and standardization fronts, either by the elaboration of a strategy to improve the Brazilian standard of loudness, or by comparing this new prediction method with the critical fortune of loudness models through measurements on audio content for multichannel reproduction systems. The developed model resulted in the best trade-off between prediction errors (RMSE*), correlation between predictions and subject responses, and mean run time.
publishDate	2019
dc.date.none.fl_str_mv	2019-10-10T17:42:40Z 2019-10-10T17:42:40Z 2019-06-27
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/doctoralThesis
format	doctoralThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://hdl.handle.net/1843/30272
url	http://hdl.handle.net/1843/30272
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Universidade Federal de Minas Gerais Brasil ENG - DEPARTAMENTO DE ENGENHARIA ELÉTRICA Programa de Pós-Graduação em Engenharia Elétrica UFMG
publisher.none.fl_str_mv	Universidade Federal de Minas Gerais Brasil ENG - DEPARTAMENTO DE ENGENHARIA ELÉTRICA Programa de Pós-Graduação em Engenharia Elétrica UFMG
dc.source.none.fl_str_mv	reponame:Repositório Institucional da UFMG instname:Universidade Federal de Minas Gerais (UFMG) instacron:UFMG
instname_str	Universidade Federal de Minas Gerais (UFMG)
instacron_str	UFMG
institution	UFMG
reponame_str	Repositório Institucional da UFMG
collection	Repositório Institucional da UFMG
repository.name.fl_str_mv	Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)
repository.mail.fl_str_mv	repositorio@ufmg.br
_version_	1828928750442512384

Predição de intensidade sonora percebida (loudness ) para áudio espacial

Registros relacionados