Computer Sound Transformations Guided by Perceptually Motivated Features

Bibliographic details
Main author: Nuno Figueiredo Pires
Publication date: 2018
Document type: Dissertation
Language: eng
Source title: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Full text: https://hdl.handle.net/10216/111170
Abstract: At a time when technology is part of our everyday life, sound and music reach us, more and more, in digital format. This vast amount of available digital audio demands a deeper understanding of audio signals, in particular of how algorithms are formulated so that information can be extracted from audio data automatically. A major challenge when developing an audio information retrieval system is identifying appropriate content-based features to represent the audio signal. The most common approach to representing audio in such systems is to use audio descriptors, which measure properties of the audio signal content and map audio features to sets of values. These descriptors can be divided into three levels (low, medium and high), which are detailed later in the dissertation. This dissertation starts with a study of the state of the art in Music Information Retrieval and of systems used to extract relevant information from audio files. Within this scope, I surveyed two models found in the literature that describe the warmth of musical audio mathematically. After revising the latter of these sound warmth metrics, I aim to provide a better mathematical model for the descriptor by finding the constant that best defines a linear correlation between user judgments of sound warmth and the two low-level sound descriptors proposed in those studies. In sum, this dissertation proposes a one-knob audio effect that allows users to transform the warmth of a sound in real time.
id RCAP_acc095717d242efadc156b171a963693
oai_identifier_str oai:repositorio-aberto.up.pt:10216/111170
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Computer Sound Transformations Guided by Perceptually Motivated Features | Engenharia electrotécnica, electrónica e informática | Electrical engineering, Electronic engineering, Information engineering | 2018-02-05 | 2018-02-05T00:00:00Z | info:eu-repo/semantics/publishedVersion | info:eu-repo/semantics/masterThesis | application/pdf | https://hdl.handle.net/10216/111170 | TID:202117944 | eng | Nuno Figueiredo Pires | info:eu-repo/semantics/openAccess | reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) | instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação | instacron:RCAAP | 2023-11-29T15:21:00Z | oai:repositorio-aberto.up.pt:10216/111170 | Portal Agregador | ONG | https://www.rcaap.pt/oai/openaire | opendoar:7160 | 2024-03-20T00:21:21.731223 | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação | false
dc.title.none.fl_str_mv Computer Sound Transformations Guided by Perceptually Motivated Features
author Nuno Figueiredo Pires
author_facet Nuno Figueiredo Pires
author_role author
dc.contributor.author.fl_str_mv Nuno Figueiredo Pires
dc.subject.por.fl_str_mv Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
topic Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
publishDate 2018
dc.date.none.fl_str_mv 2018-02-05
2018-02-05T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://hdl.handle.net/10216/111170
TID:202117944
url https://hdl.handle.net/10216/111170
identifier_str_mv TID:202117944
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799136129810694144