Computer Sound Transformations Guided by Perceptually Motivated Features
Autor(a) principal: | |
---|---|
Data de Publicação: | 2018 |
Tipo de documento: | Dissertação |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | https://hdl.handle.net/10216/111170 |
Resumo: | In a time where technology is part of our everyday life, sound and music is coming to us, more and more, in digital format. But this vast amount of digital audio available demands a deeper understanding of audio signals, in particular how algorithms are formulated so that the information of the audio data can be extracted automatically. A big challenge when developing an audio information retrieval system is the identification of appropriate content-based features to represent the audio signal. The most common approach to represent the audio in such system is using audio descriptors, which measures properties of audio signal content and wrap audio features to sets of values. This descriptors can be divided in three levels: low, medium and high. This levels will be detailed later. This dissertation starts by a study of the state-of-the-art on Music Information Retrieval and systems used to get relevant information from audio files. Within this scope, I surveyed two models found in the literature to describe mathematically the warmth of musical audio. After revising the latter sound warmth metric of the studies I aim to provide a better mathematical model to the descriptor finding the constant that better define a linear correlation between user judgments of sound warmth and the two low-level sound descriptors proposed in the studies found. In sum, this dissertation proposes a (one-knob) audio effect which allows users to transform the warmth of a sound in real-time. |
id |
RCAP_acc095717d242efadc156b171a963693 |
---|---|
oai_identifier_str |
oai:repositorio-aberto.up.pt:10216/111170 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Computer Sound Transformations Guided by Perceptually Motivated FeaturesEngenharia electrotécnica, electrónica e informáticaElectrical engineering, Electronic engineering, Information engineeringIn a time where technology is part of our everyday life, sound and music is coming to us, more and more, in digital format. But this vast amount of digital audio available demands a deeper understanding of audio signals, in particular how algorithms are formulated so that the information of the audio data can be extracted automatically. A big challenge when developing an audio information retrieval system is the identification of appropriate content-based features to represent the audio signal. The most common approach to represent the audio in such system is using audio descriptors, which measures properties of audio signal content and wrap audio features to sets of values. This descriptors can be divided in three levels: low, medium and high. This levels will be detailed later. This dissertation starts by a study of the state-of-the-art on Music Information Retrieval and systems used to get relevant information from audio files. Within this scope, I surveyed two models found in the literature to describe mathematically the warmth of musical audio. After revising the latter sound warmth metric of the studies I aim to provide a better mathematical model to the descriptor finding the constant that better define a linear correlation between user judgments of sound warmth and the two low-level sound descriptors proposed in the studies found. In sum, this dissertation proposes a (one-knob) audio effect which allows users to transform the warmth of a sound in real-time.2018-02-052018-02-05T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://hdl.handle.net/10216/111170TID:202117944engNuno Figueiredo Piresinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-29T15:21:00Zoai:repositorio-aberto.up.pt:10216/111170Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T00:21:21.731223Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Computer Sound Transformations Guided by Perceptually Motivated Features |
title |
Computer Sound Transformations Guided by Perceptually Motivated Features |
spellingShingle |
Computer Sound Transformations Guided by Perceptually Motivated Features Nuno Figueiredo Pires Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
title_short |
Computer Sound Transformations Guided by Perceptually Motivated Features |
title_full |
Computer Sound Transformations Guided by Perceptually Motivated Features |
title_fullStr |
Computer Sound Transformations Guided by Perceptually Motivated Features |
title_full_unstemmed |
Computer Sound Transformations Guided by Perceptually Motivated Features |
title_sort |
Computer Sound Transformations Guided by Perceptually Motivated Features |
author |
Nuno Figueiredo Pires |
author_facet |
Nuno Figueiredo Pires |
author_role |
author |
dc.contributor.author.fl_str_mv |
Nuno Figueiredo Pires |
dc.subject.por.fl_str_mv |
Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
topic |
Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
description |
In a time where technology is part of our everyday life, sound and music is coming to us, more and more, in digital format. But this vast amount of digital audio available demands a deeper understanding of audio signals, in particular how algorithms are formulated so that the information of the audio data can be extracted automatically. A big challenge when developing an audio information retrieval system is the identification of appropriate content-based features to represent the audio signal. The most common approach to represent the audio in such system is using audio descriptors, which measures properties of audio signal content and wrap audio features to sets of values. This descriptors can be divided in three levels: low, medium and high. This levels will be detailed later. This dissertation starts by a study of the state-of-the-art on Music Information Retrieval and systems used to get relevant information from audio files. Within this scope, I surveyed two models found in the literature to describe mathematically the warmth of musical audio. After revising the latter sound warmth metric of the studies I aim to provide a better mathematical model to the descriptor finding the constant that better define a linear correlation between user judgments of sound warmth and the two low-level sound descriptors proposed in the studies found. In sum, this dissertation proposes a (one-knob) audio effect which allows users to transform the warmth of a sound in real-time. |
publishDate |
2018 |
dc.date.none.fl_str_mv |
2018-02-05 2018-02-05T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://hdl.handle.net/10216/111170 TID:202117944 |
url |
https://hdl.handle.net/10216/111170 |
identifier_str_mv |
TID:202117944 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799136129810694144 |