Novel Time-Frequency Based Scheme for Detecting Sound Events from Sound Background in Audio Segments
Autor(a) principal: | |
---|---|
Data de Publicação: | 2022 |
Outros Autores: | , , , |
Tipo de documento: | Livro |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | https://hdl.handle.net/10216/140823 |
Resumo: | Usually, Sound event detection systems that classify different events from sound data have two main blocks. In the first block, sound events are separated from sound background and in next block, different events are classified. In recent years, this research area has become increasingly popular in a wide range of applications, such as in surveillance and city patterns learning and recognition, mainly when combined with imaging sensors. However, it still poses challenging problems due to existent noise, complexity of the events, poor microphone(s) quality, bad microphone location(s), or events occurring simultaneously. This research aimed to compare accurate signal processing and classification methods to suggest a novel method for detecting sound events from sound background in urban scenes. Using wavelet and Mel-frequency cepstral coefficients, the analysis of the effect of classification methods and minimization of the number of train data are some of the advantages of the proposed method. The proposed methods' application to a standard sounds database led to an accuracy of about 99% in event detection. (c) 2021, Springer Nature Switzerland AG. |
id |
RCAP_059f4f90c3560d9582c860de2855f2be |
---|---|
oai_identifier_str |
oai:repositorio-aberto.up.pt:10216/140823 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Novel Time-Frequency Based Scheme for Detecting Sound Events from Sound Background in Audio SegmentsCiências Tecnológicas, Ciências da engenharia e tecnologiasTechnological sciences, Engineering and technologyUsually, Sound event detection systems that classify different events from sound data have two main blocks. In the first block, sound events are separated from sound background and in next block, different events are classified. In recent years, this research area has become increasingly popular in a wide range of applications, such as in surveillance and city patterns learning and recognition, mainly when combined with imaging sensors. However, it still poses challenging problems due to existent noise, complexity of the events, poor microphone(s) quality, bad microphone location(s), or events occurring simultaneously. This research aimed to compare accurate signal processing and classification methods to suggest a novel method for detecting sound events from sound background in urban scenes. Using wavelet and Mel-frequency cepstral coefficients, the analysis of the effect of classification methods and minimization of the number of train data are some of the advantages of the proposed method. The proposed methods' application to a standard sounds database led to an accuracy of about 99% in event detection. (c) 2021, Springer Nature Switzerland AG.2022-052022-05-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/bookapplication/pdfhttps://hdl.handle.net/10216/140823eng10.1007/978-3-030-93420-0_38Vahid HajihashemiAbdorreza AlavigharahbaghHugo S. OliveiraPedro CruzJoão Manuel R. S. Tavaresinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-29T14:47:02Zoai:repositorio-aberto.up.pt:10216/140823Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T00:08:22.617011Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Novel Time-Frequency Based Scheme for Detecting Sound Events from Sound Background in Audio Segments |
title |
Novel Time-Frequency Based Scheme for Detecting Sound Events from Sound Background in Audio Segments |
spellingShingle |
Novel Time-Frequency Based Scheme for Detecting Sound Events from Sound Background in Audio Segments Vahid Hajihashemi Ciências Tecnológicas, Ciências da engenharia e tecnologias Technological sciences, Engineering and technology |
title_short |
Novel Time-Frequency Based Scheme for Detecting Sound Events from Sound Background in Audio Segments |
title_full |
Novel Time-Frequency Based Scheme for Detecting Sound Events from Sound Background in Audio Segments |
title_fullStr |
Novel Time-Frequency Based Scheme for Detecting Sound Events from Sound Background in Audio Segments |
title_full_unstemmed |
Novel Time-Frequency Based Scheme for Detecting Sound Events from Sound Background in Audio Segments |
title_sort |
Novel Time-Frequency Based Scheme for Detecting Sound Events from Sound Background in Audio Segments |
author |
Vahid Hajihashemi |
author_facet |
Vahid Hajihashemi Abdorreza Alavigharahbagh Hugo S. Oliveira Pedro Cruz João Manuel R. S. Tavares |
author_role |
author |
author2 |
Abdorreza Alavigharahbagh Hugo S. Oliveira Pedro Cruz João Manuel R. S. Tavares |
author2_role |
author author author author |
dc.contributor.author.fl_str_mv |
Vahid Hajihashemi Abdorreza Alavigharahbagh Hugo S. Oliveira Pedro Cruz João Manuel R. S. Tavares |
dc.subject.por.fl_str_mv |
Ciências Tecnológicas, Ciências da engenharia e tecnologias Technological sciences, Engineering and technology |
topic |
Ciências Tecnológicas, Ciências da engenharia e tecnologias Technological sciences, Engineering and technology |
description |
Usually, Sound event detection systems that classify different events from sound data have two main blocks. In the first block, sound events are separated from sound background and in next block, different events are classified. In recent years, this research area has become increasingly popular in a wide range of applications, such as in surveillance and city patterns learning and recognition, mainly when combined with imaging sensors. However, it still poses challenging problems due to existent noise, complexity of the events, poor microphone(s) quality, bad microphone location(s), or events occurring simultaneously. This research aimed to compare accurate signal processing and classification methods to suggest a novel method for detecting sound events from sound background in urban scenes. Using wavelet and Mel-frequency cepstral coefficients, the analysis of the effect of classification methods and minimization of the number of train data are some of the advantages of the proposed method. The proposed methods' application to a standard sounds database led to an accuracy of about 99% in event detection. (c) 2021, Springer Nature Switzerland AG. |
publishDate |
2022 |
dc.date.none.fl_str_mv |
2022-05 2022-05-01T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/book |
format |
book |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://hdl.handle.net/10216/140823 |
url |
https://hdl.handle.net/10216/140823 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
10.1007/978-3-030-93420-0_38 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799136009391177728 |