Regression-Based Noise Modeling for Speech Signal Processing

Detalhes bibliográficos
Autor(a) principal: Abreu, Caio Cesar Enside de
Data de Publicação: 2021
Outros Autores: Duarte, Marco Aparecido Queiroz, Oliveira, Bruno Rodrigues de [UNESP], Filho, Jozue Vieira [UNESP], Villarreal, Francisco [UNESP]
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da UNESP
Texto Completo: http://dx.doi.org/10.1142/S021947752150022X
http://hdl.handle.net/11449/208384
Resumo: Speech processing systems are very important in different applications involving speech and voice quality such as automatic speech recognition, forensic phonetics and speech enhancement, among others. In most of them, the acoustic environmental noise is added to the original signal, decreasing the signal-to-noise ratio (SNR) and the speech quality by consequence. Therefore, estimating noise is one of the most important steps in speech processing whether to reduce it before processing or to design robust algorithms. In this paper, a new approach to estimate noise from speech signals is presented and its effectiveness is tested in the speech enhancement context. For this purpose, partial least squares (PLS) regression is used to model the acoustic environment (AE) and a Wiener filter based on a priori SNR estimation is implemented to evaluate the proposed approach. Six noise types are used to create seven acoustically modeled noises. The basic idea is to consider the AE model to identify the noise type and estimate its power to be used in a speech processing system. Speech signals processed using the proposed method and classical noise estimators are evaluated through objective measures. Results show that the proposed method produces better speech quality than state-of-the-art noise estimators, enabling it to be used in real-time applications in the field of robotic, telecommunications and acoustic analysis.
id UNSP_d654cc8559753d41e11a7a1156d612fe
oai_identifier_str oai:repositorio.unesp.br:11449/208384
network_acronym_str UNSP
network_name_str Repositório Institucional da UNESP
repository_id_str 2946
spelling Regression-Based Noise Modeling for Speech Signal Processingnoise classificationnoise estimationpartial least squaresSpeech processingSpeech processing systems are very important in different applications involving speech and voice quality such as automatic speech recognition, forensic phonetics and speech enhancement, among others. In most of them, the acoustic environmental noise is added to the original signal, decreasing the signal-to-noise ratio (SNR) and the speech quality by consequence. Therefore, estimating noise is one of the most important steps in speech processing whether to reduce it before processing or to design robust algorithms. In this paper, a new approach to estimate noise from speech signals is presented and its effectiveness is tested in the speech enhancement context. For this purpose, partial least squares (PLS) regression is used to model the acoustic environment (AE) and a Wiener filter based on a priori SNR estimation is implemented to evaluate the proposed approach. Six noise types are used to create seven acoustically modeled noises. The basic idea is to consider the AE model to identify the noise type and estimate its power to be used in a speech processing system. Speech signals processed using the proposed method and classical noise estimators are evaluated through objective measures. Results show that the proposed method produces better speech quality than state-of-the-art noise estimators, enabling it to be used in real-time applications in the field of robotic, telecommunications and acoustic analysis.Department of Computing UNEMAT - Mato Grosso State UniversityDepartment of Mathematics UEMS - Mato Grosso do Sul State UniversityDepartment of Electrical Engineering UNESP - São Paulo State UniversityTelecommunications and Aeronautical Engineering UNESP - São Paulo State UniversityDepartment of Mathematics UNESP - São Paulo State UniversityDepartment of Electrical Engineering UNESP - São Paulo State UniversityTelecommunications and Aeronautical Engineering UNESP - São Paulo State UniversityDepartment of Mathematics UNESP - São Paulo State UniversityUNEMAT - Mato Grosso State UniversityUniversidade Estadual de Mato Grosso do Sul (UEMS)Universidade Estadual Paulista (Unesp)Abreu, Caio Cesar Enside deDuarte, Marco Aparecido QueirozOliveira, Bruno Rodrigues de [UNESP]Filho, Jozue Vieira [UNESP]Villarreal, Francisco [UNESP]2021-06-25T11:11:14Z2021-06-25T11:11:14Z2021-01-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://dx.doi.org/10.1142/S021947752150022XFluctuation and Noise Letters.0219-4775http://hdl.handle.net/11449/20838410.1142/S021947752150022X2-s2.0-85100534938Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengFluctuation and Noise Lettersinfo:eu-repo/semantics/openAccess2021-10-23T19:02:09Zoai:repositorio.unesp.br:11449/208384Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462021-10-23T19:02:09Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv Regression-Based Noise Modeling for Speech Signal Processing
title Regression-Based Noise Modeling for Speech Signal Processing
spellingShingle Regression-Based Noise Modeling for Speech Signal Processing
Abreu, Caio Cesar Enside de
noise classification
noise estimation
partial least squares
Speech processing
title_short Regression-Based Noise Modeling for Speech Signal Processing
title_full Regression-Based Noise Modeling for Speech Signal Processing
title_fullStr Regression-Based Noise Modeling for Speech Signal Processing
title_full_unstemmed Regression-Based Noise Modeling for Speech Signal Processing
title_sort Regression-Based Noise Modeling for Speech Signal Processing
author Abreu, Caio Cesar Enside de
author_facet Abreu, Caio Cesar Enside de
Duarte, Marco Aparecido Queiroz
Oliveira, Bruno Rodrigues de [UNESP]
Filho, Jozue Vieira [UNESP]
Villarreal, Francisco [UNESP]
author_role author
author2 Duarte, Marco Aparecido Queiroz
Oliveira, Bruno Rodrigues de [UNESP]
Filho, Jozue Vieira [UNESP]
Villarreal, Francisco [UNESP]
author2_role author
author
author
author
dc.contributor.none.fl_str_mv UNEMAT - Mato Grosso State University
Universidade Estadual de Mato Grosso do Sul (UEMS)
Universidade Estadual Paulista (Unesp)
dc.contributor.author.fl_str_mv Abreu, Caio Cesar Enside de
Duarte, Marco Aparecido Queiroz
Oliveira, Bruno Rodrigues de [UNESP]
Filho, Jozue Vieira [UNESP]
Villarreal, Francisco [UNESP]
dc.subject.por.fl_str_mv noise classification
noise estimation
partial least squares
Speech processing
topic noise classification
noise estimation
partial least squares
Speech processing
description Speech processing systems are very important in different applications involving speech and voice quality such as automatic speech recognition, forensic phonetics and speech enhancement, among others. In most of them, the acoustic environmental noise is added to the original signal, decreasing the signal-to-noise ratio (SNR) and the speech quality by consequence. Therefore, estimating noise is one of the most important steps in speech processing whether to reduce it before processing or to design robust algorithms. In this paper, a new approach to estimate noise from speech signals is presented and its effectiveness is tested in the speech enhancement context. For this purpose, partial least squares (PLS) regression is used to model the acoustic environment (AE) and a Wiener filter based on a priori SNR estimation is implemented to evaluate the proposed approach. Six noise types are used to create seven acoustically modeled noises. The basic idea is to consider the AE model to identify the noise type and estimate its power to be used in a speech processing system. Speech signals processed using the proposed method and classical noise estimators are evaluated through objective measures. Results show that the proposed method produces better speech quality than state-of-the-art noise estimators, enabling it to be used in real-time applications in the field of robotic, telecommunications and acoustic analysis.
publishDate 2021
dc.date.none.fl_str_mv 2021-06-25T11:11:14Z
2021-06-25T11:11:14Z
2021-01-01
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://dx.doi.org/10.1142/S021947752150022X
Fluctuation and Noise Letters.
0219-4775
http://hdl.handle.net/11449/208384
10.1142/S021947752150022X
2-s2.0-85100534938
url http://dx.doi.org/10.1142/S021947752150022X
http://hdl.handle.net/11449/208384
identifier_str_mv Fluctuation and Noise Letters.
0219-4775
10.1142/S021947752150022X
2-s2.0-85100534938
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Fluctuation and Noise Letters
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv Scopus
reponame:Repositório Institucional da UNESP
instname:Universidade Estadual Paulista (UNESP)
instacron:UNESP
instname_str Universidade Estadual Paulista (UNESP)
instacron_str UNESP
institution UNESP
reponame_str Repositório Institucional da UNESP
collection Repositório Institucional da UNESP
repository.name.fl_str_mv Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv
_version_ 1799964644331225088