Regression-Based Noise Modeling for Speech Signal Processing
Autor(a) principal: | |
---|---|
Data de Publicação: | 2021 |
Outros Autores: | , , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Institucional da UNESP |
Texto Completo: | http://dx.doi.org/10.1142/S021947752150022X http://hdl.handle.net/11449/208384 |
Resumo: | Speech processing systems are very important in different applications involving speech and voice quality such as automatic speech recognition, forensic phonetics and speech enhancement, among others. In most of them, the acoustic environmental noise is added to the original signal, decreasing the signal-to-noise ratio (SNR) and the speech quality by consequence. Therefore, estimating noise is one of the most important steps in speech processing whether to reduce it before processing or to design robust algorithms. In this paper, a new approach to estimate noise from speech signals is presented and its effectiveness is tested in the speech enhancement context. For this purpose, partial least squares (PLS) regression is used to model the acoustic environment (AE) and a Wiener filter based on a priori SNR estimation is implemented to evaluate the proposed approach. Six noise types are used to create seven acoustically modeled noises. The basic idea is to consider the AE model to identify the noise type and estimate its power to be used in a speech processing system. Speech signals processed using the proposed method and classical noise estimators are evaluated through objective measures. Results show that the proposed method produces better speech quality than state-of-the-art noise estimators, enabling it to be used in real-time applications in the field of robotic, telecommunications and acoustic analysis. |
id |
UNSP_d654cc8559753d41e11a7a1156d612fe |
---|---|
oai_identifier_str |
oai:repositorio.unesp.br:11449/208384 |
network_acronym_str |
UNSP |
network_name_str |
Repositório Institucional da UNESP |
repository_id_str |
2946 |
spelling |
Regression-Based Noise Modeling for Speech Signal Processingnoise classificationnoise estimationpartial least squaresSpeech processingSpeech processing systems are very important in different applications involving speech and voice quality such as automatic speech recognition, forensic phonetics and speech enhancement, among others. In most of them, the acoustic environmental noise is added to the original signal, decreasing the signal-to-noise ratio (SNR) and the speech quality by consequence. Therefore, estimating noise is one of the most important steps in speech processing whether to reduce it before processing or to design robust algorithms. In this paper, a new approach to estimate noise from speech signals is presented and its effectiveness is tested in the speech enhancement context. For this purpose, partial least squares (PLS) regression is used to model the acoustic environment (AE) and a Wiener filter based on a priori SNR estimation is implemented to evaluate the proposed approach. Six noise types are used to create seven acoustically modeled noises. The basic idea is to consider the AE model to identify the noise type and estimate its power to be used in a speech processing system. Speech signals processed using the proposed method and classical noise estimators are evaluated through objective measures. Results show that the proposed method produces better speech quality than state-of-the-art noise estimators, enabling it to be used in real-time applications in the field of robotic, telecommunications and acoustic analysis.Department of Computing UNEMAT - Mato Grosso State UniversityDepartment of Mathematics UEMS - Mato Grosso do Sul State UniversityDepartment of Electrical Engineering UNESP - São Paulo State UniversityTelecommunications and Aeronautical Engineering UNESP - São Paulo State UniversityDepartment of Mathematics UNESP - São Paulo State UniversityDepartment of Electrical Engineering UNESP - São Paulo State UniversityTelecommunications and Aeronautical Engineering UNESP - São Paulo State UniversityDepartment of Mathematics UNESP - São Paulo State UniversityUNEMAT - Mato Grosso State UniversityUniversidade Estadual de Mato Grosso do Sul (UEMS)Universidade Estadual Paulista (Unesp)Abreu, Caio Cesar Enside deDuarte, Marco Aparecido QueirozOliveira, Bruno Rodrigues de [UNESP]Filho, Jozue Vieira [UNESP]Villarreal, Francisco [UNESP]2021-06-25T11:11:14Z2021-06-25T11:11:14Z2021-01-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://dx.doi.org/10.1142/S021947752150022XFluctuation and Noise Letters.0219-4775http://hdl.handle.net/11449/20838410.1142/S021947752150022X2-s2.0-85100534938Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengFluctuation and Noise Lettersinfo:eu-repo/semantics/openAccess2024-07-10T15:41:37Zoai:repositorio.unesp.br:11449/208384Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T15:31:49.616579Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false |
dc.title.none.fl_str_mv |
Regression-Based Noise Modeling for Speech Signal Processing |
title |
Regression-Based Noise Modeling for Speech Signal Processing |
spellingShingle |
Regression-Based Noise Modeling for Speech Signal Processing Abreu, Caio Cesar Enside de noise classification noise estimation partial least squares Speech processing |
title_short |
Regression-Based Noise Modeling for Speech Signal Processing |
title_full |
Regression-Based Noise Modeling for Speech Signal Processing |
title_fullStr |
Regression-Based Noise Modeling for Speech Signal Processing |
title_full_unstemmed |
Regression-Based Noise Modeling for Speech Signal Processing |
title_sort |
Regression-Based Noise Modeling for Speech Signal Processing |
author |
Abreu, Caio Cesar Enside de |
author_facet |
Abreu, Caio Cesar Enside de Duarte, Marco Aparecido Queiroz Oliveira, Bruno Rodrigues de [UNESP] Filho, Jozue Vieira [UNESP] Villarreal, Francisco [UNESP] |
author_role |
author |
author2 |
Duarte, Marco Aparecido Queiroz Oliveira, Bruno Rodrigues de [UNESP] Filho, Jozue Vieira [UNESP] Villarreal, Francisco [UNESP] |
author2_role |
author author author author |
dc.contributor.none.fl_str_mv |
UNEMAT - Mato Grosso State University Universidade Estadual de Mato Grosso do Sul (UEMS) Universidade Estadual Paulista (Unesp) |
dc.contributor.author.fl_str_mv |
Abreu, Caio Cesar Enside de Duarte, Marco Aparecido Queiroz Oliveira, Bruno Rodrigues de [UNESP] Filho, Jozue Vieira [UNESP] Villarreal, Francisco [UNESP] |
dc.subject.por.fl_str_mv |
noise classification noise estimation partial least squares Speech processing |
topic |
noise classification noise estimation partial least squares Speech processing |
description |
Speech processing systems are very important in different applications involving speech and voice quality such as automatic speech recognition, forensic phonetics and speech enhancement, among others. In most of them, the acoustic environmental noise is added to the original signal, decreasing the signal-to-noise ratio (SNR) and the speech quality by consequence. Therefore, estimating noise is one of the most important steps in speech processing whether to reduce it before processing or to design robust algorithms. In this paper, a new approach to estimate noise from speech signals is presented and its effectiveness is tested in the speech enhancement context. For this purpose, partial least squares (PLS) regression is used to model the acoustic environment (AE) and a Wiener filter based on a priori SNR estimation is implemented to evaluate the proposed approach. Six noise types are used to create seven acoustically modeled noises. The basic idea is to consider the AE model to identify the noise type and estimate its power to be used in a speech processing system. Speech signals processed using the proposed method and classical noise estimators are evaluated through objective measures. Results show that the proposed method produces better speech quality than state-of-the-art noise estimators, enabling it to be used in real-time applications in the field of robotic, telecommunications and acoustic analysis. |
publishDate |
2021 |
dc.date.none.fl_str_mv |
2021-06-25T11:11:14Z 2021-06-25T11:11:14Z 2021-01-01 |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://dx.doi.org/10.1142/S021947752150022X Fluctuation and Noise Letters. 0219-4775 http://hdl.handle.net/11449/208384 10.1142/S021947752150022X 2-s2.0-85100534938 |
url |
http://dx.doi.org/10.1142/S021947752150022X http://hdl.handle.net/11449/208384 |
identifier_str_mv |
Fluctuation and Noise Letters. 0219-4775 10.1142/S021947752150022X 2-s2.0-85100534938 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Fluctuation and Noise Letters |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.source.none.fl_str_mv |
Scopus reponame:Repositório Institucional da UNESP instname:Universidade Estadual Paulista (UNESP) instacron:UNESP |
instname_str |
Universidade Estadual Paulista (UNESP) |
instacron_str |
UNESP |
institution |
UNESP |
reponame_str |
Repositório Institucional da UNESP |
collection |
Repositório Institucional da UNESP |
repository.name.fl_str_mv |
Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP) |
repository.mail.fl_str_mv |
|
_version_ |
1808128527603597312 |