Regression-Based Noise Modeling for Speech Signal Processing

Abreu, Caio Cesar Enside de; Duarte, Marco Aparecido Queiroz; Oliveira, Bruno Rodrigues de [UNESP]; Filho, Jozue Vieira [UNESP]; Villarreal, Francisco [UNESP]

Regression-Based Noise Modeling for Speech Signal Processing

Detalhes bibliográficos
Autor(a) principal:	Abreu, Caio Cesar Enside de
Data de Publicação:	2021
Outros Autores:	Duarte, Marco Aparecido Queiroz, Oliveira, Bruno Rodrigues de [UNESP], Filho, Jozue Vieira [UNESP], Villarreal, Francisco [UNESP]
Tipo de documento:	Artigo
Idioma:	eng
Título da fonte:	Repositório Institucional da UNESP
Texto Completo:	http://dx.doi.org/10.1142/S021947752150022X http://hdl.handle.net/11449/208384
Resumo:	Speech processing systems are very important in different applications involving speech and voice quality such as automatic speech recognition, forensic phonetics and speech enhancement, among others. In most of them, the acoustic environmental noise is added to the original signal, decreasing the signal-to-noise ratio (SNR) and the speech quality by consequence. Therefore, estimating noise is one of the most important steps in speech processing whether to reduce it before processing or to design robust algorithms. In this paper, a new approach to estimate noise from speech signals is presented and its effectiveness is tested in the speech enhancement context. For this purpose, partial least squares (PLS) regression is used to model the acoustic environment (AE) and a Wiener filter based on a priori SNR estimation is implemented to evaluate the proposed approach. Six noise types are used to create seven acoustically modeled noises. The basic idea is to consider the AE model to identify the noise type and estimate its power to be used in a speech processing system. Speech signals processed using the proposed method and classical noise estimators are evaluated through objective measures. Results show that the proposed method produces better speech quality than state-of-the-art noise estimators, enabling it to be used in real-time applications in the field of robotic, telecommunications and acoustic analysis.

Metadados do item

id	UNSP_d654cc8559753d41e11a7a1156d612fe
oai_identifier_str	oai:repositorio.unesp.br:11449/208384
network_acronym_str	UNSP
network_name_str	Repositório Institucional da UNESP
repository_id_str	2946
spelling	Regression-Based Noise Modeling for Speech Signal Processingnoise classificationnoise estimationpartial least squaresSpeech processingSpeech processing systems are very important in different applications involving speech and voice quality such as automatic speech recognition, forensic phonetics and speech enhancement, among others. In most of them, the acoustic environmental noise is added to the original signal, decreasing the signal-to-noise ratio (SNR) and the speech quality by consequence. Therefore, estimating noise is one of the most important steps in speech processing whether to reduce it before processing or to design robust algorithms. In this paper, a new approach to estimate noise from speech signals is presented and its effectiveness is tested in the speech enhancement context. For this purpose, partial least squares (PLS) regression is used to model the acoustic environment (AE) and a Wiener filter based on a priori SNR estimation is implemented to evaluate the proposed approach. Six noise types are used to create seven acoustically modeled noises. The basic idea is to consider the AE model to identify the noise type and estimate its power to be used in a speech processing system. Speech signals processed using the proposed method and classical noise estimators are evaluated through objective measures. Results show that the proposed method produces better speech quality than state-of-the-art noise estimators, enabling it to be used in real-time applications in the field of robotic, telecommunications and acoustic analysis.Department of Computing UNEMAT - Mato Grosso State UniversityDepartment of Mathematics UEMS - Mato Grosso do Sul State UniversityDepartment of Electrical Engineering UNESP - São Paulo State UniversityTelecommunications and Aeronautical Engineering UNESP - São Paulo State UniversityDepartment of Mathematics UNESP - São Paulo State UniversityDepartment of Electrical Engineering UNESP - São Paulo State UniversityTelecommunications and Aeronautical Engineering UNESP - São Paulo State UniversityDepartment of Mathematics UNESP - São Paulo State UniversityUNEMAT - Mato Grosso State UniversityUniversidade Estadual de Mato Grosso do Sul (UEMS)Universidade Estadual Paulista (Unesp)Abreu, Caio Cesar Enside deDuarte, Marco Aparecido QueirozOliveira, Bruno Rodrigues de [UNESP]Filho, Jozue Vieira [UNESP]Villarreal, Francisco [UNESP]2021-06-25T11:11:14Z2021-06-25T11:11:14Z2021-01-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://dx.doi.org/10.1142/S021947752150022XFluctuation and Noise Letters.0219-4775http://hdl.handle.net/11449/20838410.1142/S021947752150022X2-s2.0-85100534938Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengFluctuation and Noise Lettersinfo:eu-repo/semantics/openAccess2024-07-10T15:41:37Zoai:repositorio.unesp.br:11449/208384Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T15:31:49.616579Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv	Regression-Based Noise Modeling for Speech Signal Processing
title	Regression-Based Noise Modeling for Speech Signal Processing
spellingShingle	Regression-Based Noise Modeling for Speech Signal Processing Abreu, Caio Cesar Enside de noise classification noise estimation partial least squares Speech processing
title_short	Regression-Based Noise Modeling for Speech Signal Processing
title_full	Regression-Based Noise Modeling for Speech Signal Processing
title_fullStr	Regression-Based Noise Modeling for Speech Signal Processing
title_full_unstemmed	Regression-Based Noise Modeling for Speech Signal Processing
title_sort	Regression-Based Noise Modeling for Speech Signal Processing
author	Abreu, Caio Cesar Enside de
author_facet	Abreu, Caio Cesar Enside de Duarte, Marco Aparecido Queiroz Oliveira, Bruno Rodrigues de [UNESP] Filho, Jozue Vieira [UNESP] Villarreal, Francisco [UNESP]
author_role	author
author2	Duarte, Marco Aparecido Queiroz Oliveira, Bruno Rodrigues de [UNESP] Filho, Jozue Vieira [UNESP] Villarreal, Francisco [UNESP]
author2_role	author author author author
dc.contributor.none.fl_str_mv	UNEMAT - Mato Grosso State University Universidade Estadual de Mato Grosso do Sul (UEMS) Universidade Estadual Paulista (Unesp)
dc.contributor.author.fl_str_mv	Abreu, Caio Cesar Enside de Duarte, Marco Aparecido Queiroz Oliveira, Bruno Rodrigues de [UNESP] Filho, Jozue Vieira [UNESP] Villarreal, Francisco [UNESP]
dc.subject.por.fl_str_mv	noise classification noise estimation partial least squares Speech processing
topic	noise classification noise estimation partial least squares Speech processing
description	Speech processing systems are very important in different applications involving speech and voice quality such as automatic speech recognition, forensic phonetics and speech enhancement, among others. In most of them, the acoustic environmental noise is added to the original signal, decreasing the signal-to-noise ratio (SNR) and the speech quality by consequence. Therefore, estimating noise is one of the most important steps in speech processing whether to reduce it before processing or to design robust algorithms. In this paper, a new approach to estimate noise from speech signals is presented and its effectiveness is tested in the speech enhancement context. For this purpose, partial least squares (PLS) regression is used to model the acoustic environment (AE) and a Wiener filter based on a priori SNR estimation is implemented to evaluate the proposed approach. Six noise types are used to create seven acoustically modeled noises. The basic idea is to consider the AE model to identify the noise type and estimate its power to be used in a speech processing system. Speech signals processed using the proposed method and classical noise estimators are evaluated through objective measures. Results show that the proposed method produces better speech quality than state-of-the-art noise estimators, enabling it to be used in real-time applications in the field of robotic, telecommunications and acoustic analysis.
publishDate	2021
dc.date.none.fl_str_mv	2021-06-25T11:11:14Z 2021-06-25T11:11:14Z 2021-01-01
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/article
format	article
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://dx.doi.org/10.1142/S021947752150022X Fluctuation and Noise Letters. 0219-4775 http://hdl.handle.net/11449/208384 10.1142/S021947752150022X 2-s2.0-85100534938
url	http://dx.doi.org/10.1142/S021947752150022X http://hdl.handle.net/11449/208384
identifier_str_mv	Fluctuation and Noise Letters. 0219-4775 10.1142/S021947752150022X 2-s2.0-85100534938
dc.language.iso.fl_str_mv	eng
language	eng
dc.relation.none.fl_str_mv	Fluctuation and Noise Letters
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.source.none.fl_str_mv	Scopus reponame:Repositório Institucional da UNESP instname:Universidade Estadual Paulista (UNESP) instacron:UNESP
instname_str	Universidade Estadual Paulista (UNESP)
instacron_str	UNESP
institution	UNESP
reponame_str	Repositório Institucional da UNESP
collection	Repositório Institucional da UNESP
repository.name.fl_str_mv	Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv
_version_	1808128527603597312

Regression-Based Noise Modeling for Speech Signal Processing

Registros relacionados