An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection

Contreras, Rodrigo Colnago [UNESP]; Viana, Monique Simplicio; Fonseca, Everthon Silva; dos Santos, Francisco Lledo; Zanin, Rodrigo Bruno; Guido, Rodrigo Capobianco [UNESP]

An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection

Detalhes bibliográficos
Autor(a) principal:	Contreras, Rodrigo Colnago [UNESP]
Data de Publicação:	2023
Outros Autores:	Viana, Monique Simplicio, Fonseca, Everthon Silva, dos Santos, Francisco Lledo, Zanin, Rodrigo Bruno, Guido, Rodrigo Capobianco [UNESP]
Tipo de documento:	Artigo
Idioma:	eng
Título da fonte:	Repositório Institucional da UNESP
Texto Completo:	http://dx.doi.org/10.3390/s23115196 http://hdl.handle.net/11449/250048
Resumo:	Biometrics-based authentication has become the most well-established form of user recognition in systems that demand a certain level of security. For example, the most commonplace social activities stand out, such as access to the work environment or to one’s own bank account. Among all biometrics, voice receives special attention due to factors such as ease of collection, the low cost of reading devices, and the high quantity of literature and software packages available for use. However, these biometrics may have the ability to represent the individual impaired by the phenomenon known as dysphonia, which consists of a change in the sound signal due to some disease that acts on the vocal apparatus. As a consequence, for example, a user with the flu may not be properly authenticated by the recognition system. Therefore, it is important that automatic voice dysphonia detection techniques be developed. In this work, we propose a new framework based on the representation of the voice signal by the multiple projection of cepstral coefficients to promote the detection of dysphonic alterations in the voice through machine learning techniques. Most of the best-known cepstral coefficient extraction techniques in the literature are mapped and analyzed separately and together with measures related to the fundamental frequency of the voice signal, and its representation capacity is evaluated on three classifiers. Finally, the experiments on a subset of the Saarbruecken Voice Database prove the effectiveness of the proposed material in detecting the presence of dysphonia in the voice.

Metadados do item

id	UNSP_2ce9f55e86403f8d9c0515a19f84a739
oai_identifier_str	oai:repositorio.unesp.br:11449/250048
network_acronym_str	UNSP
network_name_str	Repositório Institucional da UNESP
repository_id_str	2946
spelling	An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detectioncepstral analysisdysphonia detectionmachine learningpattern recognitionvoice disorder detectionBiometrics-based authentication has become the most well-established form of user recognition in systems that demand a certain level of security. For example, the most commonplace social activities stand out, such as access to the work environment or to one’s own bank account. Among all biometrics, voice receives special attention due to factors such as ease of collection, the low cost of reading devices, and the high quantity of literature and software packages available for use. However, these biometrics may have the ability to represent the individual impaired by the phenomenon known as dysphonia, which consists of a change in the sound signal due to some disease that acts on the vocal apparatus. As a consequence, for example, a user with the flu may not be properly authenticated by the recognition system. Therefore, it is important that automatic voice dysphonia detection techniques be developed. In this work, we propose a new framework based on the representation of the voice signal by the multiple projection of cepstral coefficients to promote the detection of dysphonic alterations in the voice through machine learning techniques. Most of the best-known cepstral coefficient extraction techniques in the literature are mapped and analyzed separately and together with measures related to the fundamental frequency of the voice signal, and its representation capacity is evaluated on three classifiers. Finally, the experiments on a subset of the Saarbruecken Voice Database prove the effectiveness of the proposed material in detecting the presence of dysphonia in the voice.Department of Computer Science and Statistics Institute of Biosciences Letters and Exact Sciences São Paulo State University, SPFederal Institute of São Paulo, SPFaculty of Architecture and Engineering Mato Grosso State University, MTDepartment of Computer Science and Statistics Institute of Biosciences Letters and Exact Sciences São Paulo State University, SPUniversidade Estadual Paulista (UNESP)Federal Institute of São PauloMato Grosso State UniversityContreras, Rodrigo Colnago [UNESP]Viana, Monique SimplicioFonseca, Everthon Silvados Santos, Francisco LledoZanin, Rodrigo BrunoGuido, Rodrigo Capobianco [UNESP]2023-07-29T16:16:17Z2023-07-29T16:16:17Z2023-06-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://dx.doi.org/10.3390/s23115196Sensors, v. 23, n. 11, 2023.1424-8220http://hdl.handle.net/11449/25004810.3390/s231151962-s2.0-85161510694Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengSensorsinfo:eu-repo/semantics/openAccess2023-07-29T16:16:17Zoai:repositorio.unesp.br:11449/250048Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462023-07-29T16:16:17Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv	An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection
title	An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection
spellingShingle	An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection Contreras, Rodrigo Colnago [UNESP] cepstral analysis dysphonia detection machine learning pattern recognition voice disorder detection
title_short	An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection
title_full	An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection
title_fullStr	An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection
title_full_unstemmed	An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection
title_sort	An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection
author	Contreras, Rodrigo Colnago [UNESP]
author_facet	Contreras, Rodrigo Colnago [UNESP] Viana, Monique Simplicio Fonseca, Everthon Silva dos Santos, Francisco Lledo Zanin, Rodrigo Bruno Guido, Rodrigo Capobianco [UNESP]
author_role	author
author2	Viana, Monique Simplicio Fonseca, Everthon Silva dos Santos, Francisco Lledo Zanin, Rodrigo Bruno Guido, Rodrigo Capobianco [UNESP]
author2_role	author author author author author
dc.contributor.none.fl_str_mv	Universidade Estadual Paulista (UNESP) Federal Institute of São Paulo Mato Grosso State University
dc.contributor.author.fl_str_mv	Contreras, Rodrigo Colnago [UNESP] Viana, Monique Simplicio Fonseca, Everthon Silva dos Santos, Francisco Lledo Zanin, Rodrigo Bruno Guido, Rodrigo Capobianco [UNESP]
dc.subject.por.fl_str_mv	cepstral analysis dysphonia detection machine learning pattern recognition voice disorder detection
topic	cepstral analysis dysphonia detection machine learning pattern recognition voice disorder detection
description	Biometrics-based authentication has become the most well-established form of user recognition in systems that demand a certain level of security. For example, the most commonplace social activities stand out, such as access to the work environment or to one’s own bank account. Among all biometrics, voice receives special attention due to factors such as ease of collection, the low cost of reading devices, and the high quantity of literature and software packages available for use. However, these biometrics may have the ability to represent the individual impaired by the phenomenon known as dysphonia, which consists of a change in the sound signal due to some disease that acts on the vocal apparatus. As a consequence, for example, a user with the flu may not be properly authenticated by the recognition system. Therefore, it is important that automatic voice dysphonia detection techniques be developed. In this work, we propose a new framework based on the representation of the voice signal by the multiple projection of cepstral coefficients to promote the detection of dysphonic alterations in the voice through machine learning techniques. Most of the best-known cepstral coefficient extraction techniques in the literature are mapped and analyzed separately and together with measures related to the fundamental frequency of the voice signal, and its representation capacity is evaluated on three classifiers. Finally, the experiments on a subset of the Saarbruecken Voice Database prove the effectiveness of the proposed material in detecting the presence of dysphonia in the voice.
publishDate	2023
dc.date.none.fl_str_mv	2023-07-29T16:16:17Z 2023-07-29T16:16:17Z 2023-06-01
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/article
format	article
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://dx.doi.org/10.3390/s23115196 Sensors, v. 23, n. 11, 2023. 1424-8220 http://hdl.handle.net/11449/250048 10.3390/s23115196 2-s2.0-85161510694
url	http://dx.doi.org/10.3390/s23115196 http://hdl.handle.net/11449/250048
identifier_str_mv	Sensors, v. 23, n. 11, 2023. 1424-8220 10.3390/s23115196 2-s2.0-85161510694
dc.language.iso.fl_str_mv	eng
language	eng
dc.relation.none.fl_str_mv	Sensors
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.source.none.fl_str_mv	Scopus reponame:Repositório Institucional da UNESP instname:Universidade Estadual Paulista (UNESP) instacron:UNESP
instname_str	Universidade Estadual Paulista (UNESP)
instacron_str	UNESP
institution	UNESP
reponame_str	Repositório Institucional da UNESP
collection	Repositório Institucional da UNESP
repository.name.fl_str_mv	Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv
_version_	1799965478692585472

An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection

Registros relacionados