On the development of an automatic voice pleasantness classification and intensity estimation system
Autor(a) principal: | |
---|---|
Data de Publicação: | 2013 |
Outros Autores: | , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10400.22/3436 |
Resumo: | In the last few years, the number of systems and devices that use voice based interaction has grown significantly. For a continued use of these systems, the interface must be reliable and pleasant in order to provide an optimal user experience. However there are currently very few studies that try to evaluate how pleasant is a voice from a perceptual point of view when the final application is a speech based interface. In this paper we present an objective definition for voice pleasantness based on the composition of a representative feature subset and a new automatic voice pleasantness classification and intensity estimation system. Our study is based on a database composed by European Portuguese female voices but the methodology can be extended to male voices or to other languages. In the objective performance evaluation the system achieved a 9.1% error rate for voice pleasantness classification and a 15.7% error rate for voice pleasantness intensity estimation. |
id |
RCAP_ee68d28562ae6a5ed9c8da076161b729 |
---|---|
oai_identifier_str |
oai:recipp.ipp.pt:10400.22/3436 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
On the development of an automatic voice pleasantness classification and intensity estimation systemVoice pleasantnessSubtle emotionsPerceptual speech analysisText-to-Speech synthesisIn the last few years, the number of systems and devices that use voice based interaction has grown significantly. For a continued use of these systems, the interface must be reliable and pleasant in order to provide an optimal user experience. However there are currently very few studies that try to evaluate how pleasant is a voice from a perceptual point of view when the final application is a speech based interface. In this paper we present an objective definition for voice pleasantness based on the composition of a representative feature subset and a new automatic voice pleasantness classification and intensity estimation system. Our study is based on a database composed by European Portuguese female voices but the methodology can be extended to male voices or to other languages. In the objective performance evaluation the system achieved a 9.1% error rate for voice pleasantness classification and a 15.7% error rate for voice pleasantness intensity estimation.Work partially supported by ERDF funds, the Spanish Government (TEC2009-14094-C04-04), and Xunta de Galicia (CN2011/019, 2009/062)ElsevierRepositório Científico do Instituto Politécnico do PortoCoelho, LuísBraga, DanielaSales-Dias, MiguelGarcia-Mateo, Carmen2014-01-22T18:48:16Z20132013-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.22/3436engPinto-Coelho, L., Braga, D., Sales-Dias, M. & Garcia-Mateo, C. (2013) On the development of an automatic voice pleasantness classification and intensity estimation system. Computer Speech & Language, 27 (1), 75-88. doi: 10.1016/j.csl.2012.01.0060885-230810.1016/j.csl.2012.01.006info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-03-13T12:43:13Zoai:recipp.ipp.pt:10400.22/3436Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T17:24:25.041202Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
On the development of an automatic voice pleasantness classification and intensity estimation system |
title |
On the development of an automatic voice pleasantness classification and intensity estimation system |
spellingShingle |
On the development of an automatic voice pleasantness classification and intensity estimation system Coelho, Luís Voice pleasantness Subtle emotions Perceptual speech analysis Text-to-Speech synthesis |
title_short |
On the development of an automatic voice pleasantness classification and intensity estimation system |
title_full |
On the development of an automatic voice pleasantness classification and intensity estimation system |
title_fullStr |
On the development of an automatic voice pleasantness classification and intensity estimation system |
title_full_unstemmed |
On the development of an automatic voice pleasantness classification and intensity estimation system |
title_sort |
On the development of an automatic voice pleasantness classification and intensity estimation system |
author |
Coelho, Luís |
author_facet |
Coelho, Luís Braga, Daniela Sales-Dias, Miguel Garcia-Mateo, Carmen |
author_role |
author |
author2 |
Braga, Daniela Sales-Dias, Miguel Garcia-Mateo, Carmen |
author2_role |
author author author |
dc.contributor.none.fl_str_mv |
Repositório Científico do Instituto Politécnico do Porto |
dc.contributor.author.fl_str_mv |
Coelho, Luís Braga, Daniela Sales-Dias, Miguel Garcia-Mateo, Carmen |
dc.subject.por.fl_str_mv |
Voice pleasantness Subtle emotions Perceptual speech analysis Text-to-Speech synthesis |
topic |
Voice pleasantness Subtle emotions Perceptual speech analysis Text-to-Speech synthesis |
description |
In the last few years, the number of systems and devices that use voice based interaction has grown significantly. For a continued use of these systems, the interface must be reliable and pleasant in order to provide an optimal user experience. However there are currently very few studies that try to evaluate how pleasant is a voice from a perceptual point of view when the final application is a speech based interface. In this paper we present an objective definition for voice pleasantness based on the composition of a representative feature subset and a new automatic voice pleasantness classification and intensity estimation system. Our study is based on a database composed by European Portuguese female voices but the methodology can be extended to male voices or to other languages. In the objective performance evaluation the system achieved a 9.1% error rate for voice pleasantness classification and a 15.7% error rate for voice pleasantness intensity estimation. |
publishDate |
2013 |
dc.date.none.fl_str_mv |
2013 2013-01-01T00:00:00Z 2014-01-22T18:48:16Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10400.22/3436 |
url |
http://hdl.handle.net/10400.22/3436 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Pinto-Coelho, L., Braga, D., Sales-Dias, M. & Garcia-Mateo, C. (2013) On the development of an automatic voice pleasantness classification and intensity estimation system. Computer Speech & Language, 27 (1), 75-88. doi: 10.1016/j.csl.2012.01.006 0885-2308 10.1016/j.csl.2012.01.006 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Elsevier |
publisher.none.fl_str_mv |
Elsevier |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799131338113024000 |