On the development of an automatic voice pleasantness classification and intensity estimation system

Bibliographic details
Main author: Pinto-Coelho, L.
Publication date: 2013
Other authors: Braga, D., Sales-Dias, M., Garcia-Mateo, C.
Document type: Article
Language: English
Source title: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Full text: http://www.sciencedirect.com/science/article/pii/S0885230812000083
https://ciencia.iscte-iul.pt/public/pub/id/16887
http://hdl.handle.net/10071/7428
Abstract: In the last few years, the number of systems and devices that use voice-based interaction has grown significantly. For continued use of these systems, the interface must be reliable and pleasant in order to provide an optimal user experience. However, there are currently very few studies that try to evaluate how pleasant a voice is from a perceptual point of view when the final application is a speech-based interface. In this paper we present an objective definition of voice pleasantness based on the composition of a representative feature subset, and a new automatic voice pleasantness classification and intensity estimation system. Our study is based on a database composed of European Portuguese female voices, but the methodology can be extended to male voices or to other languages. In the objective performance evaluation, the system achieved a 9.1% error rate for voice pleasantness classification and a 15.7% error rate for voice pleasantness intensity estimation.
Subjects: Voice pleasantness; Subtle emotions; Perceptual speech analysis; Text-to-Speech synthesis
Publisher: Elsevier
Relation: 0885-2308
Format: application/pdf
Version: publishedVersion (info:eu-repo/semantics/publishedVersion)
Rights: openAccess (info:eu-repo/semantics/openAccess)
Record dates: 2013-01-01T00:00:00Z; 2013-01; 2014-06-03T17:21:02Z; 2014-06-03T17:17:49Z
Record ID: RCAP_7ca06f149a0fa18a1032d2d49775cdf1
OAI identifier: oai:repositorio.iscte-iul.pt:10071/7428
Repository ID: 7160 (opendoar:7160)
Repository: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Institution: Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação (RCAAP)