On the development of an automatic voice pleasantness classification and intensity estimation system
Autor(a) principal: | |
---|---|
Data de Publicação: | 2013 |
Outros Autores: | , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://www.sciencedirect.com/science/article/pii/S0885230812000083 https://ciencia.iscte-iul.pt/public/pub/id/16887 http://hdl.handle.net/10071/7428 |
Resumo: | In the last few years, the number of systems and devices that use voice based interaction has grown significantly. For a continued use of these systems, the interface must be reliable and pleasant in order to provide an optimal user experience. However there are currently very few studies that try to evaluate how pleasant is a voice from a perceptual point of view when the final application is a speech based interface. In this paper we present an objective definition for voice pleasantness based on the composition of a representative feature subset and a new automatic voice pleasantness classification and intensity estimation system. Our study is based on a database composed by European Portuguese female voices but the methodology can be extended to male voices or to other languages. In the objective performance evaluation the system achieved a 9.1 error rate for voice pleasantness classification and a 15.7 error rate for voice pleasantness intensity estimation. |
id |
RCAP_7ca06f149a0fa18a1032d2d49775cdf1 |
---|---|
oai_identifier_str |
oai:repositorio.iscte-iul.pt:10071/7428 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
On the development of an automatic voice pleasantness classification and intensity estimation systemVoice pleasantnessSubtle emotionsPerceptual speech analysisText-to-Speech synthesisIn the last few years, the number of systems and devices that use voice based interaction has grown significantly. For a continued use of these systems, the interface must be reliable and pleasant in order to provide an optimal user experience. However there are currently very few studies that try to evaluate how pleasant is a voice from a perceptual point of view when the final application is a speech based interface. In this paper we present an objective definition for voice pleasantness based on the composition of a representative feature subset and a new automatic voice pleasantness classification and intensity estimation system. Our study is based on a database composed by European Portuguese female voices but the methodology can be extended to male voices or to other languages. In the objective performance evaluation the system achieved a 9.1 error rate for voice pleasantness classification and a 15.7 error rate for voice pleasantness intensity estimation.Elsevier2014-06-03T17:21:02Z2013-01-01T00:00:00Z2013-012014-06-03T17:17:49Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://www.sciencedirect.com/science/article/pii/S0885230812000083https://ciencia.iscte-iul.pt/public/pub/id/16887http://hdl.handle.net/10071/7428eng0885-2308Pinto-Coelho, L.Braga, D.Sales-Dias, M.Garcia-Mateo, C.info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-09T17:25:52Zoai:repositorio.iscte-iul.pt:10071/7428Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T22:11:33.061090Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
On the development of an automatic voice pleasantness classification and intensity estimation system |
title |
On the development of an automatic voice pleasantness classification and intensity estimation system |
spellingShingle |
On the development of an automatic voice pleasantness classification and intensity estimation system Pinto-Coelho, L. Voice pleasantness Subtle emotions Perceptual speech analysis Text-to-Speech synthesis |
title_short |
On the development of an automatic voice pleasantness classification and intensity estimation system |
title_full |
On the development of an automatic voice pleasantness classification and intensity estimation system |
title_fullStr |
On the development of an automatic voice pleasantness classification and intensity estimation system |
title_full_unstemmed |
On the development of an automatic voice pleasantness classification and intensity estimation system |
title_sort |
On the development of an automatic voice pleasantness classification and intensity estimation system |
author |
Pinto-Coelho, L. |
author_facet |
Pinto-Coelho, L. Braga, D. Sales-Dias, M. Garcia-Mateo, C. |
author_role |
author |
author2 |
Braga, D. Sales-Dias, M. Garcia-Mateo, C. |
author2_role |
author author author |
dc.contributor.author.fl_str_mv |
Pinto-Coelho, L. Braga, D. Sales-Dias, M. Garcia-Mateo, C. |
dc.subject.por.fl_str_mv |
Voice pleasantness Subtle emotions Perceptual speech analysis Text-to-Speech synthesis |
topic |
Voice pleasantness Subtle emotions Perceptual speech analysis Text-to-Speech synthesis |
description |
In the last few years, the number of systems and devices that use voice based interaction has grown significantly. For a continued use of these systems, the interface must be reliable and pleasant in order to provide an optimal user experience. However there are currently very few studies that try to evaluate how pleasant is a voice from a perceptual point of view when the final application is a speech based interface. In this paper we present an objective definition for voice pleasantness based on the composition of a representative feature subset and a new automatic voice pleasantness classification and intensity estimation system. Our study is based on a database composed by European Portuguese female voices but the methodology can be extended to male voices or to other languages. In the objective performance evaluation the system achieved a 9.1 error rate for voice pleasantness classification and a 15.7 error rate for voice pleasantness intensity estimation. |
publishDate |
2013 |
dc.date.none.fl_str_mv |
2013-01-01T00:00:00Z 2013-01 2014-06-03T17:21:02Z 2014-06-03T17:17:49Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://www.sciencedirect.com/science/article/pii/S0885230812000083 https://ciencia.iscte-iul.pt/public/pub/id/16887 http://hdl.handle.net/10071/7428 |
url |
http://www.sciencedirect.com/science/article/pii/S0885230812000083 https://ciencia.iscte-iul.pt/public/pub/id/16887 http://hdl.handle.net/10071/7428 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
0885-2308 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Elsevier |
publisher.none.fl_str_mv |
Elsevier |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799134669991575552 |