Disfluency Detection Across Domains
Autor(a) principal: | |
---|---|
Data de Publicação: | 2015 |
Outros Autores: | , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10451/31081 |
Resumo: | This paper focuses on disfluency detection across distinct domains using a large set of openSMILE features, derived from the Interspeech 2013 Paralinguistic challenge. Amongst different machine learning methods being applied, SVMs achieved the best performance. Feature selection experiments revealed that the dimensionality of the larger set of features can be further reduced at the cost of a small degradation. Different models trained with one corpus were tested on the other corpus, revealing that models can be quite robust across corpora for this task, despite their distinct nature. We have conducted additional experiments aiming at disfluency prediction in the context of IVR systems, and results reveal that there is no substantial degradation on the performance, encouraging the use of the models in IVR domains. |
id |
RCAP_ced32823af2758c2fa7bc84a3ec8fd97 |
---|---|
oai_identifier_str |
oai:repositorio.ul.pt:10451/31081 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Disfluency Detection Across DomainsDisfluency detectionAcoustic-prosodic featuresCross-domain analisysEuropean PortugueseThis paper focuses on disfluency detection across distinct domains using a large set of openSMILE features, derived from the Interspeech 2013 Paralinguistic challenge. Amongst different machine learning methods being applied, SVMs achieved the best performance. Feature selection experiments revealed that the dimensionality of the larger set of features can be further reduced at the cost of a small degradation. Different models trained with one corpus were tested on the other corpus, revealing that models can be quite robust across corpora for this task, despite their distinct nature. We have conducted additional experiments aiming at disfluency prediction in the context of IVR systems, and results reveal that there is no substantial degradation on the performance, encouraging the use of the models in IVR domains.International Phonetic AssociationRepositório da Universidade de LisboaMoniz, HelenaFerreira, JaimeBatista, FernandoTrancoso, Isabel2018-01-28T15:00:08Z20152015-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10451/31081engMoniz, H., Ferreira, J., Batista, F. & Trancoso, I. (2015) "Disfluency Detection Across Domains", In DISS 2015, Edinburgh, Scotlan, UK.info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-08T16:24:14Zoai:repositorio.ul.pt:10451/31081Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T21:46:36.338940Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Disfluency Detection Across Domains |
title |
Disfluency Detection Across Domains |
spellingShingle |
Disfluency Detection Across Domains Moniz, Helena Disfluency detection Acoustic-prosodic features Cross-domain analisys European Portuguese |
title_short |
Disfluency Detection Across Domains |
title_full |
Disfluency Detection Across Domains |
title_fullStr |
Disfluency Detection Across Domains |
title_full_unstemmed |
Disfluency Detection Across Domains |
title_sort |
Disfluency Detection Across Domains |
author |
Moniz, Helena |
author_facet |
Moniz, Helena Ferreira, Jaime Batista, Fernando Trancoso, Isabel |
author_role |
author |
author2 |
Ferreira, Jaime Batista, Fernando Trancoso, Isabel |
author2_role |
author author author |
dc.contributor.none.fl_str_mv |
Repositório da Universidade de Lisboa |
dc.contributor.author.fl_str_mv |
Moniz, Helena Ferreira, Jaime Batista, Fernando Trancoso, Isabel |
dc.subject.por.fl_str_mv |
Disfluency detection Acoustic-prosodic features Cross-domain analisys European Portuguese |
topic |
Disfluency detection Acoustic-prosodic features Cross-domain analisys European Portuguese |
description |
This paper focuses on disfluency detection across distinct domains using a large set of openSMILE features, derived from the Interspeech 2013 Paralinguistic challenge. Amongst different machine learning methods being applied, SVMs achieved the best performance. Feature selection experiments revealed that the dimensionality of the larger set of features can be further reduced at the cost of a small degradation. Different models trained with one corpus were tested on the other corpus, revealing that models can be quite robust across corpora for this task, despite their distinct nature. We have conducted additional experiments aiming at disfluency prediction in the context of IVR systems, and results reveal that there is no substantial degradation on the performance, encouraging the use of the models in IVR domains. |
publishDate |
2015 |
dc.date.none.fl_str_mv |
2015 2015-01-01T00:00:00Z 2018-01-28T15:00:08Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10451/31081 |
url |
http://hdl.handle.net/10451/31081 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Moniz, H., Ferreira, J., Batista, F. & Trancoso, I. (2015) "Disfluency Detection Across Domains", In DISS 2015, Edinburgh, Scotlan, UK. |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
International Phonetic Association |
publisher.none.fl_str_mv |
International Phonetic Association |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799134391154245632 |