Disfluency Detection Across Domains

Detalhes bibliográficos
Autor(a) principal: Moniz, Helena
Data de Publicação: 2015
Outros Autores: Ferreira, Jaime, Batista, Fernando, Trancoso, Isabel
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10451/31081
Resumo: This paper focuses on disfluency detection across distinct domains using a large set of openSMILE features, derived from the Interspeech 2013 Paralinguistic challenge. Amongst different machine learning methods being applied, SVMs achieved the best performance. Feature selection experiments revealed that the dimensionality of the larger set of features can be further reduced at the cost of a small degradation. Different models trained with one corpus were tested on the other corpus, revealing that models can be quite robust across corpora for this task, despite their distinct nature. We have conducted additional experiments aiming at disfluency prediction in the context of IVR systems, and results reveal that there is no substantial degradation on the performance, encouraging the use of the models in IVR domains.
id RCAP_ced32823af2758c2fa7bc84a3ec8fd97
oai_identifier_str oai:repositorio.ul.pt:10451/31081
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Disfluency Detection Across DomainsDisfluency detectionAcoustic-prosodic featuresCross-domain analisysEuropean PortugueseThis paper focuses on disfluency detection across distinct domains using a large set of openSMILE features, derived from the Interspeech 2013 Paralinguistic challenge. Amongst different machine learning methods being applied, SVMs achieved the best performance. Feature selection experiments revealed that the dimensionality of the larger set of features can be further reduced at the cost of a small degradation. Different models trained with one corpus were tested on the other corpus, revealing that models can be quite robust across corpora for this task, despite their distinct nature. We have conducted additional experiments aiming at disfluency prediction in the context of IVR systems, and results reveal that there is no substantial degradation on the performance, encouraging the use of the models in IVR domains.International Phonetic AssociationRepositório da Universidade de LisboaMoniz, HelenaFerreira, JaimeBatista, FernandoTrancoso, Isabel2018-01-28T15:00:08Z20152015-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10451/31081engMoniz, H., Ferreira, J., Batista, F. & Trancoso, I. (2015) "Disfluency Detection Across Domains", In DISS 2015, Edinburgh, Scotlan, UK.info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-08T16:24:14Zoai:repositorio.ul.pt:10451/31081Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T21:46:36.338940Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Disfluency Detection Across Domains
title Disfluency Detection Across Domains
spellingShingle Disfluency Detection Across Domains
Moniz, Helena
Disfluency detection
Acoustic-prosodic features
Cross-domain analisys
European Portuguese
title_short Disfluency Detection Across Domains
title_full Disfluency Detection Across Domains
title_fullStr Disfluency Detection Across Domains
title_full_unstemmed Disfluency Detection Across Domains
title_sort Disfluency Detection Across Domains
author Moniz, Helena
author_facet Moniz, Helena
Ferreira, Jaime
Batista, Fernando
Trancoso, Isabel
author_role author
author2 Ferreira, Jaime
Batista, Fernando
Trancoso, Isabel
author2_role author
author
author
dc.contributor.none.fl_str_mv Repositório da Universidade de Lisboa
dc.contributor.author.fl_str_mv Moniz, Helena
Ferreira, Jaime
Batista, Fernando
Trancoso, Isabel
dc.subject.por.fl_str_mv Disfluency detection
Acoustic-prosodic features
Cross-domain analisys
European Portuguese
topic Disfluency detection
Acoustic-prosodic features
Cross-domain analisys
European Portuguese
description This paper focuses on disfluency detection across distinct domains using a large set of openSMILE features, derived from the Interspeech 2013 Paralinguistic challenge. Amongst different machine learning methods being applied, SVMs achieved the best performance. Feature selection experiments revealed that the dimensionality of the larger set of features can be further reduced at the cost of a small degradation. Different models trained with one corpus were tested on the other corpus, revealing that models can be quite robust across corpora for this task, despite their distinct nature. We have conducted additional experiments aiming at disfluency prediction in the context of IVR systems, and results reveal that there is no substantial degradation on the performance, encouraging the use of the models in IVR domains.
publishDate 2015
dc.date.none.fl_str_mv 2015
2015-01-01T00:00:00Z
2018-01-28T15:00:08Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10451/31081
url http://hdl.handle.net/10451/31081
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Moniz, H., Ferreira, J., Batista, F. & Trancoso, I. (2015) "Disfluency Detection Across Domains", In DISS 2015, Edinburgh, Scotlan, UK.
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv International Phonetic Association
publisher.none.fl_str_mv International Phonetic Association
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799134391154245632