Prosodic classification of discourse markers

Detalhes bibliográficos
Autor(a) principal: Cabarrão, Vera
Data de Publicação: 2016
Outros Autores: Moniz, Helena, Jaime Ferreira, Fernando Batista, Isabel Trancoso, Ana Isabel Mata, Sérgio Curto
Tipo de documento: Artigo
Idioma: por
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: https://doi.org/10.26334/2183-9077/rapln2ano2016a4
Resumo: This work describes the discourse markers present in two corpora for European Portuguese, in different domains (university lectures and map-task dialogues). In this study, we also perform a multiclass automatic classification task based on prosodic features to verify in both corpora which words are discourse markers, which are disfluencies, and which are sentence like-units (SUs). Results show that the selection of discourse markers varies across domain and between speakers. As for the classification task, results show that the discourse markers are better classified in the lectures corpus (87%) than in the dialogue corpus (84%). However, cross-domain experiments evidenced that data trained with the dialogue corpus predicts better the events in the lecture corpus, since this domain displays more speakers and therefore complex patterns. In both corpora, markers are more easily classified as SUs than as disfluencies.
id RCAP_64949a927ad67855b334997fbe3ca7ed
oai_identifier_str oai:ojs3.ojs.apl.pt:article/235
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Prosodic classification of discourse markersClassificação prosódica de marcadores discursivosmarcadores discursivosprosódiaprocessamento de falaclassificação automática multiclassediscourse markersprosodyspeech processingmulticlass automatic classificationThis work describes the discourse markers present in two corpora for European Portuguese, in different domains (university lectures and map-task dialogues). In this study, we also perform a multiclass automatic classification task based on prosodic features to verify in both corpora which words are discourse markers, which are disfluencies, and which are sentence like-units (SUs). Results show that the selection of discourse markers varies across domain and between speakers. As for the classification task, results show that the discourse markers are better classified in the lectures corpus (87%) than in the dialogue corpus (84%). However, cross-domain experiments evidenced that data trained with the dialogue corpus predicts better the events in the lecture corpus, since this domain displays more speakers and therefore complex patterns. In both corpora, markers are more easily classified as SUs than as disfluencies.Associação Portuguesa de Linguística2016-10-31info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttps://doi.org/10.26334/2183-9077/rapln2ano2016a4https://doi.org/10.26334/2183-9077/rapln2ano2016a4Revista da Associação Portuguesa de Linguística; No. 2 (2016): Journal of the Portuguese Linguistics Association; 69-95Revista da Associação Portuguesa de Linguística; N.º 2 (2016): Revista da Associação Portuguesa de Linguística; 69-952183-907710.26334/2183-9077/rapln2ano2016tdreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAPporhttps://ojs.apl.pt/index.php/rapl/article/view/235https://ojs.apl.pt/index.php/rapl/article/view/235/196Cabarrão, VeraMoniz, HelenaJaime FerreiraFernando BatistaIsabel TrancosoAna Isabel MataSérgio Curtoinfo:eu-repo/semantics/openAccess2023-12-02T10:18:20Zoai:ojs3.ojs.apl.pt:article/235Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T20:36:05.609708Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Prosodic classification of discourse markers
Classificação prosódica de marcadores discursivos
title Prosodic classification of discourse markers
spellingShingle Prosodic classification of discourse markers
Cabarrão, Vera
marcadores discursivos
prosódia
processamento de fala
classificação automática multiclasse
discourse markers
prosody
speech processing
multiclass automatic classification
title_short Prosodic classification of discourse markers
title_full Prosodic classification of discourse markers
title_fullStr Prosodic classification of discourse markers
title_full_unstemmed Prosodic classification of discourse markers
title_sort Prosodic classification of discourse markers
author Cabarrão, Vera
author_facet Cabarrão, Vera
Moniz, Helena
Jaime Ferreira
Fernando Batista
Isabel Trancoso
Ana Isabel Mata
Sérgio Curto
author_role author
author2 Moniz, Helena
Jaime Ferreira
Fernando Batista
Isabel Trancoso
Ana Isabel Mata
Sérgio Curto
author2_role author
author
author
author
author
author
dc.contributor.author.fl_str_mv Cabarrão, Vera
Moniz, Helena
Jaime Ferreira
Fernando Batista
Isabel Trancoso
Ana Isabel Mata
Sérgio Curto
dc.subject.por.fl_str_mv marcadores discursivos
prosódia
processamento de fala
classificação automática multiclasse
discourse markers
prosody
speech processing
multiclass automatic classification
topic marcadores discursivos
prosódia
processamento de fala
classificação automática multiclasse
discourse markers
prosody
speech processing
multiclass automatic classification
description This work describes the discourse markers present in two corpora for European Portuguese, in different domains (university lectures and map-task dialogues). In this study, we also perform a multiclass automatic classification task based on prosodic features to verify in both corpora which words are discourse markers, which are disfluencies, and which are sentence like-units (SUs). Results show that the selection of discourse markers varies across domain and between speakers. As for the classification task, results show that the discourse markers are better classified in the lectures corpus (87%) than in the dialogue corpus (84%). However, cross-domain experiments evidenced that data trained with the dialogue corpus predicts better the events in the lecture corpus, since this domain displays more speakers and therefore complex patterns. In both corpora, markers are more easily classified as SUs than as disfluencies.
publishDate 2016
dc.date.none.fl_str_mv 2016-10-31
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://doi.org/10.26334/2183-9077/rapln2ano2016a4
https://doi.org/10.26334/2183-9077/rapln2ano2016a4
url https://doi.org/10.26334/2183-9077/rapln2ano2016a4
dc.language.iso.fl_str_mv por
language por
dc.relation.none.fl_str_mv https://ojs.apl.pt/index.php/rapl/article/view/235
https://ojs.apl.pt/index.php/rapl/article/view/235/196
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Associação Portuguesa de Linguística
publisher.none.fl_str_mv Associação Portuguesa de Linguística
dc.source.none.fl_str_mv Revista da Associação Portuguesa de Linguística; No. 2 (2016): Journal of the Portuguese Linguistics Association; 69-95
Revista da Associação Portuguesa de Linguística; N.º 2 (2016): Revista da Associação Portuguesa de Linguística; 69-95
2183-9077
10.26334/2183-9077/rapln2ano2016td
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799133623920623616