Prosodic classification of discourse markers
Autor(a) principal: | |
---|---|
Data de Publicação: | 2016 |
Outros Autores: | , , , , , |
Tipo de documento: | Artigo |
Idioma: | por |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | https://doi.org/10.26334/2183-9077/rapln2ano2016a4 |
Resumo: | This work describes the discourse markers present in two corpora for European Portuguese, in different domains (university lectures and map-task dialogues). In this study, we also perform a multiclass automatic classification task based on prosodic features to verify in both corpora which words are discourse markers, which are disfluencies, and which are sentence like-units (SUs). Results show that the selection of discourse markers varies across domain and between speakers. As for the classification task, results show that the discourse markers are better classified in the lectures corpus (87%) than in the dialogue corpus (84%). However, cross-domain experiments evidenced that data trained with the dialogue corpus predicts better the events in the lecture corpus, since this domain displays more speakers and therefore complex patterns. In both corpora, markers are more easily classified as SUs than as disfluencies. |
id |
RCAP_64949a927ad67855b334997fbe3ca7ed |
---|---|
oai_identifier_str |
oai:ojs3.ojs.apl.pt:article/235 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Prosodic classification of discourse markersClassificação prosódica de marcadores discursivosmarcadores discursivosprosódiaprocessamento de falaclassificação automática multiclassediscourse markersprosodyspeech processingmulticlass automatic classificationThis work describes the discourse markers present in two corpora for European Portuguese, in different domains (university lectures and map-task dialogues). In this study, we also perform a multiclass automatic classification task based on prosodic features to verify in both corpora which words are discourse markers, which are disfluencies, and which are sentence like-units (SUs). Results show that the selection of discourse markers varies across domain and between speakers. As for the classification task, results show that the discourse markers are better classified in the lectures corpus (87%) than in the dialogue corpus (84%). However, cross-domain experiments evidenced that data trained with the dialogue corpus predicts better the events in the lecture corpus, since this domain displays more speakers and therefore complex patterns. In both corpora, markers are more easily classified as SUs than as disfluencies.Associação Portuguesa de Linguística2016-10-31info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttps://doi.org/10.26334/2183-9077/rapln2ano2016a4https://doi.org/10.26334/2183-9077/rapln2ano2016a4Revista da Associação Portuguesa de Linguística; No. 2 (2016): Journal of the Portuguese Linguistics Association; 69-95Revista da Associação Portuguesa de Linguística; N.º 2 (2016): Revista da Associação Portuguesa de Linguística; 69-952183-907710.26334/2183-9077/rapln2ano2016tdreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAPporhttps://ojs.apl.pt/index.php/rapl/article/view/235https://ojs.apl.pt/index.php/rapl/article/view/235/196Cabarrão, VeraMoniz, HelenaJaime FerreiraFernando BatistaIsabel TrancosoAna Isabel MataSérgio Curtoinfo:eu-repo/semantics/openAccess2023-12-02T10:18:20Zoai:ojs3.ojs.apl.pt:article/235Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T20:36:05.609708Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Prosodic classification of discourse markers Classificação prosódica de marcadores discursivos |
title |
Prosodic classification of discourse markers |
spellingShingle |
Prosodic classification of discourse markers Cabarrão, Vera marcadores discursivos prosódia processamento de fala classificação automática multiclasse discourse markers prosody speech processing multiclass automatic classification |
title_short |
Prosodic classification of discourse markers |
title_full |
Prosodic classification of discourse markers |
title_fullStr |
Prosodic classification of discourse markers |
title_full_unstemmed |
Prosodic classification of discourse markers |
title_sort |
Prosodic classification of discourse markers |
author |
Cabarrão, Vera |
author_facet |
Cabarrão, Vera Moniz, Helena Jaime Ferreira Fernando Batista Isabel Trancoso Ana Isabel Mata Sérgio Curto |
author_role |
author |
author2 |
Moniz, Helena Jaime Ferreira Fernando Batista Isabel Trancoso Ana Isabel Mata Sérgio Curto |
author2_role |
author author author author author author |
dc.contributor.author.fl_str_mv |
Cabarrão, Vera Moniz, Helena Jaime Ferreira Fernando Batista Isabel Trancoso Ana Isabel Mata Sérgio Curto |
dc.subject.por.fl_str_mv |
marcadores discursivos prosódia processamento de fala classificação automática multiclasse discourse markers prosody speech processing multiclass automatic classification |
topic |
marcadores discursivos prosódia processamento de fala classificação automática multiclasse discourse markers prosody speech processing multiclass automatic classification |
description |
This work describes the discourse markers present in two corpora for European Portuguese, in different domains (university lectures and map-task dialogues). In this study, we also perform a multiclass automatic classification task based on prosodic features to verify in both corpora which words are discourse markers, which are disfluencies, and which are sentence like-units (SUs). Results show that the selection of discourse markers varies across domain and between speakers. As for the classification task, results show that the discourse markers are better classified in the lectures corpus (87%) than in the dialogue corpus (84%). However, cross-domain experiments evidenced that data trained with the dialogue corpus predicts better the events in the lecture corpus, since this domain displays more speakers and therefore complex patterns. In both corpora, markers are more easily classified as SUs than as disfluencies. |
publishDate |
2016 |
dc.date.none.fl_str_mv |
2016-10-31 |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://doi.org/10.26334/2183-9077/rapln2ano2016a4 https://doi.org/10.26334/2183-9077/rapln2ano2016a4 |
url |
https://doi.org/10.26334/2183-9077/rapln2ano2016a4 |
dc.language.iso.fl_str_mv |
por |
language |
por |
dc.relation.none.fl_str_mv |
https://ojs.apl.pt/index.php/rapl/article/view/235 https://ojs.apl.pt/index.php/rapl/article/view/235/196 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Associação Portuguesa de Linguística |
publisher.none.fl_str_mv |
Associação Portuguesa de Linguística |
dc.source.none.fl_str_mv |
Revista da Associação Portuguesa de Linguística; No. 2 (2016): Journal of the Portuguese Linguistics Association; 69-95 Revista da Associação Portuguesa de Linguística; N.º 2 (2016): Revista da Associação Portuguesa de Linguística; 69-95 2183-9077 10.26334/2183-9077/rapln2ano2016td reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799133623920623616 |