Prosodic, Syntactic, Semantic Guidelines for Topic Structures Across Domains and Corpora

Bibliographic details
Main author: Mata, Ana Isabel
Publication date: 2014
Other authors: Moniz, Helena; Móia, Telmo; Gonçalves, Anabela; Silva, Fátima; Batista, Fernando; Duarte, Inês; Oliveira, Fátima; Falé, Isabel
Document type: Article
Language: eng
Source title: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Full text: http://hdl.handle.net/10451/31090
Abstract: This paper presents the annotation guidelines applied to naturally occurring speech, aiming at an integrated account of contrast and parallel structures in European Portuguese. These guidelines were defined to allow for the empirical study of interactions among intonation and syntax-discourse patterns in selected sets of different corpora (monologues and dialogues, by adults and teenagers). In this paper we focus on the multilayer annotation process of left periphery structures by using a small sample of highly spontaneous speech in which the distinct types of topic structures are displayed. The analysis of this sample provides fundamental training and testing material for further application in a wider range of domains and corpora. The annotation process comprises the following time-linked levels (manual and automatic): phone, syllable and word level transcriptions (including co-articulation effects); tonal events and break levels; part-of-speech tagging; syntactic-discourse patterns (construction type; construction position; syntactic function; discourse function), and disfluency events as well. Speech corpora with such a multi-level annotation are a valuable resource to look into grammar module relations in language use from an integrated viewpoint. Such viewpoint is innovative in our language, and has not been often assumed by studies for other languages.
id RCAP_88c653a08dbb12a9e7e1853d3d9402a0
oai_identifier_str oai:repositorio.ul.pt:10451/31090
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
dc.title.none.fl_str_mv Prosodic, Syntactic, Semantic Guidelines for Topic Structures Across Domains and Corpora
title Prosodic, Syntactic, Semantic Guidelines for Topic Structures Across Domains and Corpora
author Mata, Ana Isabel
author_facet Mata, Ana Isabel
Moniz, Helena
Móia, Telmo
Gonçalves, Anabela
Silva, Fátima
Batista, Fernando
Duarte, Inês
Oliveira, Fátima
Falé, Isabel
author_role author
author2 Moniz, Helena
Móia, Telmo
Gonçalves, Anabela
Silva, Fátima
Batista, Fernando
Duarte, Inês
Oliveira, Fátima
Falé, Isabel
author2_role author
author
author
author
author
author
author
author
dc.contributor.none.fl_str_mv Repositório da Universidade de Lisboa
dc.contributor.author.fl_str_mv Mata, Ana Isabel
Moniz, Helena
Móia, Telmo
Gonçalves, Anabela
Silva, Fátima
Batista, Fernando
Duarte, Inês
Oliveira, Fátima
Falé, Isabel
dc.subject.por.fl_str_mv Speech annotation
Topic structures
European Portuguese
topic Speech annotation
Topic structures
European Portuguese
publishDate 2014
dc.date.none.fl_str_mv 2014
2014-01-01T00:00:00Z
2018-01-28T15:36:57Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10451/31090
url http://hdl.handle.net/10451/31090
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Mata, A. I., Moniz, H., Móia, T., Gonçalves, A., Silva, F., Batista, F., Duarte, I., Oliveira, F. & Falé, I. (2014) Prosodic, Syntactic, Semantic Guidelines for Topic Structures Across Domains and Corpora, in Ninth International Conference on Language Resources and Evaluation (LREC'14), European Language Resources Association (ELRA), 1188-1193.
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv European Language Resources Association (ELRA)
publisher.none.fl_str_mv European Language Resources Association (ELRA)
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799134391186751488