Prosodic, syntactic, semantic guidelines for topic structures across domains and corpora

Bibliographic details
Main author: Mata, Ana Isabel
Publication date: 2014
Other authors: Moniz, Helena; Móia, Telmo; Gonçalves, Anabela; Silva, Fátima; Batista, Fernando; Duarte, Inês; Oliveira, Fátima; Falé, Isabel
Document type: Article
Language: English
Source title: Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos)
Full text: http://hdl.handle.net/10400.2/6327
Abstract: This paper presents annotation guidelines applied to naturally occurring speech, aiming at an integrated account of contrast and parallel structures in European Portuguese. The guidelines were defined to allow the empirical study of interactions between intonation and syntax-discourse patterns in selected sets of different corpora (monologues and dialogues, by adults and teenagers). The paper focuses on the multilayer annotation of left-periphery structures, using a small sample of highly spontaneous speech in which the distinct types of topic structures occur. The analysis of this sample provides fundamental training and testing material for further application to a wider range of domains and corpora. The annotation process comprises the following time-linked levels (manual and automatic): phone, syllable, and word-level transcriptions (including co-articulation effects); tonal events and break levels; part-of-speech tagging; syntactic-discourse patterns (construction type, construction position, syntactic function, discourse function); and disfluency events. Speech corpora with such multi-level annotation are a valuable resource for examining, from an integrated viewpoint, how grammar modules relate in language use. Such a viewpoint is innovative for European Portuguese and has not often been adopted in studies of other languages.
id RCAP_6f1dae4f7307d913f0e62eb25162a61e
oai_identifier_str oai:repositorioaberto.uab.pt:10400.2/6327
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos)
repository_id_str 7160
Funding: COPAS – PTDC/CLE-LIN/120017/2010. This work was supported by national funds through FCT - Fundação para a Ciência e a Tecnologia, under project COPAS – PTDC/CLE-LIN/120017/2010. Fernando Batista is supported by ISCTE – Instituto Universitário de Lisboa.
dc.contributor.none.fl_str_mv Repositório Aberto
dc.subject.por.fl_str_mv Speech annotation
Topic structures
Prosody
Syntax
publishDate 2014
dc.date.none.fl_str_mv 2014-05
2014-05-01T00:00:00Z
2017-03-22T15:23:47Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10400.2/6327
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Falé, Isabel; [et al.] - Prosodic, syntactic, semantic guidelines for topic structures across domains and corpora. In LREC 2014. International Conference on Language Resources and Evaluation, 9, Iceland, 2014 - "International conference...[Online]: proceedings". [S.l.] [s.n.], [2014]. ISBN 978-2-9517408-8-4. p. 1188-1193
978-2-9517408-8-4
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv European Language Resources Association
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP