Prosodic, syntactic, semantic guidelines for topic structures across domains and corpora
Main author: | Mata, Ana Isabel |
---|---|
Publication date: | 2014 |
Other authors: | Moniz, Helena; Móia, Telmo; Gonçalves, Anabela; Silva, Fátima; Batista, Fernando; Duarte, Inês; Oliveira, Fátima; Falé, Isabel |
Document type: | Article |
Language: | eng |
Source title: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Full text: | http://hdl.handle.net/10400.2/6327 |
Abstract: | This paper presents annotation guidelines applied to naturally occurring speech, aiming at an integrated account of contrast and parallel structures in European Portuguese. The guidelines were defined to support the empirical study of interactions between intonation and syntax-discourse patterns in selected sets of different corpora (monologues and dialogues, by adults and teenagers). We focus on the multilayer annotation of left-periphery structures, using a small sample of highly spontaneous speech that displays the distinct types of topic structures. The analysis of this sample provides fundamental training and testing material for further application to a wider range of domains and corpora. The annotation process comprises the following time-linked levels (manual and automatic): phone, syllable, and word-level transcriptions (including co-articulation effects); tonal events and break levels; part-of-speech tagging; syntactic-discourse patterns (construction type, construction position, syntactic function, discourse function); and disfluency events. Speech corpora with such multi-level annotation are a valuable resource for examining, from an integrated viewpoint, how grammar modules relate in language use. Such a viewpoint is innovative for European Portuguese and has not often been adopted in studies of other languages. |
id |
RCAP_6f1dae4f7307d913f0e62eb25162a61e |
---|---|
oai_identifier_str |
oai:repositorioaberto.uab.pt:10400.2/6327 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
This work was supported by national funds through FCT - Fundação para a Ciência e a Tecnologia, under project COPAS – PTDC/CLE-LIN/120017/2010. Fernando Batista is supported by ISCTE – Instituto Universitário de Lisboa. |
dc.title.none.fl_str_mv |
Prosodic, syntactic, semantic guidelines for topic structures across domains and corpora |
author |
Mata, Ana Isabel |
author_facet |
Mata, Ana Isabel; Moniz, Helena; Móia, Telmo; Gonçalves, Anabela; Silva, Fátima; Batista, Fernando; Duarte, Inês; Oliveira, Fátima; Falé, Isabel |
author_role |
author |
author2 |
Moniz, Helena; Móia, Telmo; Gonçalves, Anabela; Silva, Fátima; Batista, Fernando; Duarte, Inês; Oliveira, Fátima; Falé, Isabel |
author2_role |
author author author author author author author author |
dc.contributor.none.fl_str_mv |
Repositório Aberto |
dc.subject.por.fl_str_mv |
Speech annotation; Topic structures; Prosody; Syntax |
publishDate |
2014 |
dc.date.none.fl_str_mv |
2014-05; 2014-05-01T00:00:00Z; 2017-03-22T15:23:47Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10400.2/6327 |
dc.language.iso.fl_str_mv |
eng |
dc.relation.none.fl_str_mv |
Falé, Isabel; [et al.] - Prosodic, syntactic, semantic guidelines for topic structures across domains and corpora. In LREC 2014. International Conference on Language Resources and Evaluation, 9, Iceland, 2014 - "International conference...[Online]: proceedings". [S.l.] [s.n.], [2014]. ISBN 978-2-9517408-8-4. p. 1188-1193; 978-2-9517408-8-4 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
European Language Resources Association |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos); instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação; instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
_version_ |
1799135044363616256 |