Linguistics parameters for zero anaphora resolution

Detalhes bibliográficos
Autor(a) principal: Pereira, Simone Cristina
Data de Publicação: 2010
Tipo de documento: Dissertação
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10400.1/1787
Resumo: Dissertação de mest., Natural Language Processing and Human Language Technology, Univ. do Algarve, 2009
id RCAP_b7a8b63308f7d3ae27ca0cd210907748
oai_identifier_str oai:sapientia.ualg.pt:10400.1/1787
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Linguistics parameters for zero anaphora resolutionResolução de anáforaAnáfora zeroAbordagem baseada em regras linguisticamente motivadasPortuguês do BrasilDissertação de mest., Natural Language Processing and Human Language Technology, Univ. do Algarve, 2009This dissertation describes and proposes a set of linguistically motivated rules for zero anaphora resolution in the context of a natural language processing chain developed for Portuguese. Some languages, like Portuguese, allow noun phrase (NP) deletion (or zeroing) in several syntactic contexts in order to avoid the redundancy that would result from repetition of previously mentioned words. The co-reference relation between the zeroed element and its antecedent (or previous mention) in the discourse is here called zero anaphora (Mitkov, 2002). In Computational Linguistics, zero anaphora resolution may be viewed as a subtask of anaphora resolution and has an essential role in various Natural Language Processing applications such as information extraction, automatic abstracting, dialog systems, machine translation and question answering. The main goal of this dissertation is to describe the grammatical rules imposing subject NP deletion and referential constraints in the Brazilian Portuguese, in order to allow a correct identification of the antecedent of the deleted subject NP. Some of these rules were then formalized into the Xerox Incremental Parser or XIP (Ait-Mokhtar et al., 2002: 121-144) in order to constitute a module of the Portuguese grammar (Mamede et al. 2010) developed at Spoken Language Laboratory (L2F). Using this rule-based approach we expected to improve the performance of the Portuguese grammar namely by producing better dependency structures with (reconstructed) zeroed NPs for the syntactic-semantic interface. Because of the complexity of the task, the scope of this dissertation had to be limited: (a) subject NP deletion; b) within sentence boundaries and (c) with an explicit antecedent; besides, (d) rules were formalized based solely on the results of the shallow parser (or chunks), that is, with minimal syntactic (and no semantic) knowledge. A corpus of different text genres was manually annotated for zero anaphors and other zero-shaped, usually indefinite, subjects. The rule-based approached is evaluated and results are presented and discussed.Baptista, JorgeEvans, Richard J.SapientiaPereira, Simone Cristina2012-10-30T17:41:25Z20102010-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10400.1/1787eng81'36 PER*Lininfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-07-24T10:12:56Zoai:sapientia.ualg.pt:10400.1/1787Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T19:55:54.528731Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Linguistics parameters for zero anaphora resolution
title Linguistics parameters for zero anaphora resolution
spellingShingle Linguistics parameters for zero anaphora resolution
Pereira, Simone Cristina
Resolução de anáfora
Anáfora zero
Abordagem baseada em regras linguisticamente motivadas
Português do Brasil
title_short Linguistics parameters for zero anaphora resolution
title_full Linguistics parameters for zero anaphora resolution
title_fullStr Linguistics parameters for zero anaphora resolution
title_full_unstemmed Linguistics parameters for zero anaphora resolution
title_sort Linguistics parameters for zero anaphora resolution
author Pereira, Simone Cristina
author_facet Pereira, Simone Cristina
author_role author
dc.contributor.none.fl_str_mv Baptista, Jorge
Evans, Richard J.
Sapientia
dc.contributor.author.fl_str_mv Pereira, Simone Cristina
dc.subject.por.fl_str_mv Resolução de anáfora
Anáfora zero
Abordagem baseada em regras linguisticamente motivadas
Português do Brasil
topic Resolução de anáfora
Anáfora zero
Abordagem baseada em regras linguisticamente motivadas
Português do Brasil
description Dissertação de mest., Natural Language Processing and Human Language Technology, Univ. do Algarve, 2009
publishDate 2010
dc.date.none.fl_str_mv 2010
2010-01-01T00:00:00Z
2012-10-30T17:41:25Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10400.1/1787
url http://hdl.handle.net/10400.1/1787
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 81'36 PER*Lin
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799133163482513408