Linguistics parameters for zero anaphora resolution
Autor(a) principal: | |
---|---|
Data de Publicação: | 2010 |
Tipo de documento: | Dissertação |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10400.1/1787 |
Resumo: | Dissertação de mest., Natural Language Processing and Human Language Technology, Univ. do Algarve, 2009 |
id |
RCAP_b7a8b63308f7d3ae27ca0cd210907748 |
---|---|
oai_identifier_str |
oai:sapientia.ualg.pt:10400.1/1787 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Linguistics parameters for zero anaphora resolutionResolução de anáforaAnáfora zeroAbordagem baseada em regras linguisticamente motivadasPortuguês do BrasilDissertação de mest., Natural Language Processing and Human Language Technology, Univ. do Algarve, 2009This dissertation describes and proposes a set of linguistically motivated rules for zero anaphora resolution in the context of a natural language processing chain developed for Portuguese. Some languages, like Portuguese, allow noun phrase (NP) deletion (or zeroing) in several syntactic contexts in order to avoid the redundancy that would result from repetition of previously mentioned words. The co-reference relation between the zeroed element and its antecedent (or previous mention) in the discourse is here called zero anaphora (Mitkov, 2002). In Computational Linguistics, zero anaphora resolution may be viewed as a subtask of anaphora resolution and has an essential role in various Natural Language Processing applications such as information extraction, automatic abstracting, dialog systems, machine translation and question answering. The main goal of this dissertation is to describe the grammatical rules imposing subject NP deletion and referential constraints in the Brazilian Portuguese, in order to allow a correct identification of the antecedent of the deleted subject NP. Some of these rules were then formalized into the Xerox Incremental Parser or XIP (Ait-Mokhtar et al., 2002: 121-144) in order to constitute a module of the Portuguese grammar (Mamede et al. 2010) developed at Spoken Language Laboratory (L2F). Using this rule-based approach we expected to improve the performance of the Portuguese grammar namely by producing better dependency structures with (reconstructed) zeroed NPs for the syntactic-semantic interface. Because of the complexity of the task, the scope of this dissertation had to be limited: (a) subject NP deletion; b) within sentence boundaries and (c) with an explicit antecedent; besides, (d) rules were formalized based solely on the results of the shallow parser (or chunks), that is, with minimal syntactic (and no semantic) knowledge. A corpus of different text genres was manually annotated for zero anaphors and other zero-shaped, usually indefinite, subjects. The rule-based approached is evaluated and results are presented and discussed.Baptista, JorgeEvans, Richard J.SapientiaPereira, Simone Cristina2012-10-30T17:41:25Z20102010-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10400.1/1787eng81'36 PER*Lininfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-07-24T10:12:56Zoai:sapientia.ualg.pt:10400.1/1787Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T19:55:54.528731Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Linguistics parameters for zero anaphora resolution |
title |
Linguistics parameters for zero anaphora resolution |
spellingShingle |
Linguistics parameters for zero anaphora resolution Pereira, Simone Cristina Resolução de anáfora Anáfora zero Abordagem baseada em regras linguisticamente motivadas Português do Brasil |
title_short |
Linguistics parameters for zero anaphora resolution |
title_full |
Linguistics parameters for zero anaphora resolution |
title_fullStr |
Linguistics parameters for zero anaphora resolution |
title_full_unstemmed |
Linguistics parameters for zero anaphora resolution |
title_sort |
Linguistics parameters for zero anaphora resolution |
author |
Pereira, Simone Cristina |
author_facet |
Pereira, Simone Cristina |
author_role |
author |
dc.contributor.none.fl_str_mv |
Baptista, Jorge Evans, Richard J. Sapientia |
dc.contributor.author.fl_str_mv |
Pereira, Simone Cristina |
dc.subject.por.fl_str_mv |
Resolução de anáfora Anáfora zero Abordagem baseada em regras linguisticamente motivadas Português do Brasil |
topic |
Resolução de anáfora Anáfora zero Abordagem baseada em regras linguisticamente motivadas Português do Brasil |
description |
Dissertação de mest., Natural Language Processing and Human Language Technology, Univ. do Algarve, 2009 |
publishDate |
2010 |
dc.date.none.fl_str_mv |
2010 2010-01-01T00:00:00Z 2012-10-30T17:41:25Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10400.1/1787 |
url |
http://hdl.handle.net/10400.1/1787 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
81'36 PER*Lin |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799133163482513408 |