VaLexPB: um léxico de valências e diáteses verbais para o português brasileiro baseado no formalismo LFG-XLE

Bibliographic details
Main author: Mourão, Jessé de Sousa
Publication date: 2021
Document type: Thesis
Language: por
Source title: Repositório Institucional da Universidade Federal do Ceará (UFC)
Full text: http://www.repositorio.ufc.br/handle/riufc/62043
Abstract: This dissertation presents research whose main objective was to build a computational lexicon of verbal valences and diatheses of Brazilian Portuguese (BP), named VaLexPB, in the LFG-XLE formalism. The work was initially motivated by theoretical, descriptive, and applied gaps identified in the linguistic literature and in the technical and technological knowledge available for the computational processing of Brazilian Portuguese. In particular, the research aimed to contribute to the development of a lexical-functional computational grammar for BP. To this end, a set of valences in the form of verbal diatheses, based on the Dicionário de Valências Verbais do Português Brasileiro (Perini, 2015), was elaborated and implemented in the LFG-XLE formalism. The theoretical model adopted for description and analysis was Lexical-Functional Grammar, along with theoretical proposals by Perini (2015, 2019), Cançado (2005), and Cançado and Amaral (2016). The methodological approach was the one already established in work on building computational grammars in LFG-XLE. A further goal was to evaluate the resource's ability to cover BP sentences containing the investigated verbs and to analyze the computational and theoretical consequences of the implemented resource. As a result, a set of 389 verbal diatheses was developed, covering sentential constructions licensed by at least 409 BP verbs. These diatheses allow sentences to be analyzed at three levels: two levels of syntactic representation (c-structure and f-structure) and one level of semantic representation (a-structure). Regarding coverage, tests showed that VaLexPB covered over 90% of new sentences licensed by the implemented verbs. The results also showed that the resource has a significant degree of ambiguity, producing an average of 2.8 to 3.7 solutions per sentence (across the development and coverage tests); future work should address this by adding constraints that reduce the ambiguity. Despite the improvements still needed, the resource is relevant and useful: on the theoretical side, it contributes to current knowledge about the nature and functioning of the BP verbal lexicon; on the practical side, it extends a computational LFG grammar for BP, enriching the computational processing landscape for this language.
id UFC-7_5a52f4016ba353c1c02d490234cbeeb8
oai_identifier_str oai:repositorio.ufc.br:riufc/62043
network_acronym_str UFC-7
network_name_str Repositório Institucional da Universidade Federal do Ceará (UFC)
repository_id_str
dc.title.none.fl_str_mv VaLexPB: um léxico de valências e diáteses verbais para o português brasileiro baseado no formalismo LFG-XLE
title VaLexPB: um léxico de valências e diáteses verbais para o português brasileiro baseado no formalismo LFG-XLE
author Mourão, Jessé de Sousa
author_facet Mourão, Jessé de Sousa
author_role author
dc.contributor.none.fl_str_mv Araripe, Leonel Figueiredo de Alencar
dc.contributor.author.fl_str_mv Mourão, Jessé de Sousa
dc.subject.por.fl_str_mv Valência verbal
Diáteses verbais
Gramática Léxico-Funcional
Processamento de linguagem natural
Português brasileiro
topic Valência verbal
Diáteses verbais
Gramática Léxico-Funcional
Processamento de linguagem natural
Português brasileiro
publishDate 2021
dc.date.none.fl_str_mv 2021-11-12T19:16:17Z
2021-11-12T19:16:17Z
2021
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/doctoralThesis
format doctoralThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv MOURÃO, Jessé de Sousa. VaLexPB: um léxico de valências e diáteses verbais para o português brasileiro baseado no formalismo LFG-XLE. 2021. 297 f - Tese (Doutorado em Linguística) - Programa de Pós-graduação em Linguística, Centro de Humanidades, Universidade Federal do Ceará, Fortaleza, 2021.
http://www.repositorio.ufc.br/handle/riufc/62043
identifier_str_mv MOURÃO, Jessé de Sousa. VaLexPB: um léxico de valências e diáteses verbais para o português brasileiro baseado no formalismo LFG-XLE. 2021. 297 f - Tese (Doutorado em Linguística) - Programa de Pós-graduação em Linguística, Centro de Humanidades, Universidade Federal do Ceará, Fortaleza, 2021.
url http://www.repositorio.ufc.br/handle/riufc/62043
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Institucional da Universidade Federal do Ceará (UFC)
instname:Universidade Federal do Ceará (UFC)
instacron:UFC
instname_str Universidade Federal do Ceará (UFC)
instacron_str UFC
institution UFC
reponame_str Repositório Institucional da Universidade Federal do Ceará (UFC)
collection Repositório Institucional da Universidade Federal do Ceará (UFC)
repository.name.fl_str_mv Repositório Institucional da Universidade Federal do Ceará (UFC) - Universidade Federal do Ceará (UFC)
repository.mail.fl_str_mv bu@ufc.br || repositorio@ufc.br
_version_ 1813028817111875584