PrepNet.Br: uma proposta de representação semântica para as preposições do português
Autor(a) principal: | |
---|---|
Data de Publicação: | 2018 |
Tipo de documento: | Tese |
Idioma: | por |
Título da fonte: | Repositório Institucional da UFSCAR |
Texto Completo: | https://repositorio.ufscar.br/handle/ufscar/10006 |
Resumo: | In the last decades, the study of prepositions has been more modest when compared to the studies of nouns, verbs and adjectives, both in Linguistics and Natural Language Processing (NLP). However, some recent studies have shown that the semantic of prepositions is essential for the understanding of sentences and this can benefit computational applications. This thesis proposes the description of prepositions in terms of a semantic network inspired by WordNet (Princeton's Lexical Database for English, WN.Pr). In this proposal, it is argued that prepositions with similar spatial characteristics are synonymous and occupy the center of the network, from which their metaphorical extensions derive. In representational terms, the synonymy was modeled by means of synsets (as in WN.Pr), simple definitions (glosses) were assigned to each direction, and antonyms, hyperonymy, and hyponymy relations were established. Therefore, this research was developed in two research domains: Linguistic and Computational-Linguistics. In the first domain, (i) we study the theoretical cognitive knowledge that supports the spatial description of the basic sense of these items, and (ii) we identified their metaphorical senses based on a systematization of the different meanings attributed by Portuguese grammars. In the second domain, we proposed the systematization of the linguistic data in a conceptual hierarchy organized in a computationally treatable format implemented in Protégé, one of the most used editors for the creation and edition of ontologies. In total, 502 different senses were identified and indexed, creating the first version of PrepNet.Br, published online. Of these, 152 spatial meanings were attributed to 15 simple prepositions and 110 complex prepositions, while 350 are metaphorical meanings grouped into 114 classes. In addition, a total of 2358 sample sentences were selected. With the creation of this sense inventory, a new lexical resource was created for Brazilian Portuguese and the description of prepositions contributed to enrich the studies in Descriptive Linguistics. With the creation of this sense inventory, a new lexical resource was created for Brazilian Portuguese that may contribute to enrich the studies of Descriptive Linguistics and, even modestly, to the advance of NLP in Brazil. |
id |
SCAR_86951b3b628085c78be1169acfc3fe9c |
---|---|
oai_identifier_str |
oai:repositorio.ufscar.br:ufscar/10006 |
network_acronym_str |
SCAR |
network_name_str |
Repositório Institucional da UFSCAR |
repository_id_str |
4322 |
spelling |
Garcia, Débora DomicianoDi Felippo, Arianihttp://lattes.cnpq.br/8648412103197455Lopes, Marcos Fernandohttp://lattes.cnpq.br/8585527032653186http://lattes.cnpq.br/0661331988415902be190aca-eb36-484d-950b-9a9e7604b1262018-05-14T20:23:55Z2018-05-14T20:23:55Z2018-02-27GARCIA, Débora Domiciano. PrepNet.Br: uma proposta de representação semântica para as preposições do português. 2018. Tese (Doutorado em Linguística) – Universidade Federal de São Carlos, São Carlos, 2018. Disponível em: https://repositorio.ufscar.br/handle/ufscar/10006.https://repositorio.ufscar.br/handle/ufscar/10006In the last decades, the study of prepositions has been more modest when compared to the studies of nouns, verbs and adjectives, both in Linguistics and Natural Language Processing (NLP). However, some recent studies have shown that the semantic of prepositions is essential for the understanding of sentences and this can benefit computational applications. This thesis proposes the description of prepositions in terms of a semantic network inspired by WordNet (Princeton's Lexical Database for English, WN.Pr). In this proposal, it is argued that prepositions with similar spatial characteristics are synonymous and occupy the center of the network, from which their metaphorical extensions derive. In representational terms, the synonymy was modeled by means of synsets (as in WN.Pr), simple definitions (glosses) were assigned to each direction, and antonyms, hyperonymy, and hyponymy relations were established. Therefore, this research was developed in two research domains: Linguistic and Computational-Linguistics. In the first domain, (i) we study the theoretical cognitive knowledge that supports the spatial description of the basic sense of these items, and (ii) we identified their metaphorical senses based on a systematization of the different meanings attributed by Portuguese grammars. In the second domain, we proposed the systematization of the linguistic data in a conceptual hierarchy organized in a computationally treatable format implemented in Protégé, one of the most used editors for the creation and edition of ontologies. In total, 502 different senses were identified and indexed, creating the first version of PrepNet.Br, published online. Of these, 152 spatial meanings were attributed to 15 simple prepositions and 110 complex prepositions, while 350 are metaphorical meanings grouped into 114 classes. In addition, a total of 2358 sample sentences were selected. With the creation of this sense inventory, a new lexical resource was created for Brazilian Portuguese and the description of prepositions contributed to enrich the studies in Descriptive Linguistics. With the creation of this sense inventory, a new lexical resource was created for Brazilian Portuguese that may contribute to enrich the studies of Descriptive Linguistics and, even modestly, to the advance of NLP in Brazil.As preposições têm sido coadjuvantes nos estudos linguístico-computacionais seja por causa de sua alta frequência de ocorrência ou por serem consideradas meras partículas relacionais. Entretanto, alguns estudos mais recentes têm mostrado que a carga semântica das preposições é essencial para a compreensão das sentenças e que isso pode beneficiar aplicações computacionais. Inserida no contexto da Linguística Cognitiva e do Processamento Automático de Línguas Naturais (PLN), esta tese propõe a descrição e modelagem dos sentidos das preposições em termos de uma rede semântica inspirada na WordNet de Princeton (WN.Pr). Nesta proposta, defende-se que preposições com características espaciais semelhantes são sinônimas e ocupam o centro da rede, a partir do qual derivam suas extensões metafóricas. Em termos representacionais, a sinonímia entre os termos foi modelada por meio de \textit{synsets} (como na WN.Pr), definições simples (glosas) foram atribuídas a cada sentido e relações de antonímia, hiperonímia e hiponímia, estabelecidas. Para tanto, esta pesquisa desenvolveu-se em dois domínios de investigação: o Linguístico e o Linguístico-Computacional. No primeiro, (i) reuniu-se o conhecimento teórico cognitivo que dá suporte à descrição espacial do sentido de base desses itens e (ii) identificaram-se os seus sentidos metafóricos, a partir da sistematização dos diferentes sentidos atribuídos por gramáticas selecionadas. No segundo, sistematizaram-se os dados linguísticos em uma hierarquia conceitual organizada em um formato computacionalmente tratável implementado no Protégé, um dos editores mais utilizados para a criação e edição de ontologias. No total, 502 sentidos diferentes foram identificados e indexados, criando a primeira versão da PrepNet.Br, disponilizada \textit{online}. Desses sentidos, 151 são espaciais, atribuídos a um conjunto de 15 preposições simples e 110 locuções prepositivas, e 350 são metafóricos, designados por 114 classes. Além disso, um total de 2.358 frases-exemplo foram selecionadas. Com a criação desse inventário de sentidos, cria-se mais um recurso linguístico-computacional para o português brasileiro, contribuindo com os estudos descritivos do português e, ainda que de maneira modesta, para o avanço do PLN no Brasil.Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)porUniversidade Federal de São CarlosCâmpus São CarlosPrograma de Pós-Graduação em Linguística - PPGLUFSCarPreposiçõesRede semânticaLinguística cognitivaOntologiaPrepositionsSemantic networkCognitive linguisticsOntologyLINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADALINGUISTICA, LETRAS E ARTES::LINGUISTICA::TEORIA E ANALISE LINGUISTICAPrepNet.Br: uma proposta de representação semântica para as preposições do portuguêsPrepNet.Br: a proposal of semantic representation for the prepositions of Portugueseinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesisOnline60060026c5db60-6612-41e6-a8f9-f94fb475ca58info:eu-repo/semantics/openAccessreponame:Repositório Institucional da UFSCARinstname:Universidade Federal de São Carlos (UFSCAR)instacron:UFSCARORIGINALDeboraGarcia_VersaoFinal.pdfDeboraGarcia_VersaoFinal.pdfTese de doutoradoapplication/pdf17056969https://repositorio.ufscar.br/bitstream/ufscar/10006/1/DeboraGarcia_VersaoFinal.pdfda99488c5ec28bdcc57df422870196bcMD51Anexo I Carta Orientador.docxAnexo I Carta Orientador.docxCarta comprovante assinada pelo orientadoraapplication/vnd.openxmlformats-officedocument.wordprocessingml.document67228https://repositorio.ufscar.br/bitstream/ufscar/10006/3/Anexo%20I%20Carta%20Orientador.docx6ffbec4bfb0436f7ec1242b8bfe2a573MD53LICENSElicense.txtlicense.txttext/plain; charset=utf-81957https://repositorio.ufscar.br/bitstream/ufscar/10006/4/license.txtae0398b6f8b235e40ad82cba6c50031dMD54TEXTDeboraGarcia_VersaoFinal.pdf.txtDeboraGarcia_VersaoFinal.pdf.txtExtracted texttext/plain551595https://repositorio.ufscar.br/bitstream/ufscar/10006/5/DeboraGarcia_VersaoFinal.pdf.txt85007b95d6714c5e3d5a9f88dbeb0543MD55THUMBNAILDeboraGarcia_VersaoFinal.pdf.jpgDeboraGarcia_VersaoFinal.pdf.jpgIM Thumbnailimage/jpeg4071https://repositorio.ufscar.br/bitstream/ufscar/10006/6/DeboraGarcia_VersaoFinal.pdf.jpg639ec5582288d5a477da14cd84bff400MD56ufscar/100062023-09-18 18:31:42.407oai:repositorio.ufscar.br:ufscar/10006TElDRU7Dh0EgREUgRElTVFJJQlVJw4fDg08gTsODTy1FWENMVVNJVkEKCkNvbSBhIGFwcmVzZW50YcOnw6NvIGRlc3RhIGxpY2Vuw6dhLCB2b2PDqiAobyBhdXRvciAoZXMpIG91IG8gdGl0dWxhciBkb3MgZGlyZWl0b3MgZGUgYXV0b3IpIGNvbmNlZGUgw6AgVW5pdmVyc2lkYWRlCkZlZGVyYWwgZGUgU8OjbyBDYXJsb3MgbyBkaXJlaXRvIG7Do28tZXhjbHVzaXZvIGRlIHJlcHJvZHV6aXIsICB0cmFkdXppciAoY29uZm9ybWUgZGVmaW5pZG8gYWJhaXhvKSwgZS9vdQpkaXN0cmlidWlyIGEgc3VhIHRlc2Ugb3UgZGlzc2VydGHDp8OjbyAoaW5jbHVpbmRvIG8gcmVzdW1vKSBwb3IgdG9kbyBvIG11bmRvIG5vIGZvcm1hdG8gaW1wcmVzc28gZSBlbGV0csO0bmljbyBlCmVtIHF1YWxxdWVyIG1laW8sIGluY2x1aW5kbyBvcyBmb3JtYXRvcyDDoXVkaW8gb3UgdsOtZGVvLgoKVm9jw6ogY29uY29yZGEgcXVlIGEgVUZTQ2FyIHBvZGUsIHNlbSBhbHRlcmFyIG8gY29udGXDumRvLCB0cmFuc3BvciBhIHN1YSB0ZXNlIG91IGRpc3NlcnRhw6fDo28KcGFyYSBxdWFscXVlciBtZWlvIG91IGZvcm1hdG8gcGFyYSBmaW5zIGRlIHByZXNlcnZhw6fDo28uCgpWb2PDqiB0YW1iw6ltIGNvbmNvcmRhIHF1ZSBhIFVGU0NhciBwb2RlIG1hbnRlciBtYWlzIGRlIHVtYSBjw7NwaWEgYSBzdWEgdGVzZSBvdQpkaXNzZXJ0YcOnw6NvIHBhcmEgZmlucyBkZSBzZWd1cmFuw6dhLCBiYWNrLXVwIGUgcHJlc2VydmHDp8Ojby4KClZvY8OqIGRlY2xhcmEgcXVlIGEgc3VhIHRlc2Ugb3UgZGlzc2VydGHDp8OjbyDDqSBvcmlnaW5hbCBlIHF1ZSB2b2PDqiB0ZW0gbyBwb2RlciBkZSBjb25jZWRlciBvcyBkaXJlaXRvcyBjb250aWRvcwpuZXN0YSBsaWNlbsOnYS4gVm9jw6ogdGFtYsOpbSBkZWNsYXJhIHF1ZSBvIGRlcMOzc2l0byBkYSBzdWEgdGVzZSBvdSBkaXNzZXJ0YcOnw6NvIG7Do28sIHF1ZSBzZWphIGRlIHNldQpjb25oZWNpbWVudG8sIGluZnJpbmdlIGRpcmVpdG9zIGF1dG9yYWlzIGRlIG5pbmd1w6ltLgoKQ2FzbyBhIHN1YSB0ZXNlIG91IGRpc3NlcnRhw6fDo28gY29udGVuaGEgbWF0ZXJpYWwgcXVlIHZvY8OqIG7Do28gcG9zc3VpIGEgdGl0dWxhcmlkYWRlIGRvcyBkaXJlaXRvcyBhdXRvcmFpcywgdm9jw6oKZGVjbGFyYSBxdWUgb2J0ZXZlIGEgcGVybWlzc8OjbyBpcnJlc3RyaXRhIGRvIGRldGVudG9yIGRvcyBkaXJlaXRvcyBhdXRvcmFpcyBwYXJhIGNvbmNlZGVyIMOgIFVGU0NhcgpvcyBkaXJlaXRvcyBhcHJlc2VudGFkb3MgbmVzdGEgbGljZW7Dp2EsIGUgcXVlIGVzc2UgbWF0ZXJpYWwgZGUgcHJvcHJpZWRhZGUgZGUgdGVyY2Vpcm9zIGVzdMOhIGNsYXJhbWVudGUKaWRlbnRpZmljYWRvIGUgcmVjb25oZWNpZG8gbm8gdGV4dG8gb3Ugbm8gY29udGXDumRvIGRhIHRlc2Ugb3UgZGlzc2VydGHDp8OjbyBvcmEgZGVwb3NpdGFkYS4KCkNBU08gQSBURVNFIE9VIERJU1NFUlRBw4fDg08gT1JBIERFUE9TSVRBREEgVEVOSEEgU0lETyBSRVNVTFRBRE8gREUgVU0gUEFUUk9Dw41OSU8gT1UKQVBPSU8gREUgVU1BIEFHw4pOQ0lBIERFIEZPTUVOVE8gT1UgT1VUUk8gT1JHQU5JU01PIFFVRSBOw4NPIFNFSkEgQSBVRlNDYXIsClZPQ8OKIERFQ0xBUkEgUVVFIFJFU1BFSVRPVSBUT0RPUyBFIFFVQUlTUVVFUiBESVJFSVRPUyBERSBSRVZJU8ODTyBDT01PClRBTULDiU0gQVMgREVNQUlTIE9CUklHQcOHw5VFUyBFWElHSURBUyBQT1IgQ09OVFJBVE8gT1UgQUNPUkRPLgoKQSBVRlNDYXIgc2UgY29tcHJvbWV0ZSBhIGlkZW50aWZpY2FyIGNsYXJhbWVudGUgbyBzZXUgbm9tZSAocykgb3UgbyhzKSBub21lKHMpIGRvKHMpCmRldGVudG9yKGVzKSBkb3MgZGlyZWl0b3MgYXV0b3JhaXMgZGEgdGVzZSBvdSBkaXNzZXJ0YcOnw6NvLCBlIG7Do28gZmFyw6EgcXVhbHF1ZXIgYWx0ZXJhw6fDo28sIGFsw6ltIGRhcXVlbGFzCmNvbmNlZGlkYXMgcG9yIGVzdGEgbGljZW7Dp2EuCg==Repositório InstitucionalPUBhttps://repositorio.ufscar.br/oai/requestopendoar:43222023-09-18T18:31:42Repositório Institucional da UFSCAR - Universidade Federal de São Carlos (UFSCAR)false |
dc.title.por.fl_str_mv |
PrepNet.Br: uma proposta de representação semântica para as preposições do português |
dc.title.alternative.eng.fl_str_mv |
PrepNet.Br: a proposal of semantic representation for the prepositions of Portuguese |
title |
PrepNet.Br: uma proposta de representação semântica para as preposições do português |
spellingShingle |
PrepNet.Br: uma proposta de representação semântica para as preposições do português Garcia, Débora Domiciano Preposições Rede semântica Linguística cognitiva Ontologia Prepositions Semantic network Cognitive linguistics Ontology LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA LINGUISTICA, LETRAS E ARTES::LINGUISTICA::TEORIA E ANALISE LINGUISTICA |
title_short |
PrepNet.Br: uma proposta de representação semântica para as preposições do português |
title_full |
PrepNet.Br: uma proposta de representação semântica para as preposições do português |
title_fullStr |
PrepNet.Br: uma proposta de representação semântica para as preposições do português |
title_full_unstemmed |
PrepNet.Br: uma proposta de representação semântica para as preposições do português |
title_sort |
PrepNet.Br: uma proposta de representação semântica para as preposições do português |
author |
Garcia, Débora Domiciano |
author_facet |
Garcia, Débora Domiciano |
author_role |
author |
dc.contributor.authorlattes.por.fl_str_mv |
http://lattes.cnpq.br/0661331988415902 |
dc.contributor.author.fl_str_mv |
Garcia, Débora Domiciano |
dc.contributor.advisor1.fl_str_mv |
Di Felippo, Ariani |
dc.contributor.advisor1Lattes.fl_str_mv |
http://lattes.cnpq.br/8648412103197455 |
dc.contributor.advisor-co1.fl_str_mv |
Lopes, Marcos Fernando |
dc.contributor.advisor-co1Lattes.fl_str_mv |
http://lattes.cnpq.br/8585527032653186 |
dc.contributor.authorID.fl_str_mv |
be190aca-eb36-484d-950b-9a9e7604b126 |
contributor_str_mv |
Di Felippo, Ariani Lopes, Marcos Fernando |
dc.subject.por.fl_str_mv |
Preposições Rede semântica Linguística cognitiva Ontologia Prepositions |
topic |
Preposições Rede semântica Linguística cognitiva Ontologia Prepositions Semantic network Cognitive linguistics Ontology LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA LINGUISTICA, LETRAS E ARTES::LINGUISTICA::TEORIA E ANALISE LINGUISTICA |
dc.subject.eng.fl_str_mv |
Semantic network Cognitive linguistics Ontology |
dc.subject.cnpq.fl_str_mv |
LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA LINGUISTICA, LETRAS E ARTES::LINGUISTICA::TEORIA E ANALISE LINGUISTICA |
description |
In the last decades, the study of prepositions has been more modest when compared to the studies of nouns, verbs and adjectives, both in Linguistics and Natural Language Processing (NLP). However, some recent studies have shown that the semantic of prepositions is essential for the understanding of sentences and this can benefit computational applications. This thesis proposes the description of prepositions in terms of a semantic network inspired by WordNet (Princeton's Lexical Database for English, WN.Pr). In this proposal, it is argued that prepositions with similar spatial characteristics are synonymous and occupy the center of the network, from which their metaphorical extensions derive. In representational terms, the synonymy was modeled by means of synsets (as in WN.Pr), simple definitions (glosses) were assigned to each direction, and antonyms, hyperonymy, and hyponymy relations were established. Therefore, this research was developed in two research domains: Linguistic and Computational-Linguistics. In the first domain, (i) we study the theoretical cognitive knowledge that supports the spatial description of the basic sense of these items, and (ii) we identified their metaphorical senses based on a systematization of the different meanings attributed by Portuguese grammars. In the second domain, we proposed the systematization of the linguistic data in a conceptual hierarchy organized in a computationally treatable format implemented in Protégé, one of the most used editors for the creation and edition of ontologies. In total, 502 different senses were identified and indexed, creating the first version of PrepNet.Br, published online. Of these, 152 spatial meanings were attributed to 15 simple prepositions and 110 complex prepositions, while 350 are metaphorical meanings grouped into 114 classes. In addition, a total of 2358 sample sentences were selected. With the creation of this sense inventory, a new lexical resource was created for Brazilian Portuguese and the description of prepositions contributed to enrich the studies in Descriptive Linguistics. With the creation of this sense inventory, a new lexical resource was created for Brazilian Portuguese that may contribute to enrich the studies of Descriptive Linguistics and, even modestly, to the advance of NLP in Brazil. |
publishDate |
2018 |
dc.date.accessioned.fl_str_mv |
2018-05-14T20:23:55Z |
dc.date.available.fl_str_mv |
2018-05-14T20:23:55Z |
dc.date.issued.fl_str_mv |
2018-02-27 |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/doctoralThesis |
format |
doctoralThesis |
status_str |
publishedVersion |
dc.identifier.citation.fl_str_mv |
GARCIA, Débora Domiciano. PrepNet.Br: uma proposta de representação semântica para as preposições do português. 2018. Tese (Doutorado em Linguística) – Universidade Federal de São Carlos, São Carlos, 2018. Disponível em: https://repositorio.ufscar.br/handle/ufscar/10006. |
dc.identifier.uri.fl_str_mv |
https://repositorio.ufscar.br/handle/ufscar/10006 |
identifier_str_mv |
GARCIA, Débora Domiciano. PrepNet.Br: uma proposta de representação semântica para as preposições do português. 2018. Tese (Doutorado em Linguística) – Universidade Federal de São Carlos, São Carlos, 2018. Disponível em: https://repositorio.ufscar.br/handle/ufscar/10006. |
url |
https://repositorio.ufscar.br/handle/ufscar/10006 |
dc.language.iso.fl_str_mv |
por |
language |
por |
dc.relation.confidence.fl_str_mv |
600 600 |
dc.relation.authority.fl_str_mv |
26c5db60-6612-41e6-a8f9-f94fb475ca58 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.publisher.none.fl_str_mv |
Universidade Federal de São Carlos Câmpus São Carlos |
dc.publisher.program.fl_str_mv |
Programa de Pós-Graduação em Linguística - PPGL |
dc.publisher.initials.fl_str_mv |
UFSCar |
publisher.none.fl_str_mv |
Universidade Federal de São Carlos Câmpus São Carlos |
dc.source.none.fl_str_mv |
reponame:Repositório Institucional da UFSCAR instname:Universidade Federal de São Carlos (UFSCAR) instacron:UFSCAR |
instname_str |
Universidade Federal de São Carlos (UFSCAR) |
instacron_str |
UFSCAR |
institution |
UFSCAR |
reponame_str |
Repositório Institucional da UFSCAR |
collection |
Repositório Institucional da UFSCAR |
bitstream.url.fl_str_mv |
https://repositorio.ufscar.br/bitstream/ufscar/10006/1/DeboraGarcia_VersaoFinal.pdf https://repositorio.ufscar.br/bitstream/ufscar/10006/3/Anexo%20I%20Carta%20Orientador.docx https://repositorio.ufscar.br/bitstream/ufscar/10006/4/license.txt https://repositorio.ufscar.br/bitstream/ufscar/10006/5/DeboraGarcia_VersaoFinal.pdf.txt https://repositorio.ufscar.br/bitstream/ufscar/10006/6/DeboraGarcia_VersaoFinal.pdf.jpg |
bitstream.checksum.fl_str_mv |
da99488c5ec28bdcc57df422870196bc 6ffbec4bfb0436f7ec1242b8bfe2a573 ae0398b6f8b235e40ad82cba6c50031d 85007b95d6714c5e3d5a9f88dbeb0543 639ec5582288d5a477da14cd84bff400 |
bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 MD5 MD5 MD5 |
repository.name.fl_str_mv |
Repositório Institucional da UFSCAR - Universidade Federal de São Carlos (UFSCAR) |
repository.mail.fl_str_mv |
|
_version_ |
1802136338415222784 |