AN ANNOTATED CORPUS WITH SUPPORT VERB CONSTRUCTIONS IN PORTUGUESE
Autor(a) principal: | |
---|---|
Data de Publicação: | 2015 |
Outros Autores: | , |
Tipo de documento: | Artigo |
Idioma: | por |
Título da fonte: | Gragoatá |
Texto Completo: | https://periodicos.uff.br/gragoata/article/view/33307 |
Resumo: | The support verb constructions (SVC) are a type of nominal construction, where the core predicate is the noun, called 'predicative noun' (Npred), which is assisted by a verb, called 'support verb' (Vsup). The Lexicon‑Grammar theoretical and methodological framework was adopted, in this paper, for the linguistic description and formalization of SVC in Portuguese. Considering the syntactic and semantic differences between SVC and other types of constructions, the purpose of this paper is to present the methodology and results of creating a corpus annotated with Vsup and Npred. A list with 4,668 SVC was built, considering 45 variants of Vsup and around 3,200 different Npred. Based on this list, we extracted 121,198 sentences from PLN.Br full corpus, from which 2,646 sentences have been manually annotated. This sample may constitute a reference corpus for the processing of SVC and used as a golden standard for evaluating the automatic tasks of identification, extraction or classification of SVC, as well as for other Natural Language Processing (NLP) applications. |
id |
UFF-6_e4b0439f2263f6de7bcce10afd5bd058 |
---|---|
oai_identifier_str |
oai:ojs.pkp.sfu.ca:article/33307 |
network_acronym_str |
UFF-6 |
network_name_str |
Gragoatá |
repository_id_str |
|
spelling |
AN ANNOTATED CORPUS WITH SUPPORT VERB CONSTRUCTIONS IN PORTUGUESEUM ‘CORPUS’ ANOTADO DE CONSTRUÇÕES COM VERBO-SUPORTE EM PORTUGUÊSsupport verbpredicative noun. Lexicon Grammarcorpus annotation.Teoria e análise linguísticaVerbo-suporteNome predicativoLéxico GramáticaAnotação de corpus.The support verb constructions (SVC) are a type of nominal construction, where the core predicate is the noun, called 'predicative noun' (Npred), which is assisted by a verb, called 'support verb' (Vsup). The Lexicon‑Grammar theoretical and methodological framework was adopted, in this paper, for the linguistic description and formalization of SVC in Portuguese. Considering the syntactic and semantic differences between SVC and other types of constructions, the purpose of this paper is to present the methodology and results of creating a corpus annotated with Vsup and Npred. A list with 4,668 SVC was built, considering 45 variants of Vsup and around 3,200 different Npred. Based on this list, we extracted 121,198 sentences from PLN.Br full corpus, from which 2,646 sentences have been manually annotated. This sample may constitute a reference corpus for the processing of SVC and used as a golden standard for evaluating the automatic tasks of identification, extraction or classification of SVC, as well as for other Natural Language Processing (NLP) applications.As construções com verbo‑suporte (CVS) são um tipo de construção nominal, em que o predicador central é o nome, chamado de nome predicativo (Npred), e este é auxiliado por um verbo, chamado verbo-suporte (Vsup). A abordagem utilizada para a descrição e formalização das CVS, neste artigo, é o Léxico‑Gramática. Tendo em vista as diferenças sintáticas e semânticas das CVS em relação a outros tipos de construções, o objetivo deste artigo é apresentar a metodologia e os resultados da construção de um corpus anotado com construções de Vsup e de Npred. Foi construída uma lista com 4.668 CVS, considerando‑se 45 variantes de Vsup e cerca de 3.200 Npred diferentes. A partir dessa lista de CVS, foram extraídas 121.198 frases do corpus PLN.Br Full, das quais foi anotada e revista manualmente uma amostra com 2.646 frases, que constituem o corpus de referência para o tratamento de CVS em Português. Esse corpus de referência poderá ser utilizado como golden standard para avaliar tarefas automáticas de identificação, extração ou classificação de CVS ou ainda para outras aplicações de Processamento Automático de Língua Natural (PLN).Universidade Federal Fluminense2015-06-30info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionapplication/pdfhttps://periodicos.uff.br/gragoata/article/view/3330710.22409/gragoata.v20i38.33307Gragoatá; Vol. 20 No. 38 (2015): Language StudiesGragoatá; v. 20 n. 38 (2015): Estudos de Linguagem2358-41141413-907310.22409/gragoata.v20i38reponame:Gragoatáinstname:Universidade Federal Fluminense (UFF)instacron:UFFporhttps://periodicos.uff.br/gragoata/article/view/33307/19294Copyright (c) 2019 Gragoatáinfo:eu-repo/semantics/openAccessRassi, Amanda PontesBaptista, JorgeVale, Oto Araújo2019-08-23T11:04:49Zoai:ojs.pkp.sfu.ca:article/33307Revistahttps://periodicos.uff.br/gragoataPUBhttps://periodicos.uff.br/gragoata/oai||revistagragoata@gmail.com2358-41141413-9073opendoar:2019-08-23T11:04:49Gragoatá - Universidade Federal Fluminense (UFF)false |
dc.title.none.fl_str_mv |
AN ANNOTATED CORPUS WITH SUPPORT VERB CONSTRUCTIONS IN PORTUGUESE UM ‘CORPUS’ ANOTADO DE CONSTRUÇÕES COM VERBO-SUPORTE EM PORTUGUÊS |
title |
AN ANNOTATED CORPUS WITH SUPPORT VERB CONSTRUCTIONS IN PORTUGUESE |
spellingShingle |
AN ANNOTATED CORPUS WITH SUPPORT VERB CONSTRUCTIONS IN PORTUGUESE Rassi, Amanda Pontes support verb predicative noun. Lexicon Grammar corpus annotation. Teoria e análise linguística Verbo-suporte Nome predicativo Léxico Gramática Anotação de corpus. |
title_short |
AN ANNOTATED CORPUS WITH SUPPORT VERB CONSTRUCTIONS IN PORTUGUESE |
title_full |
AN ANNOTATED CORPUS WITH SUPPORT VERB CONSTRUCTIONS IN PORTUGUESE |
title_fullStr |
AN ANNOTATED CORPUS WITH SUPPORT VERB CONSTRUCTIONS IN PORTUGUESE |
title_full_unstemmed |
AN ANNOTATED CORPUS WITH SUPPORT VERB CONSTRUCTIONS IN PORTUGUESE |
title_sort |
AN ANNOTATED CORPUS WITH SUPPORT VERB CONSTRUCTIONS IN PORTUGUESE |
author |
Rassi, Amanda Pontes |
author_facet |
Rassi, Amanda Pontes Baptista, Jorge Vale, Oto Araújo |
author_role |
author |
author2 |
Baptista, Jorge Vale, Oto Araújo |
author2_role |
author author |
dc.contributor.author.fl_str_mv |
Rassi, Amanda Pontes Baptista, Jorge Vale, Oto Araújo |
dc.subject.por.fl_str_mv |
support verb predicative noun. Lexicon Grammar corpus annotation. Teoria e análise linguística Verbo-suporte Nome predicativo Léxico Gramática Anotação de corpus. |
topic |
support verb predicative noun. Lexicon Grammar corpus annotation. Teoria e análise linguística Verbo-suporte Nome predicativo Léxico Gramática Anotação de corpus. |
description |
The support verb constructions (SVC) are a type of nominal construction, where the core predicate is the noun, called 'predicative noun' (Npred), which is assisted by a verb, called 'support verb' (Vsup). The Lexicon‑Grammar theoretical and methodological framework was adopted, in this paper, for the linguistic description and formalization of SVC in Portuguese. Considering the syntactic and semantic differences between SVC and other types of constructions, the purpose of this paper is to present the methodology and results of creating a corpus annotated with Vsup and Npred. A list with 4,668 SVC was built, considering 45 variants of Vsup and around 3,200 different Npred. Based on this list, we extracted 121,198 sentences from PLN.Br full corpus, from which 2,646 sentences have been manually annotated. This sample may constitute a reference corpus for the processing of SVC and used as a golden standard for evaluating the automatic tasks of identification, extraction or classification of SVC, as well as for other Natural Language Processing (NLP) applications. |
publishDate |
2015 |
dc.date.none.fl_str_mv |
2015-06-30 |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://periodicos.uff.br/gragoata/article/view/33307 10.22409/gragoata.v20i38.33307 |
url |
https://periodicos.uff.br/gragoata/article/view/33307 |
identifier_str_mv |
10.22409/gragoata.v20i38.33307 |
dc.language.iso.fl_str_mv |
por |
language |
por |
dc.relation.none.fl_str_mv |
https://periodicos.uff.br/gragoata/article/view/33307/19294 |
dc.rights.driver.fl_str_mv |
Copyright (c) 2019 Gragoatá info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
Copyright (c) 2019 Gragoatá |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Universidade Federal Fluminense |
publisher.none.fl_str_mv |
Universidade Federal Fluminense |
dc.source.none.fl_str_mv |
Gragoatá; Vol. 20 No. 38 (2015): Language Studies Gragoatá; v. 20 n. 38 (2015): Estudos de Linguagem 2358-4114 1413-9073 10.22409/gragoata.v20i38 reponame:Gragoatá instname:Universidade Federal Fluminense (UFF) instacron:UFF |
instname_str |
Universidade Federal Fluminense (UFF) |
instacron_str |
UFF |
institution |
UFF |
reponame_str |
Gragoatá |
collection |
Gragoatá |
repository.name.fl_str_mv |
Gragoatá - Universidade Federal Fluminense (UFF) |
repository.mail.fl_str_mv |
||revistagragoata@gmail.com |
_version_ |
1799705501088350208 |