CORPORART - a corpus of Public Art for lexicon extraction: representativity and comparability in specialized corpora

Bibliographic details
Main author: Barbero, Chiara
Publication date: 2019
Document type: Article
Language: por
Source title: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Full text: https://doi.org/10.26334/2183-9077/rapln5ano2019a4
Abstract: This paper introduces CORPORART, a bilingual corpus of Public Art. CORPORART aims to gather, in a single collection of bilingual data, representative samples of specialized language in European Portuguese and Italian. The compilation of this corpus is part of an ongoing doctoral project that aims to integrate specialized lexical units into a pre-existing common-language resource, WordNet.PT (Marrafa et al., 2005), with a view to streamlining communication between heterogeneous interlocutors (Amaro & Mendes, 2012). On the assumption that the structure of a corpus depends heavily on the goals of the investigation, this paper presents the linguistic and extralinguistic parameters adopted for the construction and organization of the corpus, as well as the criteria for text processing. In particular, it examines the notions of representativity and comparability in light of the specificity of this case study, and outlines a proposed working practice for ensuring these two flexible dimensions in the context of specialized languages.
id RCAP_965f8dce34b2859e69e94c9fd24cfcf5
oai_identifier_str oai:ojs3.ojs.apl.pt:article/2
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
dc.title.none.fl_str_mv CORPORART - a corpus of Public Art for lexicon extraction: representativity and comparability in specialized corpora
CORPORART - um corpus de arte pública para a extração de léxico: representatividade e comparabilidade em corpora de especialidade
author Barbero, Chiara
author_facet Barbero, Chiara
author_role author
dc.contributor.author.fl_str_mv Barbero, Chiara
dc.subject.por.fl_str_mv corpora de especialidade
léxico de especialidade
organização do corpus
representatividade
arte pública
specialized corpora
specialized lexicon
corpus organization
representativity
public art
publishDate 2019
dc.date.none.fl_str_mv 2019-11-21
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://doi.org/10.26334/2183-9077/rapln5ano2019a4
dc.language.iso.fl_str_mv por
language por
dc.relation.none.fl_str_mv https://ojs.apl.pt/index.php/rapl/article/view/2
https://ojs.apl.pt/index.php/rapl/article/view/2/27
dc.rights.driver.fl_str_mv Copyright (c) 2019 Chiara Barbero
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Copyright (c) 2019 Chiara Barbero
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Associação Portuguesa de Linguística
publisher.none.fl_str_mv Associação Portuguesa de Linguística
dc.source.none.fl_str_mv Revista da Associação Portuguesa de Linguística; No. 5 (2019): Journal of the Portuguese Linguistics Association; 43-57
2183-9077
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
_version_ 1799133622595223552