Minorities languages and syntactic annotations of corpora: research experiences in scientific initiation
Main author: | Luiza Santos, Luana |
---|---|
Publication date: | 2024 |
Other authors: | Coelho Aragon, Carolina; Gerardi, Fabrício |
Document type: | Article |
Language: | por |
Source title: | Letras de Hoje (Online) |
Full text: | https://revistaseletronicas.pucrs.br/ojs/index.php/fale/article/view/44734 |
Abstract: | Many of the Brazilian indigenous languages are endangered. In most cases, revitalization and conservation strategies for these languages are essential (Crystal, 2002; Harrison, 2007), requiring continuous processes of promoting language policies and actions focused on indigenous school education. This article presents the use of linguistic tools associated with the construction of treebanks (corpora of texts with syntactic and morphological annotations) and the description of two minority indigenous languages belonging to the Tupían linguistic family spoken in the southwestern Amazon, Brazil. The treebanks, part of the Universal Dependencies project (De Marneffe et al., 2021; Duran et al., 2022), form the basis of experiments conducted in the Institutional Program for Scientific Initiation Scholarships at the Federal University of Paraíba (2021-2022), entitled "Education, Linguistics, History, and Indigenous Communities." We discuss the application of these tools in linguistic description and their relationship with the study of indigenous language typology. Furthermore, we explore the intersection of computational linguistics with descriptive linguistics. |
id |
PUC_RS-19_3f3dcfd7c274c27dbd9454d5a711bb22 |
---|---|
oai_identifier_str |
oai:ojs.revistaseletronicas.pucrs.br:article/44734 |
network_acronym_str |
PUC_RS-19 |
network_name_str |
Letras de Hoje (Online) |
repository_id_str |
|
dc.title.none.fl_str_mv |
Minorities languages and syntactic annotations of corpora: research experiences in scientific initiation || Lenguas minoritarias y anotaciones sintácticas de corpora: experiencias de investigación en la iniciación científica || Línguas minoritárias e anotações sintáticas de corpora: experiências de pesquisa na iniciação científica |
title |
Minorities languages and syntactic annotations of corpora: research experiences in scientific initiation |
author |
Luiza Santos, Luana |
author_facet |
Luiza Santos, Luana || Coelho Aragon, Carolina || Gerardi, Fabrício |
author_role |
author |
author2 |
Coelho Aragon, Carolina || Gerardi, Fabrício |
author2_role |
author author |
dc.contributor.author.fl_str_mv |
Luiza Santos, Luana || Coelho Aragon, Carolina || Gerardi, Fabrício |
dc.subject.por.fl_str_mv |
computational linguistics || descriptive linguistics || Tupían languages || universal dependencies || treebanks |
topic |
computational linguistics || descriptive linguistics || Tupían languages || universal dependencies || treebanks |
description |
Many of the Brazilian indigenous languages are endangered. In most cases, revitalization and conservation strategies for these languages are essential (Crystal, 2002; Harrison, 2007), requiring continuous processes of promoting language policies and actions focused on indigenous school education. This article presents the use of linguistic tools associated with the construction of treebanks (corpora of texts with syntactic and morphological annotations) and the description of two minority indigenous languages belonging to the Tupían linguistic family spoken in the southwestern Amazon, Brazil. The treebanks, part of the Universal Dependencies project (De Marneffe et al., 2021; Duran et al., 2022), form the basis of experiments conducted in the Institutional Program for Scientific Initiation Scholarships at the Federal University of Paraíba (2021-2022), entitled "Education, Linguistics, History, and Indigenous Communities." We discuss the application of these tools in linguistic description and their relationship with the study of indigenous language typology. Furthermore, we explore the intersection of computational linguistics with descriptive linguistics. |
publishDate |
2024 |
dc.date.none.fl_str_mv |
2024-01-11 |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article || info:eu-repo/semantics/publishedVersion |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://revistaseletronicas.pucrs.br/ojs/index.php/fale/article/view/44734 || 10.15448/1984-7726.2023.1.44734 |
url |
https://revistaseletronicas.pucrs.br/ojs/index.php/fale/article/view/44734 |
identifier_str_mv |
10.15448/1984-7726.2023.1.44734 |
dc.language.iso.fl_str_mv |
por |
language |
por |
dc.relation.none.fl_str_mv |
https://revistaseletronicas.pucrs.br/ojs/index.php/fale/article/view/44734/28448 |
dc.rights.driver.fl_str_mv |
Copyright (c) 2024 Letras de Hoje || http://creativecommons.org/licenses/by/4.0 || info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
Copyright (c) 2024 Letras de Hoje || http://creativecommons.org/licenses/by/4.0 |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Editora da PUCRS - ediPUCRS |
publisher.none.fl_str_mv |
Editora da PUCRS - ediPUCRS |
dc.source.none.fl_str_mv |
Letras de Hoje; Vol. 59 No. 1 (2024): Single Volume - Continuous flow; e44734 || Letras de Hoje; Vol. 59 Núm. 1 (2024): Volumen Único - Flujo continuo; e44734 || Letras de Hoje; v. 59 n. 1 (2024): Volume único - Fluxo contínuo; e44734 || 1984-7726 || 0101-3335 || 10.15448/1984-7726.2024.1 || reponame:Letras de Hoje (Online) || instname:Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS) || instacron:PUC_RS |
instname_str |
Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS) |
instacron_str |
PUC_RS |
institution |
PUC_RS |
reponame_str |
Letras de Hoje (Online) |
collection |
Letras de Hoje (Online) |
repository.name.fl_str_mv |
Letras de Hoje (Online) - Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS) |
repository.mail.fl_str_mv |
editora.periodicos@pucrs.br || letrasdehoje@pucrs.br |
_version_ |
1799128772616650752 |