Minorities languages and syntactic annotations of corpora: research experiences in scientific initiation

Bibliographic details
Main author: Luiza Santos, Luana
Publication date: 2024
Other authors: Coelho Aragon, Carolina; Gerardi, Fabrício
Document type: Article
Language: Portuguese (por)
Source title: Letras de Hoje (Online)
Full text: https://revistaseletronicas.pucrs.br/ojs/index.php/fale/article/view/44734
Abstract: Many Brazilian indigenous languages are endangered. In most cases, revitalization and conservation strategies for these languages are essential (Crystal, 2002; Harrison, 2007) and require continuous promotion of language policies and actions focused on indigenous school education. This article presents the use of linguistic tools associated with the construction of treebanks (text corpora with syntactic and morphological annotations) and the description of two minority indigenous languages of the Tupían family spoken in the southwestern Amazon, Brazil. The treebanks, part of the Universal Dependencies project (De Marneffe et al., 2021; Duran et al., 2022), form the basis of experiments conducted under the Institutional Program for Scientific Initiation Scholarships at the Federal University of Paraíba (2021-2022), in the project entitled "Education, Linguistics, History, and Indigenous Communities." We discuss the application of these tools in linguistic description and their relationship with the study of indigenous language typology. Furthermore, we explore the intersection of computational linguistics with descriptive linguistics.
id PUC_RS-19_3f3dcfd7c274c27dbd9454d5a711bb22
oai_identifier_str oai:ojs.revistaseletronicas.pucrs.br:article/44734
network_acronym_str PUC_RS-19
network_name_str Letras de Hoje (Online)
dc.title.none.fl_str_mv Minorities languages and syntactic annotations of corpora: research experiences in scientific initiation
Lenguas minoritarias y anotaciones sintácticas de corpora: experiencias de investigación en la iniciación científica
Línguas minoritárias e anotações sintáticas de corpora: experiências de pesquisa na iniciação científica
dc.contributor.author.fl_str_mv Luiza Santos, Luana
Coelho Aragon, Carolina
Gerardi, Fabrício
dc.subject.por.fl_str_mv linguística computacional
linguística descritiva
línguas Tupí
dependências universais
treebanks.
computational linguistics
descriptive linguistics
Tupían languages
universal dependencies
treebanks.
lingüística computacional
lingüística descriptiva
lenguas Tupí
dependencias universales
treebanks.
publishDate 2024
dc.date.none.fl_str_mv 2024-01-11
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://revistaseletronicas.pucrs.br/ojs/index.php/fale/article/view/44734
10.15448/1984-7726.2023.1.44734
dc.language.iso.fl_str_mv por
dc.relation.none.fl_str_mv https://revistaseletronicas.pucrs.br/ojs/index.php/fale/article/view/44734/28448
dc.rights.driver.fl_str_mv Copyright (c) 2024 Letras de Hoje
http://creativecommons.org/licenses/by/4.0
info:eu-repo/semantics/openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Editora da PUCRS - ediPUCRS
dc.source.none.fl_str_mv Letras de Hoje; Vol. 59 No. 1 (2024): Single Volume - Continuous flow; e44734
Letras de Hoje; Vol. 59 Núm. 1 (2024): Volumen Único - Flujo continuo; e44734
Letras de Hoje; v. 59 n. 1 (2024): Volume único - Fluxo contínuo; e44734
1984-7726
0101-3335
10.15448/1984-7726.2024.1
reponame:Letras de Hoje (Online)
instname:Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS)
instacron:PUC_RS
repository.name.fl_str_mv Letras de Hoje (Online) - Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS)
repository.mail.fl_str_mv editora.periodicos@pucrs.br || letrasdehoje@pucrs.br