From the contact between Literature, Corpus Linguistics and Natural Language Processing: the case of the anagrammatics of Guimarães Rosa

Detalhes bibliográficos
Autor(a) principal: Vital, Átila Augusto Soares
Data de Publicação: 2022
Tipo de documento: Artigo
Idioma: por
Título da fonte: Texto livre
Texto Completo: https://periodicos.ufmg.br/index.php/textolivre/article/view/39316
Resumo: From the attempt to achieve the cooperation between Corpus Linguistics and the Natural Language Processing (NLP), important products have been created, as the possibility of processing lots of linguistic data and developing technologies that use language. The relationship between those areas and the Literary Studies, however, has been less studied, opening spaces for this study, which has the objective of carrying out an exploratory analysis of the poems assigned to the anagrammatics of João Guimarães Rosa, in Ave, Palavra, from 1970. In order to do so, approaches of Corpus Linguistics and NLP were used together, associated with the works of Rossi (2007), Brito (2012) and Vital (2021), about the rosian oeuvre. Using computational processing, we extracted the following data from the corpus: a) the number of words; b) type-token ratio; c) the number of stanzas; d) the most frequent words for each anagrammatics. The data were displayed in the form of graphics and word clouds. From the results, we observed that there are quantitative and qualitative differences for each poet, reinforcing, through observations of the epigraphs of each author, the complexity involved in the metapoeticity of anagrammatic masks.
id UFMG-9_c05a203bfc53134598f8eb51710169a5
oai_identifier_str oai:periodicos.ufmg.br:article/39316
network_acronym_str UFMG-9
network_name_str Texto livre
repository_id_str
spelling From the contact between Literature, Corpus Linguistics and Natural Language Processing: the case of the anagrammatics of Guimarães RosaDo contato entre a Literatura, a Linguística de Corpus e o Processamento de Língua Natural: o caso dos anagramáticos de Guimarães RosaLinguística de CorpusProcessamento de Língua NaturalGuimarães RosaCorpus LinguisticsNatural Language ProcessingGuimarães RosaFrom the attempt to achieve the cooperation between Corpus Linguistics and the Natural Language Processing (NLP), important products have been created, as the possibility of processing lots of linguistic data and developing technologies that use language. The relationship between those areas and the Literary Studies, however, has been less studied, opening spaces for this study, which has the objective of carrying out an exploratory analysis of the poems assigned to the anagrammatics of João Guimarães Rosa, in Ave, Palavra, from 1970. In order to do so, approaches of Corpus Linguistics and NLP were used together, associated with the works of Rossi (2007), Brito (2012) and Vital (2021), about the rosian oeuvre. Using computational processing, we extracted the following data from the corpus: a) the number of words; b) type-token ratio; c) the number of stanzas; d) the most frequent words for each anagrammatics. The data were displayed in the form of graphics and word clouds. From the results, we observed that there are quantitative and qualitative differences for each poet, reinforcing, through observations of the epigraphs of each author, the complexity involved in the metapoeticity of anagrammatic masks.Da tentativa de realizar a cooperação entre a Linguística de Corpus e o Processamento de Língua Natural (PLN), foram alcançados importantes frutos, como a possibilidade de processamento de grandes dados linguísticos e o desenvolvimento de tecnologias que se utilizam de dados da língua. A relação entre essas duas áreas e os Estudos Literários, no entanto, tem sido pouco explorada, o que abre espaços para o presente trabalho, que tem por objetivo fazer uma análise exploratória da construção dos poemas atribuídos a anagramáticos de João Guimarães Rosa, em Ave, Palavra, obra de 1970. Para isso, foram utilizadas, em conjunto, abordagens da Linguística de Corpus e do PLN, associadas aos trabalhos de Rossi (2007), Brito (2012) e Vital (2021), acerca da obra rosiana. Com o processamento computacional do corpus, pudemos extrair: a) o número de palavras; b) a razão type-token; c) o número de estrofes e de versos e d) as palavras mais frequentes para cada um dos anagramáticos. Os dados foram dispostos em gráficos e nuvens de palavras (wordclouds). Desses resultados, foi observado que existem, de fato, diferenças quantitativas e qualitativas presentes no nível poético, reafirmando, por meio de observações das epígrafes de cada anagramático, a complexidade envolvida na criação da metapoeticidade de suas máscaras.Universidade Federal de Minas Gerais2022-09-14info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionArtigo avaliado pelos paresapplication/pdfhttps://periodicos.ufmg.br/index.php/textolivre/article/view/3931610.35699/1983-3652.2022.39316Texto Livre; Vol. 15 (2022): Texto Livre: Linguagem e Tecnologia ; e39316Texto Livre; Vol. 15 (2022): Texto Livre: Linguagem e Tecnologia ; e39316Texto Livre; Vol. 15 (2022): Texto Livre: Linguagem e Tecnologia ; e39316Texto Livre; v. 15 (2022): Texto Livre: Linguagem e Tecnologia ; e393161983-3652reponame:Texto livreinstname:Universidade Federal de Minas Gerais (UFMG)instacron:UFMGporhttps://periodicos.ufmg.br/index.php/textolivre/article/view/39316/31382Copyright (c) 2022 Átila Augusto Soares Vitalhttps://creativecommons.org/licenses/by/4.0info:eu-repo/semantics/openAccessVital, Átila Augusto Soares 2022-12-30T14:26:27Zoai:periodicos.ufmg.br:article/39316Revistahttp://www.periodicos.letras.ufmg.br/index.php/textolivrePUBhttps://periodicos.ufmg.br/index.php/textolivre/oairevistatextolivre@letras.ufmg.br1983-36521983-3652opendoar:2022-12-30T14:26:27Texto livre - Universidade Federal de Minas Gerais (UFMG)false
dc.title.none.fl_str_mv From the contact between Literature, Corpus Linguistics and Natural Language Processing: the case of the anagrammatics of Guimarães Rosa
Do contato entre a Literatura, a Linguística de Corpus e o Processamento de Língua Natural: o caso dos anagramáticos de Guimarães Rosa
title From the contact between Literature, Corpus Linguistics and Natural Language Processing: the case of the anagrammatics of Guimarães Rosa
spellingShingle From the contact between Literature, Corpus Linguistics and Natural Language Processing: the case of the anagrammatics of Guimarães Rosa
Vital, Átila Augusto Soares
Linguística de Corpus
Processamento de Língua Natural
Guimarães Rosa
Corpus Linguistics
Natural Language Processing
Guimarães Rosa
title_short From the contact between Literature, Corpus Linguistics and Natural Language Processing: the case of the anagrammatics of Guimarães Rosa
title_full From the contact between Literature, Corpus Linguistics and Natural Language Processing: the case of the anagrammatics of Guimarães Rosa
title_fullStr From the contact between Literature, Corpus Linguistics and Natural Language Processing: the case of the anagrammatics of Guimarães Rosa
title_full_unstemmed From the contact between Literature, Corpus Linguistics and Natural Language Processing: the case of the anagrammatics of Guimarães Rosa
title_sort From the contact between Literature, Corpus Linguistics and Natural Language Processing: the case of the anagrammatics of Guimarães Rosa
author Vital, Átila Augusto Soares
author_facet Vital, Átila Augusto Soares
author_role author
dc.contributor.author.fl_str_mv Vital, Átila Augusto Soares
dc.subject.por.fl_str_mv Linguística de Corpus
Processamento de Língua Natural
Guimarães Rosa
Corpus Linguistics
Natural Language Processing
Guimarães Rosa
topic Linguística de Corpus
Processamento de Língua Natural
Guimarães Rosa
Corpus Linguistics
Natural Language Processing
Guimarães Rosa
description From the attempt to achieve the cooperation between Corpus Linguistics and the Natural Language Processing (NLP), important products have been created, as the possibility of processing lots of linguistic data and developing technologies that use language. The relationship between those areas and the Literary Studies, however, has been less studied, opening spaces for this study, which has the objective of carrying out an exploratory analysis of the poems assigned to the anagrammatics of João Guimarães Rosa, in Ave, Palavra, from 1970. In order to do so, approaches of Corpus Linguistics and NLP were used together, associated with the works of Rossi (2007), Brito (2012) and Vital (2021), about the rosian oeuvre. Using computational processing, we extracted the following data from the corpus: a) the number of words; b) type-token ratio; c) the number of stanzas; d) the most frequent words for each anagrammatics. The data were displayed in the form of graphics and word clouds. From the results, we observed that there are quantitative and qualitative differences for each poet, reinforcing, through observations of the epigraphs of each author, the complexity involved in the metapoeticity of anagrammatic masks.
publishDate 2022
dc.date.none.fl_str_mv 2022-09-14
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
Artigo avaliado pelos pares
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://periodicos.ufmg.br/index.php/textolivre/article/view/39316
10.35699/1983-3652.2022.39316
url https://periodicos.ufmg.br/index.php/textolivre/article/view/39316
identifier_str_mv 10.35699/1983-3652.2022.39316
dc.language.iso.fl_str_mv por
language por
dc.relation.none.fl_str_mv https://periodicos.ufmg.br/index.php/textolivre/article/view/39316/31382
dc.rights.driver.fl_str_mv Copyright (c) 2022 Átila Augusto Soares Vital
https://creativecommons.org/licenses/by/4.0
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Copyright (c) 2022 Átila Augusto Soares Vital
https://creativecommons.org/licenses/by/4.0
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade Federal de Minas Gerais
publisher.none.fl_str_mv Universidade Federal de Minas Gerais
dc.source.none.fl_str_mv Texto Livre; Vol. 15 (2022): Texto Livre: Linguagem e Tecnologia ; e39316
Texto Livre; Vol. 15 (2022): Texto Livre: Linguagem e Tecnologia ; e39316
Texto Livre; Vol. 15 (2022): Texto Livre: Linguagem e Tecnologia ; e39316
Texto Livre; v. 15 (2022): Texto Livre: Linguagem e Tecnologia ; e39316
1983-3652
reponame:Texto livre
instname:Universidade Federal de Minas Gerais (UFMG)
instacron:UFMG
instname_str Universidade Federal de Minas Gerais (UFMG)
instacron_str UFMG
institution UFMG
reponame_str Texto livre
collection Texto livre
repository.name.fl_str_mv Texto livre - Universidade Federal de Minas Gerais (UFMG)
repository.mail.fl_str_mv revistatextolivre@letras.ufmg.br
_version_ 1799711143873216512