Lexical density in texts generated by ChatGPT: implications of artificial intelligence for writing in additional languages
Autor(a) principal: | |
---|---|
Data de Publicação: | 2023 |
Outros Autores: | |
Tipo de documento: | Artigo |
Idioma: | por |
Título da fonte: | Texto livre |
Texto Completo: | https://periodicos.ufmg.br/index.php/textolivre/article/view/47836 |
Resumo: | Technological advancement has had a significant impact on written production, especially in Additional Languages (ALs). Although technology has brought new opportunities for AL teaching, it also poses challenges, including concerns about the complexity of writing and the authenticity of students’ work. One such tool is ChatGPT, an artificial intelligence (AI) platform that has been the subject of debate since its popularization in 2022. This study analyses a corpus consisting of six tasks produced by ChatGPT in five languages (German, Spanish, French, Italian, and Portuguese), considering the proficiency levels proposed by the Common European Framework of Reference for Languages (CEFR), totalling 2991 texts and 706,401 words. The data were generated by students in a computer lab at a British university from 100 different profiles on the ChatGPT platform, following the researchers’ instructions. Data analysis employs Systemic Functional Linguistics (SFL) and the concept of lexical density (Halliday, 1985, 1987, 1993; Halliday; Matthiessen, 2014) to investigate the complexity of the produced texts, as lexical complexity is related to proficiency in writing, where more advanced texts proportionally use more “content words” (nouns, verbs, adjectives, and some adverbs of manner). The results reveal that ChatGPT does not adhere to task instructions regarding the requested word count, thereby impacting the calculation of lexical density, nor does it produce texts that show significant differences in lexical density among additional languages and proficiency levels. |
id |
UFMG-9_78ab25bbeee6f4349ab707a000b2e107 |
---|---|
oai_identifier_str |
oai:periodicos.ufmg.br:article/47836 |
network_acronym_str |
UFMG-9 |
network_name_str |
Texto livre |
repository_id_str |
|
spelling |
Lexical density in texts generated by ChatGPT: implications of artificial intelligence for writing in additional languagesDensidade lexical em textos gerados pelo ChatGPT: implicações da inteligência artificial para a escrita em línguas adicionaisLínguas adicionaisChatGPTInteligência ArtificialLinguística Sistêmico-funcionalDensidade lexicalAdditional languagesChatGPTArtificial IntelligenceSystemic Functional LinguisticsLexical densityTechnological advancement has had a significant impact on written production, especially in Additional Languages (ALs). Although technology has brought new opportunities for AL teaching, it also poses challenges, including concerns about the complexity of writing and the authenticity of students’ work. One such tool is ChatGPT, an artificial intelligence (AI) platform that has been the subject of debate since its popularization in 2022. This study analyses a corpus consisting of six tasks produced by ChatGPT in five languages (German, Spanish, French, Italian, and Portuguese), considering the proficiency levels proposed by the Common European Framework of Reference for Languages (CEFR), totalling 2991 texts and 706,401 words. The data were generated by students in a computer lab at a British university from 100 different profiles on the ChatGPT platform, following the researchers’ instructions. Data analysis employs Systemic Functional Linguistics (SFL) and the concept of lexical density (Halliday, 1985, 1987, 1993; Halliday; Matthiessen, 2014) to investigate the complexity of the produced texts, as lexical complexity is related to proficiency in writing, where more advanced texts proportionally use more “content words” (nouns, verbs, adjectives, and some adverbs of manner). The results reveal that ChatGPT does not adhere to task instructions regarding the requested word count, thereby impacting the calculation of lexical density, nor does it produce texts that show significant differences in lexical density among additional languages and proficiency levels.O avanço tecnológico tem tido um grande impacto na produção escrita, especialmente em Línguas Adicionais (LAs). Embora a tecnologia tenha trazido novas oportunidades para o ensino de LAs, ela também apresenta desafios, incluindo preocupações sobre a complexidade da escrita e a autenticidade dos trabalhos dos alunos. Uma dessas ferramentas é o ChatGPT, plataforma de inteligência artificial (IA) que tem sido objeto de debates desde sua popularização em 2022. Este estudo analisa um corpus composto por seis tarefas produzidas pelo ChatGPT em cinco idiomas (alemão, espanhol, francês, italiano e português), considerando os níveis de proficiência propostos pelo Quadro Comum Europeu de Referência para Línguas (CEFR), que totalizou 2991 textos e 706,401 palavras. Os dados foram gerados por alunos em um laboratório de informática em uma universidade britânica a partir de 100 diferentes perfis na plataforma do ChatGPT, seguindo instruções dos pesquisadores. A análise dos dados utiliza a linguística sistêmico-funcional (LSF) e o conceito de densidade lexical (Halliday, 1985, 1987, 1993; Halliday; Matthiessen, 2014) para investigar a complexidade dos textos produzidos, dado que a complexidade lexical está relacionada à proficiência na escrita, na qual textos mais avançados usam proporcionalmente mais “palavras de conteúdo” (nomes, verbos, adjetivos e alguns advérbios de modo). Os resultados revelam que o ChatGPT não segue as instruções das tarefas quanto ao número de palavras solicitadas, impactando, assim, no cálculo da densidade lexical, nem produz textos que mostram diferenças significativas da densidade lexical entre as línguas adicionais e níveis de proficiência.Universidade Federal de Minas Gerais2023-11-29info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionapplication/pdfhttps://periodicos.ufmg.br/index.php/textolivre/article/view/4783610.1590/1983-3652.2024.47836Texto Livre; Vol. 17 (2024): Texto Livre: Linguagem e Tecnologia; e47836Texto Livre; Vol. 17 (2024): Texto Livre: Linguagem e Tecnologia; e47836Texto Livre; Vol. 17 (2024): Texto Livre: Linguagem e Tecnologia; e47836Texto Livre; v. 17 (2024): Texto Livre: Linguagem e Tecnologia; e478361983-3652reponame:Texto livreinstname:Universidade Federal de Minas Gerais (UFMG)instacron:UFMGporhttps://periodicos.ufmg.br/index.php/textolivre/article/view/47836/39293Copyright (c) 2023 Antonio Marcio Da Silva, Lucia Rottavahttps://creativecommons.org/licenses/by/4.0info:eu-repo/semantics/openAccessDa Silva, Antonio MarcioRottava, Lucia2024-10-09T08:35:48Zoai:periodicos.ufmg.br:article/47836Revistahttp://www.periodicos.letras.ufmg.br/index.php/textolivrePUBhttps://periodicos.ufmg.br/index.php/textolivre/oairevistatextolivre@letras.ufmg.br1983-36521983-3652opendoar:2024-10-09T08:35:48Texto livre - Universidade Federal de Minas Gerais (UFMG)false |
dc.title.none.fl_str_mv |
Lexical density in texts generated by ChatGPT: implications of artificial intelligence for writing in additional languages Densidade lexical em textos gerados pelo ChatGPT: implicações da inteligência artificial para a escrita em línguas adicionais |
title |
Lexical density in texts generated by ChatGPT: implications of artificial intelligence for writing in additional languages |
spellingShingle |
Lexical density in texts generated by ChatGPT: implications of artificial intelligence for writing in additional languages Da Silva, Antonio Marcio Línguas adicionais ChatGPT Inteligência Artificial Linguística Sistêmico-funcional Densidade lexical Additional languages ChatGPT Artificial Intelligence Systemic Functional Linguistics Lexical density |
title_short |
Lexical density in texts generated by ChatGPT: implications of artificial intelligence for writing in additional languages |
title_full |
Lexical density in texts generated by ChatGPT: implications of artificial intelligence for writing in additional languages |
title_fullStr |
Lexical density in texts generated by ChatGPT: implications of artificial intelligence for writing in additional languages |
title_full_unstemmed |
Lexical density in texts generated by ChatGPT: implications of artificial intelligence for writing in additional languages |
title_sort |
Lexical density in texts generated by ChatGPT: implications of artificial intelligence for writing in additional languages |
author |
Da Silva, Antonio Marcio |
author_facet |
Da Silva, Antonio Marcio Rottava, Lucia |
author_role |
author |
author2 |
Rottava, Lucia |
author2_role |
author |
dc.contributor.author.fl_str_mv |
Da Silva, Antonio Marcio Rottava, Lucia |
dc.subject.por.fl_str_mv |
Línguas adicionais ChatGPT Inteligência Artificial Linguística Sistêmico-funcional Densidade lexical Additional languages ChatGPT Artificial Intelligence Systemic Functional Linguistics Lexical density |
topic |
Línguas adicionais ChatGPT Inteligência Artificial Linguística Sistêmico-funcional Densidade lexical Additional languages ChatGPT Artificial Intelligence Systemic Functional Linguistics Lexical density |
description |
Technological advancement has had a significant impact on written production, especially in Additional Languages (ALs). Although technology has brought new opportunities for AL teaching, it also poses challenges, including concerns about the complexity of writing and the authenticity of students’ work. One such tool is ChatGPT, an artificial intelligence (AI) platform that has been the subject of debate since its popularization in 2022. This study analyses a corpus consisting of six tasks produced by ChatGPT in five languages (German, Spanish, French, Italian, and Portuguese), considering the proficiency levels proposed by the Common European Framework of Reference for Languages (CEFR), totalling 2991 texts and 706,401 words. The data were generated by students in a computer lab at a British university from 100 different profiles on the ChatGPT platform, following the researchers’ instructions. Data analysis employs Systemic Functional Linguistics (SFL) and the concept of lexical density (Halliday, 1985, 1987, 1993; Halliday; Matthiessen, 2014) to investigate the complexity of the produced texts, as lexical complexity is related to proficiency in writing, where more advanced texts proportionally use more “content words” (nouns, verbs, adjectives, and some adverbs of manner). The results reveal that ChatGPT does not adhere to task instructions regarding the requested word count, thereby impacting the calculation of lexical density, nor does it produce texts that show significant differences in lexical density among additional languages and proficiency levels. |
publishDate |
2023 |
dc.date.none.fl_str_mv |
2023-11-29 |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://periodicos.ufmg.br/index.php/textolivre/article/view/47836 10.1590/1983-3652.2024.47836 |
url |
https://periodicos.ufmg.br/index.php/textolivre/article/view/47836 |
identifier_str_mv |
10.1590/1983-3652.2024.47836 |
dc.language.iso.fl_str_mv |
por |
language |
por |
dc.relation.none.fl_str_mv |
https://periodicos.ufmg.br/index.php/textolivre/article/view/47836/39293 |
dc.rights.driver.fl_str_mv |
Copyright (c) 2023 Antonio Marcio Da Silva, Lucia Rottava https://creativecommons.org/licenses/by/4.0 info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
Copyright (c) 2023 Antonio Marcio Da Silva, Lucia Rottava https://creativecommons.org/licenses/by/4.0 |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Universidade Federal de Minas Gerais |
publisher.none.fl_str_mv |
Universidade Federal de Minas Gerais |
dc.source.none.fl_str_mv |
Texto Livre; Vol. 17 (2024): Texto Livre: Linguagem e Tecnologia; e47836 Texto Livre; Vol. 17 (2024): Texto Livre: Linguagem e Tecnologia; e47836 Texto Livre; Vol. 17 (2024): Texto Livre: Linguagem e Tecnologia; e47836 Texto Livre; v. 17 (2024): Texto Livre: Linguagem e Tecnologia; e47836 1983-3652 reponame:Texto livre instname:Universidade Federal de Minas Gerais (UFMG) instacron:UFMG |
instname_str |
Universidade Federal de Minas Gerais (UFMG) |
instacron_str |
UFMG |
institution |
UFMG |
reponame_str |
Texto livre |
collection |
Texto livre |
repository.name.fl_str_mv |
Texto livre - Universidade Federal de Minas Gerais (UFMG) |
repository.mail.fl_str_mv |
revistatextolivre@letras.ufmg.br |
_version_ |
1814256385779564544 |