Automatic speech recognition and text-to-speech technologies for L2 pronunciation improvement: reflections on their affordances

Bibliographic details
Main author: Gottardi, William
Publication date: 2022
Other authors: Almeida, Janaina Fernanda de, Tumolo, Celso Henrique Soufen
Document type: Article
Language: eng
Source title: Texto livre
Full text: https://periodicos.ufmg.br/index.php/textolivre/article/view/36736
Abstract: This paper reflects on two technologies – automatic speech recognition (ASR) and text-to-speech (TTS) – as means of improving learners’ pronunciation, with successful spoken communication as the goal. It sheds light on the practical use of these technologies, describing their effectiveness, qualities, and limitations to help teachers choose the digital resources best suited to their students’ needs. The paper reviews previous quantitative and/or qualitative empirical studies that investigated teachers’ and learners’ perceptions and the use of ASR and TTS as pedagogical tools for pronunciation practice. It concludes that a) the resources presented seem to have the potential to enhance pronunciation practice, in both perception and production; b) technology can bring considerable benefits to learners, mainly as a supplement to pronunciation teaching; and c) these digital resources give learners the opportunity to focus on their specific difficulties and receive personalized feedback while becoming more autonomous in their learning process.
id UFMG-9_2444a257cd9c43ac49f8b2493cf2c8a0
oai_identifier_str oai:periodicos.ufmg.br:article/36736
network_acronym_str UFMG-9
network_name_str Texto livre
repository_id_str
dc.title.none.fl_str_mv Automatic speech recognition and text-to-speech technologies for L2 pronunciation improvement: reflections on their affordances
Tecnologias de reconhecimento automático da fala e texto-fala para o aprimoramento da pronúncia em L2: reflexões das suas aplicabilidades
title Automatic speech recognition and text-to-speech technologies for L2 pronunciation improvement: reflections on their affordances
author Gottardi, William
author_facet Gottardi, William
Almeida, Janaina Fernanda de
Tumolo, Celso Henrique Soufen
author_role author
author2 Almeida, Janaina Fernanda de
Tumolo, Celso Henrique Soufen
author2_role author
author
dc.contributor.author.fl_str_mv Gottardi, William
Almeida, Janaina Fernanda de
Tumolo, Celso Henrique Soufen
dc.subject.por.fl_str_mv Automatic speech recognition
Text-to-speech
CALL
Pronunciation teaching
Pronunciation improvement
Reconhecimento automático da fala
Texto-fala
CALL
Ensino de pronúncia
Aprimoramento de pronúncia
publishDate 2022
dc.date.none.fl_str_mv 2022-02-10
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
Peer-reviewed article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://periodicos.ufmg.br/index.php/textolivre/article/view/36736
10.35699/1983-3652.2022.36736
url https://periodicos.ufmg.br/index.php/textolivre/article/view/36736
identifier_str_mv 10.35699/1983-3652.2022.36736
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv https://periodicos.ufmg.br/index.php/textolivre/article/view/36736/29845
dc.rights.driver.fl_str_mv Copyright (c) 2022 Gottardi et al.
https://creativecommons.org/licenses/by/4.0
info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade Federal de Minas Gerais
publisher.none.fl_str_mv Universidade Federal de Minas Gerais
dc.source.none.fl_str_mv Texto Livre; Vol. 15 (2022): Texto Livre: Linguagem e Tecnologia ; e36736
1983-3652
reponame:Texto livre
instname:Universidade Federal de Minas Gerais (UFMG)
instacron:UFMG
instname_str Universidade Federal de Minas Gerais (UFMG)
instacron_str UFMG
institution UFMG
reponame_str Texto livre
collection Texto livre
repository.name.fl_str_mv Texto livre - Universidade Federal de Minas Gerais (UFMG)
repository.mail.fl_str_mv revistatextolivre@letras.ufmg.br
_version_ 1799711143547109376