Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities?

Bibliographic details
Main author: Delgado, Héctor
Publication date: 2015
Other authors: Matamala, Anna; Serrano, Javier
Document type: Article
Language: eng
Source title: Cadernos de Tradução (Florianópolis. Online)
Full text: https://periodicos.ufsc.br/index.php/traducao/article/view/2175-7968.2015v35n2p308
Abstract: This article presents an overview of the technological components used in the process of audio description, and suggests a new scenario in which speech recognition, machine translation, and text-to-speech, with the corresponding human revision, could be used to increase audio description provision. The article focuses on a process in which both speaker diarization and speech recognition are used in order to obtain a semi-automatic transcription of the audio description track. The technical process is presented and experimental results are summarized.
id UFSC-6_5ef0f577832d3ef9f19d1e0f8827a95b
oai_identifier_str oai:periodicos.ufsc.br:article/37501
network_acronym_str UFSC-6
network_name_str Cadernos de Tradução (Florianópolis. Online)
repository_id_str
dc.title.none.fl_str_mv Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities?
Diarización y reconocimiento de habla en la semiautomatización de la audiodescripción: un estudio exploratorio sobre posibilidades futuras
author Delgado, Héctor
author_facet Delgado, Héctor
Matamala, Anna
Serrano, Javier
author_role author
author2 Matamala, Anna
Serrano, Javier
author2_role author
author
dc.contributor.author.fl_str_mv Delgado, Héctor
Matamala, Anna
Serrano, Javier
publishDate 2015
dc.date.none.fl_str_mv 2015-06-17
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
experimental and technological research
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://periodicos.ufsc.br/index.php/traducao/article/view/2175-7968.2015v35n2p308
10.5007/2175-7968.2015v35n2p308
url https://periodicos.ufsc.br/index.php/traducao/article/view/2175-7968.2015v35n2p308
identifier_str_mv 10.5007/2175-7968.2015v35n2p308
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv https://periodicos.ufsc.br/index.php/traducao/article/view/2175-7968.2015v35n2p308/31003
dc.rights.driver.fl_str_mv Copyright (c) 2015 Cadernos de Tradução
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Copyright (c) 2015 Cadernos de Tradução
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade Federal de Santa Catarina
publisher.none.fl_str_mv Universidade Federal de Santa Catarina
dc.source.none.fl_str_mv Cadernos de Tradução; Vol. 35 No. 2 (2015): Edição Regular; 308-324
2175-7968
1414-526X
reponame:Cadernos de Tradução (Florianópolis. Online)
instname:Universidade Federal de Santa Catarina (UFSC)
instacron:UFSC
instname_str Universidade Federal de Santa Catarina (UFSC)
instacron_str UFSC
institution UFSC
reponame_str Cadernos de Tradução (Florianópolis. Online)
collection Cadernos de Tradução (Florianópolis. Online)
repository.name.fl_str_mv Cadernos de Tradução (Florianópolis. Online) - Universidade Federal de Santa Catarina (UFSC)
repository.mail.fl_str_mv editorcadernostraducao@contato.ufsc.br||ecadernos@gmail.com||cadernostraducao@contato.ufsc.br
_version_ 1799875299620421632