Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities?
Autor(a) principal: | |
---|---|
Data de Publicação: | 2015 |
Outros Autores: | , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Cadernos de Tradução (Florianópolis. Online) |
Texto Completo: | https://periodicos.ufsc.br/index.php/traducao/article/view/2175-7968.2015v35n2p308 |
Resumo: | This article presents an overview of the technological components used in the process of audio description, and suggests a new scenario in which speech recognition, machine translation, and text-to-speech, with the corresponding human revision, could be used to increase audio description provision. The article focuses on a process in which both speaker diarization and speech recognition are used in order to obtain a semi-automatic transcription of the audio description track. The technical process is presented and experimental results are summarized. |
id |
UFSC-6_5ef0f577832d3ef9f19d1e0f8827a95b |
---|---|
oai_identifier_str |
oai:periodicos.ufsc.br:article/37501 |
network_acronym_str |
UFSC-6 |
network_name_str |
Cadernos de Tradução (Florianópolis. Online) |
repository_id_str |
|
spelling |
Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities?Diarización y reconocimiento de habla en la semiautomatización de la audiodescripción: un estudio exploratorio sobre posibilidades futurasThis article presents an overview of the technological components used in the process of audio description, and suggests a new scenario in which speech recognition, machine translation, and text-to-speech, with the corresponding human revision, could be used to increase audio description provision. The article focuses on a process in which both speaker diarization and speech recognition are used in order to obtain a semi-automatic transcription of the audio description track. The technical process is presented and experimental results are summarized.Este artículo presenta una visión panorámica de los componentes tecnológicos usados en el proceso de audiodescripción y propone un nuevo escenario en el que se aplicarían el reconocimiento de habla, la traducción automática y la síntesis de habla, con su correspondiente revisión humana, para incrementar la cantidad de audiodescripciones disponibles. El artículo describe un proceso en el que la diarización y el reconocimiento de habla permiten obtener una transcripción semiautomática de la audiodescripción. El artículo presenta detalladamente el proceso técnico así como un resumen de los resultados experimentales.Universidade Federal de Santa Catarina2015-06-17info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionexperimental and technological researchapplication/pdfhttps://periodicos.ufsc.br/index.php/traducao/article/view/2175-7968.2015v35n2p30810.5007/2175-7968.2015v35n2p308Cadernos de Tradução; Vol. 35 No. 2 (2015): Edição Regular; 308-324Cadernos de Tradução; Vol. 35 Núm. 2 (2015): Edição Regular; 308-324Cadernos de Tradução; v. 35 n. 2 (2015): Edição Regular; 308-3242175-79681414-526Xreponame:Cadernos de Tradução (Florianópolis. Online)instname:Universidade Federal de Santa Catarina (UFSC)instacron:UFSCenghttps://periodicos.ufsc.br/index.php/traducao/article/view/2175-7968.2015v35n2p308/31003Copyright (c) 2015 Cadernos de Traduçãoinfo:eu-repo/semantics/openAccessDelgado, HéctorMatamala, AnnaSerrano, Javier2022-12-04T03:38:53Zoai:periodicos.ufsc.br:article/37501Revistahttps://periodicos.ufsc.br/index.php/traducao/indexPUBhttps://periodicos.ufsc.br/index.php/traducao/oaieditorcadernostraducao@contato.ufsc.br||ecadernos@gmail.com||editorcadernostraducao@contato.ufsc.br|| cadernostraducao@contato.ufsc.br2175-79681414-526Xopendoar:2022-12-04T03:38:53Cadernos de Tradução (Florianópolis. Online) - Universidade Federal de Santa Catarina (UFSC)false |
dc.title.none.fl_str_mv |
Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities? Diarización y reconocimiento de habla en la semiautomatización de la audiodescripción: un estudio exploratorio sobre posibilidades futuras |
title |
Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities? |
spellingShingle |
Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities? Delgado, Héctor |
title_short |
Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities? |
title_full |
Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities? |
title_fullStr |
Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities? |
title_full_unstemmed |
Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities? |
title_sort |
Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities? |
author |
Delgado, Héctor |
author_facet |
Delgado, Héctor Matamala, Anna Serrano, Javier |
author_role |
author |
author2 |
Matamala, Anna Serrano, Javier |
author2_role |
author author |
dc.contributor.author.fl_str_mv |
Delgado, Héctor Matamala, Anna Serrano, Javier |
description |
This article presents an overview of the technological components used in the process of audio description, and suggests a new scenario in which speech recognition, machine translation, and text-to-speech, with the corresponding human revision, could be used to increase audio description provision. The article focuses on a process in which both speaker diarization and speech recognition are used in order to obtain a semi-automatic transcription of the audio description track. The technical process is presented and experimental results are summarized. |
publishDate |
2015 |
dc.date.none.fl_str_mv |
2015-06-17 |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion experimental and technological research |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://periodicos.ufsc.br/index.php/traducao/article/view/2175-7968.2015v35n2p308 10.5007/2175-7968.2015v35n2p308 |
url |
https://periodicos.ufsc.br/index.php/traducao/article/view/2175-7968.2015v35n2p308 |
identifier_str_mv |
10.5007/2175-7968.2015v35n2p308 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
https://periodicos.ufsc.br/index.php/traducao/article/view/2175-7968.2015v35n2p308/31003 |
dc.rights.driver.fl_str_mv |
Copyright (c) 2015 Cadernos de Tradução info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
Copyright (c) 2015 Cadernos de Tradução |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Universidade Federal de Santa Catarina |
publisher.none.fl_str_mv |
Universidade Federal de Santa Catarina |
dc.source.none.fl_str_mv |
Cadernos de Tradução; Vol. 35 No. 2 (2015): Edição Regular; 308-324 Cadernos de Tradução; Vol. 35 Núm. 2 (2015): Edição Regular; 308-324 Cadernos de Tradução; v. 35 n. 2 (2015): Edição Regular; 308-324 2175-7968 1414-526X reponame:Cadernos de Tradução (Florianópolis. Online) instname:Universidade Federal de Santa Catarina (UFSC) instacron:UFSC |
instname_str |
Universidade Federal de Santa Catarina (UFSC) |
instacron_str |
UFSC |
institution |
UFSC |
reponame_str |
Cadernos de Tradução (Florianópolis. Online) |
collection |
Cadernos de Tradução (Florianópolis. Online) |
repository.name.fl_str_mv |
Cadernos de Tradução (Florianópolis. Online) - Universidade Federal de Santa Catarina (UFSC) |
repository.mail.fl_str_mv |
editorcadernostraducao@contato.ufsc.br||ecadernos@gmail.com||editorcadernostraducao@contato.ufsc.br|| cadernostraducao@contato.ufsc.br |
_version_ |
1799875299620421632 |