Aplicação de redes neurais recorrentes no reconhecimento automático da fala em ambientes com ruídos

Santana, Luciana Maiara Queiroz de

Bibliographic details
Main author:	Santana, Luciana Maiara Queiroz de
Publication date:	2017
Document type:	Dissertation
Language:	por
Source title:	Repositório Institucional da UFS
Full text:	http://ri.ufs.br/jspui/handle/riufs/10760
Abstract:	Many learning tasks require dealing with sequential data, as in text translators, music generators, and other applications. Deep neural networks have shown promising results in automatic speech recognition, where one of the main challenges is recognizing voice signals contaminated with noise. In this work, we combine two well-known deep learning architectures: convolutional neural networks for acoustic modeling, and a recurrent architecture with Connectionist Temporal Classification (CTC) for sequence modeling. Recurrent Neural Networks (RNNs) are models that capture sequence dynamics through a topology that contains cycles, unlike acyclic, or feedforward, neural networks. The model studied in this work is a particular case of a deep recurrent network that, unlike its shallow counterparts, is able to retain a state that can represent information from an arbitrarily long context window. The experimental results showed that the proposed architecture outperformed the classical Hidden Markov Model in tests carried out on the same databases.

Item metadata

id	UFS-2_82ea543620f926b0660ec7744895cf15
oai_identifier_str	oai:ufs.br:riufs/10760
network_acronym_str	UFS-2
network_name_str	Repositório Institucional da UFS
repository_id_str
dc.title.pt_BR.fl_str_mv	Aplicação de redes neurais recorrentes no reconhecimento automático da fala em ambientes com ruídos
title	Aplicação de redes neurais recorrentes no reconhecimento automático da fala em ambientes com ruídos
title_short	Aplicação de redes neurais recorrentes no reconhecimento automático da fala em ambientes com ruídos
title_full	Aplicação de redes neurais recorrentes no reconhecimento automático da fala em ambientes com ruídos
title_fullStr	Aplicação de redes neurais recorrentes no reconhecimento automático da fala em ambientes com ruídos
title_full_unstemmed	Aplicação de redes neurais recorrentes no reconhecimento automático da fala em ambientes com ruídos
title_sort	Aplicação de redes neurais recorrentes no reconhecimento automático da fala em ambientes com ruídos
author	Santana, Luciana Maiara Queiroz de
author_facet	Santana, Luciana Maiara Queiroz de
author_role	author
dc.contributor.author.fl_str_mv	Santana, Luciana Maiara Queiroz de
dc.contributor.advisor1.fl_str_mv	Matos, Leonardo Nogueira
contributor_str_mv	Matos, Leonardo Nogueira
dc.subject.por.fl_str_mv	Reconhecimento automático de voz; Ruído aditivo; Aprendizado profundo; Rede Neural Recorrente (RNN); Automatic Speech Recognition (ASR); Additive noise; Deep learning; Recurrent Neural Network (RNN)
topic	Reconhecimento automático de voz; Ruído aditivo; Aprendizado profundo; Rede Neural Recorrente (RNN); Automatic Speech Recognition (ASR); Additive noise; Deep learning; Recurrent Neural Network (RNN); CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
dc.subject.cnpq.fl_str_mv	CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
description	Many learning tasks require dealing with sequential data, as in text translators, music generators, and other applications. Deep neural networks have shown promising results in automatic speech recognition, where one of the main challenges is recognizing voice signals contaminated with noise. In this work, we combine two well-known deep learning architectures: convolutional neural networks for acoustic modeling, and a recurrent architecture with Connectionist Temporal Classification (CTC) for sequence modeling. Recurrent Neural Networks (RNNs) are models that capture sequence dynamics through a topology that contains cycles, unlike acyclic, or feedforward, neural networks. The model studied in this work is a particular case of a deep recurrent network that, unlike its shallow counterparts, is able to retain a state that can represent information from an arbitrarily long context window. The experimental results showed that the proposed architecture outperformed the classical Hidden Markov Model in tests carried out on the same databases.
publishDate	2017
dc.date.issued.fl_str_mv	2017-07-26
dc.date.accessioned.fl_str_mv	2019-03-25T23:09:10Z
dc.date.available.fl_str_mv	2019-03-25T23:09:10Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.citation.fl_str_mv	SANTANA, Luciana Maiara Queiroz de. Aplicação de redes neurais recorrentes no reconhecimento automático da fala em ambientes com ruídos. 2017. 68 f. Dissertação (Mestrado em Ciência da Computação) - Universidade Federal de Sergipe, São Cristóvão, SE, 2017.
dc.identifier.uri.fl_str_mv	http://ri.ufs.br/jspui/handle/riufs/10760
identifier_str_mv	SANTANA, Luciana Maiara Queiroz de. Aplicação de redes neurais recorrentes no reconhecimento automático da fala em ambientes com ruídos. 2017. 68 f. Dissertação (Mestrado em Ciência da Computação) - Universidade Federal de Sergipe, São Cristóvão, SE, 2017.
url	http://ri.ufs.br/jspui/handle/riufs/10760
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.publisher.program.fl_str_mv	Pós-Graduação em Ciência da Computação
dc.publisher.initials.fl_str_mv	UFS
dc.source.none.fl_str_mv	reponame:Repositório Institucional da UFS instname:Universidade Federal de Sergipe (UFS) instacron:UFS
instname_str	Universidade Federal de Sergipe (UFS)
instacron_str	UFS
institution	UFS
reponame_str	Repositório Institucional da UFS
collection	Repositório Institucional da UFS
bitstream.url.fl_str_mv	https://ri.ufs.br/jspui/bitstream/riufs/10760/3/LUCIANA_MAIARA_QUEIROZ_SANTANA.pdf.txt https://ri.ufs.br/jspui/bitstream/riufs/10760/4/LUCIANA_MAIARA_QUEIROZ_SANTANA.pdf.jpg https://ri.ufs.br/jspui/bitstream/riufs/10760/1/license.txt https://ri.ufs.br/jspui/bitstream/riufs/10760/2/LUCIANA_MAIARA_QUEIROZ_SANTANA.pdf
bitstream.checksum.fl_str_mv	42764e8439ee008ef78c7f7aff0c792b c576da86d49d9fe2800043d11b639ce3 098cbbf65c2c15e1fb2e49c5d306a44c 1025158244aab1d7bb685026a2cd26cd
bitstream.checksumAlgorithm.fl_str_mv	MD5 MD5 MD5 MD5
repository.name.fl_str_mv	Repositório Institucional da UFS - Universidade Federal de Sergipe (UFS)
repository.mail.fl_str_mv	repositorio@academico.ufs.br
_version_	1802110816662585344
