Reconhecimento de voz para aplicações em automação implementado em FPGA

Mulatinho, Gustavo Moscardo [UNESP]

Reconhecimento de voz para aplicações em automação implementado em FPGA

Detalhes bibliográficos
Autor(a) principal:	Mulatinho, Gustavo Moscardo [UNESP]
Data de Publicação:	2011
Tipo de documento:	Trabalho de conclusão de curso
Idioma:	por
Título da fonte:	Repositório Institucional da UNESP
Texto Completo:	http://hdl.handle.net/11449/120118
Resumo:	In many movies of scientific fiction, machines were capable of speaking with humans. However mankind is still far away of getting those types of machines, like the famous character C3PO of Star Wars. During the last six decades the automatic speech recognition systems have been the target of many studies. Throughout these years many technics were developed to be used in applications of both software and hardware. There are many types of automatic speech recognition system, among which the one used in this work were the isolated word and independent of the speaker system, using Hidden Markov Models as the recognition system. The goals of this work is to project and synthesize the first two steps of the speech recognition system, the steps are: the speech signal acquisition and the pre-processing of the signal. Both steps were developed in a reprogrammable component named FPGA, using the VHDL hardware description language, owing to the high performance of this component and the flexibility of the language. In this work it is presented all the theory of digital signal processing, as Fast Fourier Transforms and digital filters and also all the theory of speech recognition using Hidden Markov Models and LPC processor. It is also presented all the results obtained for each one of the blocks synthesized e verified in hardware

Metadados do item

id	UNSP_1de015d46a9d19d8a78bbc3a5e5242a3
oai_identifier_str	oai:repositorio.unesp.br:11449/120118
network_acronym_str	UNSP
network_name_str	Repositório Institucional da UNESP
repository_id_str	2946
spelling	Reconhecimento de voz para aplicações em automação implementado em FPGAReconhecimento automatico da vozVHDL (Linguagem descritiva de hardware)Reconhecimento de palavrasOndas sonorasAutomatic Speech RecognitionVHDLHidden Markov ModelsIn many movies of scientific fiction, machines were capable of speaking with humans. However mankind is still far away of getting those types of machines, like the famous character C3PO of Star Wars. During the last six decades the automatic speech recognition systems have been the target of many studies. Throughout these years many technics were developed to be used in applications of both software and hardware. There are many types of automatic speech recognition system, among which the one used in this work were the isolated word and independent of the speaker system, using Hidden Markov Models as the recognition system. The goals of this work is to project and synthesize the first two steps of the speech recognition system, the steps are: the speech signal acquisition and the pre-processing of the signal. Both steps were developed in a reprogrammable component named FPGA, using the VHDL hardware description language, owing to the high performance of this component and the flexibility of the language. In this work it is presented all the theory of digital signal processing, as Fast Fourier Transforms and digital filters and also all the theory of speech recognition using Hidden Markov Models and LPC processor. It is also presented all the results obtained for each one of the blocks synthesized e verified in hardwareMuitos são os filmes de ficção científica em que são utilizadas máquinas capazes de dialogar com os seres humanos. Porém, o homem ainda está longe de chegar em tais máquinas, como o personagem C3PO do filme Star Wars. Durante as últimas seis décadas muito se têm investido nos estudos de reconhecimento automático de voz, surgindo ao longo desses anos diversas técnicas que podem ser utilizadas por ambas as aplicações de software e hardware. Diversos são os tipos de sistemas de reconhecimento automático de voz, dentre os quais o utilizado para este trabalho é o sistema de palavras isoladas independentes do locutor, utilizando Modelos Escondidos de Markov como técnica de reconhecimento da palavra. Este trabalho tem por finalidade projetar e sintetizar as duas primeiras etapas de um sistema de reconhecimento de voz, sendo tais etapas: a aquisição do sinal de voz e o pré-processamento do mesmo. Sendo estas etapas desenvolvidas em um componente reprogramável denominado FPGA, utilizando linguagem de programação de hardware VHDL, tendo em vista o alto desempenho que este componente pode proporcionar e a flexibilidade da linguagem. Neste trabalho é apresentado todo o conteúdo teórico de processamento digital de sinais, como a teoria de Transformadas Rápidas de Fourier e filtros digitais e também toda a teoria de reconhecimento de voz utilizando Modelos Escondidos de Markov e processador LPC. Também são apresentados todos os resultados obtidos por cada um dos blocos sintetizados e verificados em hardwareUniversidade Estadual Paulista (Unesp)Mesquita, Leonardo [UNESP]Universidade Estadual Paulista (Unesp)Mulatinho, Gustavo Moscardo [UNESP]2015-03-23T15:24:10Z2015-03-23T15:24:10Z2011info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/bachelorThesisapplication/pdfMULATINHO, Gustavo Moscardo. Reconhecimento de voz para aplicações em automação implementado em FPGA. 2011. 1 CD-ROM. Trabalho de conclusão de curso - (bacharelado - Engenharia Elétrica) – Universidade Estadual Paulista, Faculdade de Engenharia de Guaratinguetá, 2011.http://hdl.handle.net/11449/120118000686916mulatinho_gm_tcc_guara.pdf9338079447464341Alephreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPporinfo:eu-repo/semantics/openAccess2024-01-14T06:22:06Zoai:repositorio.unesp.br:11449/120118Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-05-23T20:40:18.417898Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv	Reconhecimento de voz para aplicações em automação implementado em FPGA
title	Reconhecimento de voz para aplicações em automação implementado em FPGA
spellingShingle	Reconhecimento de voz para aplicações em automação implementado em FPGA Mulatinho, Gustavo Moscardo [UNESP] Reconhecimento automatico da voz VHDL (Linguagem descritiva de hardware) Reconhecimento de palavras Ondas sonoras Automatic Speech Recognition VHDL Hidden Markov Models
title_short	Reconhecimento de voz para aplicações em automação implementado em FPGA
title_full	Reconhecimento de voz para aplicações em automação implementado em FPGA
title_fullStr	Reconhecimento de voz para aplicações em automação implementado em FPGA
title_full_unstemmed	Reconhecimento de voz para aplicações em automação implementado em FPGA
title_sort	Reconhecimento de voz para aplicações em automação implementado em FPGA
author	Mulatinho, Gustavo Moscardo [UNESP]
author_facet	Mulatinho, Gustavo Moscardo [UNESP]
author_role	author
dc.contributor.none.fl_str_mv	Mesquita, Leonardo [UNESP] Universidade Estadual Paulista (Unesp)
dc.contributor.author.fl_str_mv	Mulatinho, Gustavo Moscardo [UNESP]
dc.subject.por.fl_str_mv	Reconhecimento automatico da voz VHDL (Linguagem descritiva de hardware) Reconhecimento de palavras Ondas sonoras Automatic Speech Recognition VHDL Hidden Markov Models
topic	Reconhecimento automatico da voz VHDL (Linguagem descritiva de hardware) Reconhecimento de palavras Ondas sonoras Automatic Speech Recognition VHDL Hidden Markov Models
description	In many movies of scientific fiction, machines were capable of speaking with humans. However mankind is still far away of getting those types of machines, like the famous character C3PO of Star Wars. During the last six decades the automatic speech recognition systems have been the target of many studies. Throughout these years many technics were developed to be used in applications of both software and hardware. There are many types of automatic speech recognition system, among which the one used in this work were the isolated word and independent of the speaker system, using Hidden Markov Models as the recognition system. The goals of this work is to project and synthesize the first two steps of the speech recognition system, the steps are: the speech signal acquisition and the pre-processing of the signal. Both steps were developed in a reprogrammable component named FPGA, using the VHDL hardware description language, owing to the high performance of this component and the flexibility of the language. In this work it is presented all the theory of digital signal processing, as Fast Fourier Transforms and digital filters and also all the theory of speech recognition using Hidden Markov Models and LPC processor. It is also presented all the results obtained for each one of the blocks synthesized e verified in hardware
publishDate	2011
dc.date.none.fl_str_mv	2011 2015-03-23T15:24:10Z 2015-03-23T15:24:10Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/bachelorThesis
format	bachelorThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	MULATINHO, Gustavo Moscardo. Reconhecimento de voz para aplicações em automação implementado em FPGA. 2011. 1 CD-ROM. Trabalho de conclusão de curso - (bacharelado - Engenharia Elétrica) – Universidade Estadual Paulista, Faculdade de Engenharia de Guaratinguetá, 2011. http://hdl.handle.net/11449/120118 000686916 mulatinho_gm_tcc_guara.pdf 9338079447464341
identifier_str_mv	MULATINHO, Gustavo Moscardo. Reconhecimento de voz para aplicações em automação implementado em FPGA. 2011. 1 CD-ROM. Trabalho de conclusão de curso - (bacharelado - Engenharia Elétrica) – Universidade Estadual Paulista, Faculdade de Engenharia de Guaratinguetá, 2011. 000686916 mulatinho_gm_tcc_guara.pdf 9338079447464341
url	http://hdl.handle.net/11449/120118
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Universidade Estadual Paulista (Unesp)
publisher.none.fl_str_mv	Universidade Estadual Paulista (Unesp)
dc.source.none.fl_str_mv	Aleph reponame:Repositório Institucional da UNESP instname:Universidade Estadual Paulista (UNESP) instacron:UNESP
instname_str	Universidade Estadual Paulista (UNESP)
instacron_str	UNESP
institution	UNESP
reponame_str	Repositório Institucional da UNESP
collection	Repositório Institucional da UNESP
repository.name.fl_str_mv	Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv
_version_	1803045687605067776

Reconhecimento de voz para aplicações em automação implementado em FPGA

Registros relacionados