Automatic speech recognition: a comparative evaluation between neural networks and hidden markov models

Bibliographic details
Main author: Thomé, Antonio Carlos Gay
Publication date: 1999
Other authors: Diniz, Suelaine dos Santos; Santos, Sidney Cerqueira Bispo dos; Silva, Dirceu Gonzaga da
Document type: Report
Language: eng
Source title: Repositório Institucional da UFRJ
Full text: http://hdl.handle.net/11422/2568
Abstract: In this work we present a comparative evaluation of Artificial Neural Networks (ANNs) and Continuous Hidden Markov Models (CDHMMs) for isolated-word recognition, under the constraint of using a small number of features extracted from each voice signal. For this comparison we used two neural network models, the Multilayer Perceptron (MLP) and a variant of the Radial Basis Function (RBF) network, and several HMMs. We evaluated the performance of all models on two different test sets and observed that the neural models gave the best results in both cases. Seeking to improve the HMM performance, we developed a hybrid HMM/MLP system that improved on the results previously obtained with all the HMMs and, for the hardest test set, even on those obtained with the neural networks.
id UFRJ_849e51e68f4f5efe83da8dcb46a0233b
oai_identifier_str oai:pantheon.ufrj.br:11422/2568
network_acronym_str UFRJ
network_name_str Repositório Institucional da UFRJ
repository_id_str
dc.contributor.author.fl_str_mv Thomé, Antonio Carlos Gay
Diniz, Suelaine dos Santos
Santos, Sidney Cerqueira Bispo dos
Silva, Dirceu Gonzaga da
dc.subject.cnpq.fl_str_mv CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO
dc.subject.por.fl_str_mv Reconhecimento automático de voz
Redes neurais (Ciência da computação)
Modelos markovianos
publishDate 1999
dc.date.issued.fl_str_mv 1999-12-31
dc.date.accessioned.fl_str_mv 2017-08-03T14:21:53Z
dc.date.available.fl_str_mv 2023-11-30T03:02:12Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/report
format report
status_str publishedVersion
dc.identifier.citation.fl_str_mv THOMÉ, C. G. T. et al. Automatic speech recognition: a comparative evaluation between neural networks and hidden markov models. Rio de Janeiro: NCE, UFRJ, 1999. 4 p. (Relatório Técnico, 14/99)
dc.identifier.uri.fl_str_mv http://hdl.handle.net/11422/2568
dc.language.iso.fl_str_mv eng
language eng
dc.relation.ispartof.pt_BR.fl_str_mv Relatório Técnico NCE
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.publisher.country.fl_str_mv Brasil
dc.publisher.department.fl_str_mv Instituto Tércio Pacitti de Aplicações e Pesquisas Computacionais
dc.source.none.fl_str_mv reponame:Repositório Institucional da UFRJ
instname:Universidade Federal do Rio de Janeiro (UFRJ)
instacron:UFRJ
instname_str Universidade Federal do Rio de Janeiro (UFRJ)
instacron_str UFRJ
institution UFRJ
reponame_str Repositório Institucional da UFRJ
collection Repositório Institucional da UFRJ
bitstream.url.fl_str_mv http://pantheon.ufrj.br:80/bitstream/11422/2568/1/14_99_000611273.pdf
http://pantheon.ufrj.br:80/bitstream/11422/2568/2/license.txt
http://pantheon.ufrj.br:80/bitstream/11422/2568/3/14_99_000611273.pdf.txt
bitstream.checksum.fl_str_mv c2476658a64fb6649d8b6674d3f91dbf
dd32849f2bfb22da963c3aac6e26e255
d41d8cd98f00b204e9800998ecf8427e
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
repository.name.fl_str_mv Repositório Institucional da UFRJ - Universidade Federal do Rio de Janeiro (UFRJ)
repository.mail.fl_str_mv
_version_ 1784097090627960832