Automatic speech recognition: a comparative evaluation between neural networks and hidden markov models
Autor(a) principal: | |
---|---|
Data de Publicação: | 1999 |
Outros Autores: | , , |
Tipo de documento: | Relatório |
Idioma: | eng |
Título da fonte: | Repositório Institucional da UFRJ |
Texto Completo: | http://hdl.handle.net/11422/2568 |
Resumo: | In this work we do a comparative evaluation between Artificial Neural Networks (RNA's) and Continuous Hidden Markov Models (CDHMM), in the framework of the recognition of isolated words, under the constrain of using a small number of features extracted from each voice signal. In order to accomplish such comparison we used two models of neural networks: the Multilayer Perceptron (MLP) and a variant of the Radial Basis (RBF), and some HMM models. We evaluated the performance of all models using two different test set and observed that the neural models presented the best results in both cases. Seeking to improve the HMM performance we developed a hybrid system, HMM/MLP, that improved the results previously obtained with all HMMs, and even those obtained with the neural networks for the all previous HMM, and even the neural nets for the hardest test set case. |
id |
UFRJ_849e51e68f4f5efe83da8dcb46a0233b |
---|---|
oai_identifier_str |
oai:pantheon.ufrj.br:11422/2568 |
network_acronym_str |
UFRJ |
network_name_str |
Repositório Institucional da UFRJ |
repository_id_str |
|
spelling |
Thomé, Antonio Carlos GayDiniz, Suelaine dos SantosSantos, Sidney Cerqueira Bispo dosSilva, Dirceu Gonzaga da2017-08-03T14:21:53Z2023-11-30T03:02:12Z1999-12-31THOMÉ, C. G. T. et al. Automatic speech recognition: a comparative evaluation between neural networks and hidden markov models. Rio de Janeiro: NCE, UFRJ, 1999. 4 p. (Relatório Técnico, 14/99)http://hdl.handle.net/11422/2568In this work we do a comparative evaluation between Artificial Neural Networks (RNA's) and Continuous Hidden Markov Models (CDHMM), in the framework of the recognition of isolated words, under the constrain of using a small number of features extracted from each voice signal. In order to accomplish such comparison we used two models of neural networks: the Multilayer Perceptron (MLP) and a variant of the Radial Basis (RBF), and some HMM models. We evaluated the performance of all models using two different test set and observed that the neural models presented the best results in both cases. Seeking to improve the HMM performance we developed a hybrid system, HMM/MLP, that improved the results previously obtained with all HMMs, and even those obtained with the neural networks for the all previous HMM, and even the neural nets for the hardest test set case.Submitted by Elaine Almeida (elaine.almeida@nce.ufrj.br) on 2017-08-03T14:21:52Z No. of bitstreams: 1 14_99_000611273.pdf: 743652 bytes, checksum: c2476658a64fb6649d8b6674d3f91dbf (MD5)Made available in DSpace on 2017-08-03T14:21:53Z (GMT). No. of bitstreams: 1 14_99_000611273.pdf: 743652 bytes, checksum: c2476658a64fb6649d8b6674d3f91dbf (MD5) Previous issue date: 1999-12-31engRelatório Técnico NCECNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAOReconhecimento automático de vozRedes neurais (Ciência da computação)Modelos markovianosAutomatic speech recognition: a comparative evaluation between neural networks and hidden markov modelsinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/report1499abertoBrasilInstituto Tércio Pacitti de Aplicações e Pesquisas Computacionaisinfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da UFRJinstname:Universidade Federal do Rio de Janeiro (UFRJ)instacron:UFRJORIGINAL14_99_000611273.pdf14_99_000611273.pdfapplication/pdf743652http://pantheon.ufrj.br:80/bitstream/11422/2568/1/14_99_000611273.pdfc2476658a64fb6649d8b6674d3f91dbfMD51LICENSElicense.txtlicense.txttext/plain; charset=utf-81853http://pantheon.ufrj.br:80/bitstream/11422/2568/2/license.txtdd32849f2bfb22da963c3aac6e26e255MD52TEXT14_99_000611273.pdf.txt14_99_000611273.pdf.txtExtracted texttext/plain0http://pantheon.ufrj.br:80/bitstream/11422/2568/3/14_99_000611273.pdf.txtd41d8cd98f00b204e9800998ecf8427eMD5311422/25682023-11-30 00:02:12.961oai:pantheon.ufrj.br:11422/2568TElDRU7Dh0EgTsODTy1FWENMVVNJVkEgREUgRElTVFJJQlVJw4fDg08KCkFvIGFzc2luYXIgZSBlbnRyZWdhciBlc3RhIGxpY2Vuw6dhLCB2b2PDqihzKSBvKHMpIGF1dG9yKGVzKSBvdSBwcm9wcmlldMOhcmlvKHMpIGRvcyBkaXJlaXRvcyBhdXRvcmFpcyBjb25jZWRlKG0pIGFvIFJlcG9zaXTDs3JpbyBQYW50aGVvbiBkYSBVbml2ZXJzaWRhZGUgRmVkZXJhbCBkbyBSaW8gZGUgSmFuZWlybyAoVUZSSikgbyBkaXJlaXRvIG7Do28gLSBleGNsdXNpdm8gZGUgcmVwcm9kdXppciwgY29udmVydGVyIChjb21vIGRlZmluaWRvIGFiYWl4byksIGUvb3UgZGlzdHJpYnVpciBvIGRvY3VtZW50byBlbnRyZWd1ZSAoaW5jbHVpbmRvIG8gcmVzdW1vKSBlbSB0b2RvIG8gbXVuZG8sIGVtIGZvcm1hdG8gZWxldHLDtG5pY28gZSBlbSBxdWFscXVlciBtZWlvLCBpbmNsdWluZG8sIG1hcyBuw6NvIGxpbWl0YWRvIGEgw6F1ZGlvIGUvb3UgdsOtZGVvLgoKVm9jw6ogY29uY29yZGEgcXVlIGEgVUZSSiBwb2RlLCBzZW0gYWx0ZXJhciBvIGNvbnRlw7pkbywgdHJhZHV6aXIgYSBhcHJlc2VudGHDp8OjbyBkZSBxdWFscXVlciBtZWlvIG91IGZvcm1hdG8gY29tIGEgZmluYWxpZGFkZSBkZSBwcmVzZXJ2YcOnw6NvLgoKVm9jw6ogdGFtYsOpbSBjb25jb3JkYSBxdWUgYSBVRlJKIHBvZGUgbWFudGVyIG1haXMgZGUgdW1hIGPDs3BpYSBkZXNzYSBzdWJtaXNzw6NvIHBhcmEgZmlucyBkZSBzZWd1cmFuw6dhLCBiYWNrLXVwIGUgcHJlc2VydmHDp8OjbyBkaWdpdGFsLgoKRGVjbGFyYSBxdWUgbyBkb2N1bWVudG8gZW50cmVndWUgw6kgc2V1IHRyYWJhbGhvIG9yaWdpbmFsLCBlIHF1ZSB2b2PDqiB0ZW0gbyBkaXJlaXRvIGRlIGNvbmNlZGVyIG9zIGRpcmVpdG9zIGNvbnRpZG9zIG5lc3RhIGxpY2Vuw6dhLiBWb2PDqiB0YW1iw6ltIGRlY2xhcmEgcXVlIGEgc3VhIGFwcmVzZW50YcOnw6NvLCBjb20gbyBtZWxob3IgZGUgc2V1cyBjb25oZWNpbWVudG9zLCBuw6NvIGluZnJpbmdpIGRpcmVpdG9zIGF1dG9yYWlzIGRlIHRlcmNlaXJvcy4KClNlIG8gZG9jdW1lbnRvIGVudHJlZ3VlIGNvbnTDqW0gbWF0ZXJpYWwgZG8gcXVhbCB2b2PDqiBuw6NvIHRlbSBkaXJlaXRvcyBkZSBhdXRvciwgZGVjbGFyYSBxdWUgb2J0ZXZlIGEgcGVybWlzc8OjbyBpcnJlc3RyaXRhIGRvIGRldGVudG9yIGRvcyBkaXJlaXRvcyBhdXRvcmFpcyBlIGNvbmNlZGUgYSBVRlJKIG9zIGRpcmVpdG9zIHJlcXVlcmlkb3MgcG9yIGVzdGEgbGljZW7Dp2EsIGUgcXVlIGVzc2UgbWF0ZXJpYWwgZGUgcHJvcHJpZWRhZGUgZGUgdGVyY2Vpcm9zIGVzdMOhIGNsYXJhbWVudGUgaWRlbnRpZmljYWRvIGUgcmVjb25oZWNpZG8gbm8gdGV4dG8gb3UgY29udGXDumRvIGRhIHN1Ym1pc3PDo28uCgpTZSBvIGRvY3VtZW50byBlbnRyZWd1ZSDDqSBiYXNlYWRvIGVtIHRyYWJhbGhvIHF1ZSBmb2ksIG91IHRlbSBzaWRvIHBhdHJvY2luYWRvIG91IGFwb2lhZG8gcG9yIHVtYSBhZ8OqbmNpYSBvdSBvdXRybyhzKSBvcmdhbmlzbW8ocykgcXVlIG7Do28gYSBVRlJKLCB2b2PDqiBkZWNsYXJhIHF1ZSBjdW1wcml1IHF1YWxxdWVyIGRpcmVpdG8gZGUgUkVWSVPDg08gb3UgZGUgb3V0cmFzIG9icmlnYcOnw7VlcyByZXF1ZXJpZGFzIHBvciBjb250cmF0byBvdSBhY29yZG8uCgpBIFVGUkogaXLDoSBpZGVudGlmaWNhciBjbGFyYW1lbnRlIG8ocykgc2V1KHMpIG5vbWUocykgY29tbyBhdXRvcihlcykgb3UgcHJvcHJpZXTDoXJpbyhzKSBkYSBzdWJtaXNzw6NvLCBlIG7Do28gZmFyw6EgcXVhbHF1ZXIgYWx0ZXJhw6fDo28sIHBhcmEgYWzDqW0gZGFzIHBlcm1pdGlkYXMgcG9yIGVzdGEgbGljZW7Dp2EsIG5vIGF0byBkZSBzdWJtaXNzw6NvLgo=Repositório de PublicaçõesPUBhttp://www.pantheon.ufrj.br/oai/requestopendoar:2023-11-30T03:02:12Repositório Institucional da UFRJ - Universidade Federal do Rio de Janeiro (UFRJ)false |
dc.title.pt_BR.fl_str_mv |
Automatic speech recognition: a comparative evaluation between neural networks and hidden markov models |
title |
Automatic speech recognition: a comparative evaluation between neural networks and hidden markov models |
spellingShingle |
Automatic speech recognition: a comparative evaluation between neural networks and hidden markov models Thomé, Antonio Carlos Gay CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO Reconhecimento automático de voz Redes neurais (Ciência da computação) Modelos markovianos |
title_short |
Automatic speech recognition: a comparative evaluation between neural networks and hidden markov models |
title_full |
Automatic speech recognition: a comparative evaluation between neural networks and hidden markov models |
title_fullStr |
Automatic speech recognition: a comparative evaluation between neural networks and hidden markov models |
title_full_unstemmed |
Automatic speech recognition: a comparative evaluation between neural networks and hidden markov models |
title_sort |
Automatic speech recognition: a comparative evaluation between neural networks and hidden markov models |
author |
Thomé, Antonio Carlos Gay |
author_facet |
Thomé, Antonio Carlos Gay Diniz, Suelaine dos Santos Santos, Sidney Cerqueira Bispo dos Silva, Dirceu Gonzaga da |
author_role |
author |
author2 |
Diniz, Suelaine dos Santos Santos, Sidney Cerqueira Bispo dos Silva, Dirceu Gonzaga da |
author2_role |
author author author |
dc.contributor.author.fl_str_mv |
Thomé, Antonio Carlos Gay Diniz, Suelaine dos Santos Santos, Sidney Cerqueira Bispo dos Silva, Dirceu Gonzaga da |
dc.subject.cnpq.fl_str_mv |
CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO |
topic |
CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO Reconhecimento automático de voz Redes neurais (Ciência da computação) Modelos markovianos |
dc.subject.por.fl_str_mv |
Reconhecimento automático de voz Redes neurais (Ciência da computação) Modelos markovianos |
description |
In this work we do a comparative evaluation between Artificial Neural Networks (RNA's) and Continuous Hidden Markov Models (CDHMM), in the framework of the recognition of isolated words, under the constrain of using a small number of features extracted from each voice signal. In order to accomplish such comparison we used two models of neural networks: the Multilayer Perceptron (MLP) and a variant of the Radial Basis (RBF), and some HMM models. We evaluated the performance of all models using two different test set and observed that the neural models presented the best results in both cases. Seeking to improve the HMM performance we developed a hybrid system, HMM/MLP, that improved the results previously obtained with all HMMs, and even those obtained with the neural networks for the all previous HMM, and even the neural nets for the hardest test set case. |
publishDate |
1999 |
dc.date.issued.fl_str_mv |
1999-12-31 |
dc.date.accessioned.fl_str_mv |
2017-08-03T14:21:53Z |
dc.date.available.fl_str_mv |
2023-11-30T03:02:12Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/report |
format |
report |
status_str |
publishedVersion |
dc.identifier.citation.fl_str_mv |
THOMÉ, C. G. T. et al. Automatic speech recognition: a comparative evaluation between neural networks and hidden markov models. Rio de Janeiro: NCE, UFRJ, 1999. 4 p. (Relatório Técnico, 14/99) |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/11422/2568 |
identifier_str_mv |
THOMÉ, C. G. T. et al. Automatic speech recognition: a comparative evaluation between neural networks and hidden markov models. Rio de Janeiro: NCE, UFRJ, 1999. 4 p. (Relatório Técnico, 14/99) |
url |
http://hdl.handle.net/11422/2568 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.ispartof.pt_BR.fl_str_mv |
Relatório Técnico NCE |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.publisher.country.fl_str_mv |
Brasil |
dc.publisher.department.fl_str_mv |
Instituto Tércio Pacitti de Aplicações e Pesquisas Computacionais |
dc.source.none.fl_str_mv |
reponame:Repositório Institucional da UFRJ instname:Universidade Federal do Rio de Janeiro (UFRJ) instacron:UFRJ |
instname_str |
Universidade Federal do Rio de Janeiro (UFRJ) |
instacron_str |
UFRJ |
institution |
UFRJ |
reponame_str |
Repositório Institucional da UFRJ |
collection |
Repositório Institucional da UFRJ |
bitstream.url.fl_str_mv |
http://pantheon.ufrj.br:80/bitstream/11422/2568/1/14_99_000611273.pdf http://pantheon.ufrj.br:80/bitstream/11422/2568/2/license.txt http://pantheon.ufrj.br:80/bitstream/11422/2568/3/14_99_000611273.pdf.txt |
bitstream.checksum.fl_str_mv |
c2476658a64fb6649d8b6674d3f91dbf dd32849f2bfb22da963c3aac6e26e255 d41d8cd98f00b204e9800998ecf8427e |
bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 MD5 |
repository.name.fl_str_mv |
Repositório Institucional da UFRJ - Universidade Federal do Rio de Janeiro (UFRJ) |
repository.mail.fl_str_mv |
|
_version_ |
1784097090627960832 |