Estudo e aperfeiçoamento de um vocoder de transformada senoidal
Main author: | Nascimento, Fábio Augusto Ribeiro do |
---|---|
Publication date: | 2004 |
Document type: | Dissertation |
Language: | por |
Source title: | Biblioteca Digital de Teses e Dissertações da INATEL |
Full text: | http://tede.inatel.br:8080/tede/handle/tede/78 |
Abstract: | A basic model of sinusoidal speech representation is presented, and recent techniques for coding the model parameters are studied. As a complement, new methods are presented for improving speech quality at 2.4 and 4.8 kbps using sinusoidal representation. One of these methods is a refinement of the pitch estimate, since the precision of this estimate is essential for a harmonic sinusoidal model of speech. Another proposal is a more efficient procedure for estimating the spectral envelope parameters, which exploits the psychoacoustic characteristic of the human hearing system known as Subjective Loudness. This technique reduced the number of parameters needed to represent the spectral envelope without affecting the final speech quality. An alternative approach to the synthetic phase composition step is also proposed, which lowers the processing cost without degrading quality. Experimental results indicate that using all of these methods in combination with the basic sinusoidal coding system increases the efficiency of the Sinusoidal Transform Vocoder operating at 2.4 and 4.8 kbps, improving its final speech quality and computational cost. |
id |
INAT_006e231137978e46d74414f43e9319be |
---|---|
oai_identifier_str |
oai:localhost:tede/78 |
network_acronym_str |
INAT |
network_name_str |
Biblioteca Digital de Teses e Dissertações da INATEL |
repository_id_str |
|
dc.title.por.fl_str_mv |
Estudo e aperfeiçoamento de um vocoder de transformada senoidal |
title |
Estudo e aperfeiçoamento de um vocoder de transformada senoidal |
spellingShingle |
Estudo e aperfeiçoamento de um vocoder de transformada senoidal Nascimento, Fábio Augusto Ribeiro do Codificação senoidal; excitação senoidal; modelo senoidal Engenharia - Telecomunicações |
title_short |
Estudo e aperfeiçoamento de um vocoder de transformada senoidal |
title_full |
Estudo e aperfeiçoamento de um vocoder de transformada senoidal |
title_fullStr |
Estudo e aperfeiçoamento de um vocoder de transformada senoidal |
title_full_unstemmed |
Estudo e aperfeiçoamento de um vocoder de transformada senoidal |
title_sort |
Estudo e aperfeiçoamento de um vocoder de transformada senoidal |
author |
Nascimento, Fábio Augusto Ribeiro do |
author_facet |
Nascimento, Fábio Augusto Ribeiro do |
author_role |
author |
dc.contributor.advisor1.fl_str_mv |
Silva, Francisco José Fraga da |
dc.contributor.advisor1Lattes.fl_str_mv |
http://lattes.cnpq.br/6574409043436708 |
dc.contributor.referee1.fl_str_mv |
Silva, Francisco José Fraga da |
dc.contributor.referee1Lattes.fl_str_mv |
http://lattes.cnpq.br/6574409043436708 |
dc.contributor.referee2.fl_str_mv |
Ynoguti, Carlos Alberto |
dc.contributor.referee2ID.fl_str_mv |
156.167.778-70 |
dc.contributor.referee2Lattes.fl_str_mv |
http://lattes.cnpq.br/5678667205895840 |
dc.contributor.referee3.fl_str_mv |
Violaro, Fábio |
dc.contributor.referee3Lattes.fl_str_mv |
http://lattes.cnpq.br/1810833808219352 |
dc.contributor.authorLattes.fl_str_mv |
http://lattes.cnpq.br/0228608641509261 |
dc.contributor.author.fl_str_mv |
Nascimento, Fábio Augusto Ribeiro do |
contributor_str_mv |
Silva, Francisco José Fraga da Silva, Francisco José Fraga da Ynoguti, Carlos Alberto Violaro, Fábio |
dc.subject.por.fl_str_mv |
Codificação senoidal; excitação senoidal; modelo senoidal |
topic |
Codificação senoidal; excitação senoidal; modelo senoidal Engenharia - Telecomunicações |
dc.subject.cnpq.fl_str_mv |
Engenharia - Telecomunicações |
description |
A basic model of sinusoidal speech representation is presented, and recent techniques for coding the model parameters are studied. As a complement, new methods are presented for improving speech quality at 2.4 and 4.8 kbps using sinusoidal representation. One of these methods is a refinement of the pitch estimate, since the precision of this estimate is essential for a harmonic sinusoidal model of speech. Another proposal is a more efficient procedure for estimating the spectral envelope parameters, which exploits the psychoacoustic characteristic of the human hearing system known as Subjective Loudness. This technique reduced the number of parameters needed to represent the spectral envelope without affecting the final speech quality. An alternative approach to the synthetic phase composition step is also proposed, which lowers the processing cost without degrading quality. Experimental results indicate that using all of these methods in combination with the basic sinusoidal coding system increases the efficiency of the Sinusoidal Transform Vocoder operating at 2.4 and 4.8 kbps, improving its final speech quality and computational cost. |
publishDate |
2004 |
dc.date.issued.fl_str_mv |
2004-07-30 |
dc.date.accessioned.fl_str_mv |
2017-01-10T12:53:01Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.citation.fl_str_mv |
Nascimento, Fábio Augusto Ribeiro do. Estudo e aperfeiçoamento de um vocoder de transformada senoidal. 2004. [73]. Dissertação (Mestrado em Engenharia de Telecomunicações) - Instituto Nacional de Telecomunicações, [Santa Rita do Sapucaí]. |
dc.identifier.uri.fl_str_mv |
http://tede.inatel.br:8080/tede/handle/tede/78 |
identifier_str_mv |
Nascimento, Fábio Augusto Ribeiro do. Estudo e aperfeiçoamento de um vocoder de transformada senoidal. 2004. [73]. Dissertação (Mestrado em Engenharia de Telecomunicações) - Instituto Nacional de Telecomunicações, [Santa Rita do Sapucaí]. |
url |
http://tede.inatel.br:8080/tede/handle/tede/78 |
dc.language.iso.fl_str_mv |
por |
language |
por |
dc.rights.driver.fl_str_mv |
http://creativecommons.org/licenses/by-nd/4.0/ info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
http://creativecommons.org/licenses/by-nd/4.0/ |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Instituto Nacional de Telecomunicações |
dc.publisher.program.fl_str_mv |
Mestrado em Engenharia de Telecomunicações |
dc.publisher.initials.fl_str_mv |
INATEL |
dc.publisher.country.fl_str_mv |
Brasil |
dc.publisher.department.fl_str_mv |
Instituto Nacional de Telecomunicações |
publisher.none.fl_str_mv |
Instituto Nacional de Telecomunicações |
dc.source.none.fl_str_mv |
reponame:Biblioteca Digital de Teses e Dissertações da INATEL instname:Instituto Nacional de Telecomunicações (INATEL) instacron:INATEL |
instname_str |
Instituto Nacional de Telecomunicações (INATEL) |
instacron_str |
INATEL |
institution |
INATEL |
reponame_str |
Biblioteca Digital de Teses e Dissertações da INATEL |
collection |
Biblioteca Digital de Teses e Dissertações da INATEL |
bitstream.url.fl_str_mv |
http://localhost:8080/tede/bitstream/tede/78/1/license.txt http://localhost:8080/tede/bitstream/tede/78/2/license_url http://localhost:8080/tede/bitstream/tede/78/3/license_text http://localhost:8080/tede/bitstream/tede/78/4/license_rdf http://localhost:8080/tede/bitstream/tede/78/5/Sinusoidal_Coding.pdf http://localhost:8080/tede/bitstream/tede/78/6/Sinusoidal_Coding.pdf.jpg |
bitstream.checksum.fl_str_mv |
c6279291b293f0db82678eaa73a27769 587cd8ffae15c8598ed3c46d248a3f38 d41d8cd98f00b204e9800998ecf8427e d41d8cd98f00b204e9800998ecf8427e 99edd3af83e9cc079077992a0ce483db 49d00ed465994f83146440e36f59798c |
bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 MD5 MD5 MD5 MD5 |
repository.name.fl_str_mv |
Biblioteca Digital de Teses e Dissertações da INATEL - Instituto Nacional de Telecomunicações (INATEL) |
repository.mail.fl_str_mv |
biblioteca@inatel.br || biblioteca.atendimento@inatel.br |
_version_ |
1800214190501134336 |