Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória

Martins, Ana Luísa Dine

Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória

Detalhes bibliográficos
Autor(a) principal:	Martins, Ana Luísa Dine
Data de Publicação:	2011
Tipo de documento:	Tese
Idioma:	por
Título da fonte:	Repositório Institucional da UFSCAR
Texto Completo:	https://repositorio.ufscar.br/handle/ufscar/257
Resumo:	Articulatory Synthesis consists in reproducing speech by means of models of the vocal tract and of articulatory processes. Recent advances in Magnetic Resonance Imaging (MRI) allowed for important improvements with respect to the speech comprehension and the forms taken by the vocal tract. However, one of the main challenges in the field is the fast and at the same time high-quality acquisition of image sequences. Since adopting more powerful acquisition devices might be financially inviable, a more feasible solution proposed in the literature is the resolution enhancement of the images by changes introduced in the acquisition model. This dissertation proposes a method for the spatio-temporal resolution enhancement of the obtained sequences using only digital image processing techniques. The approach involves two stages: (1) the temporal resolution enhancement by means of a motion compensated interpolation technique; and (2) the spatial resolution enhancement by means of a super resolution image reconstruction technique. With respect to the temporal resolution enhancement, two interpolation models are compared: linear interpolation considering two adjacent images and cubic splines interpolation considering four contiguous images. Since both models performed equally in the experiments, the linear interpolation was adopted, for its simplicity and lower computational cost. The initial goal of the spatial resolution enhancement was an extension of the candidate s approach proposed in her master s thesis. Adopting a maximum a posteriori probability approach (MAP), the high-resolution images were modeled using the Markov Random Fields (MRF) Generalized Isotropic Multi-Level Logistic (GIMLL) model and the Iterated Conditional Modes (ICM) algorithm. However, even though the approach has presented promising results, due to the dimension of the target problem, the algorithm presented high computational cost. Considering this limitation, an adaptation of the Wiener filter for the super-resolution reconstruction problem was considered. Inspired by two methods available in the literature, three approaches were proposed: the statistical interpolation, the multi-temporal approach, and the adaptive Wiener filter. In all cases, a separable Markovian model and an isotropic model were compared in the characterization of the spatial correlation structures. These models were used to characterize the correlation and cross correlation of observations for the statistical interpolation and the multi-temporal approach. On the other hand, for the adaptive Wiener filter, these models were used to characterize the a priori spatial correlation. According to the conducted experiments, the isotropic model outperformed the separable Markovian model. Besides, considering all Wiener filter-based approaches and the initial approach based on the GIMLL model, the adaptive Wiener filter outperformed all other approaches and was also faster than a single iteration of the GIMLL-based approach.

Metadados do item

id	SCAR_ef2e8de4a5991d2f9baa5fce2652c554
oai_identifier_str	oai:repositorio.ufscar.br:ufscar/257
network_acronym_str	SCAR
network_name_str	Repositório Institucional da UFSCAR
repository_id_str	4322
spelling	Martins, Ana Luísa DineMascarenhas, Nelson Delfino d'Ávilahttp://lattes.cnpq.br/0557976975338451http://lattes.cnpq.br/9249940379843542a0e87a30-e22a-4048-9fc9-30b5964d16582016-06-02T19:02:41Z2012-01-112016-06-02T19:02:41Z2011-10-31MARTINS-LEMOS, Ana Luísa Dine. Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória. 2011. 122 f. Tese (Doutorado em Multidisciplinar) - Universidade Federal de São Carlos, São Carlos, 2011.https://repositorio.ufscar.br/handle/ufscar/257Articulatory Synthesis consists in reproducing speech by means of models of the vocal tract and of articulatory processes. Recent advances in Magnetic Resonance Imaging (MRI) allowed for important improvements with respect to the speech comprehension and the forms taken by the vocal tract. However, one of the main challenges in the field is the fast and at the same time high-quality acquisition of image sequences. Since adopting more powerful acquisition devices might be financially inviable, a more feasible solution proposed in the literature is the resolution enhancement of the images by changes introduced in the acquisition model. This dissertation proposes a method for the spatio-temporal resolution enhancement of the obtained sequences using only digital image processing techniques. The approach involves two stages: (1) the temporal resolution enhancement by means of a motion compensated interpolation technique; and (2) the spatial resolution enhancement by means of a super resolution image reconstruction technique. With respect to the temporal resolution enhancement, two interpolation models are compared: linear interpolation considering two adjacent images and cubic splines interpolation considering four contiguous images. Since both models performed equally in the experiments, the linear interpolation was adopted, for its simplicity and lower computational cost. The initial goal of the spatial resolution enhancement was an extension of the candidate s approach proposed in her master s thesis. Adopting a maximum a posteriori probability approach (MAP), the high-resolution images were modeled using the Markov Random Fields (MRF) Generalized Isotropic Multi-Level Logistic (GIMLL) model and the Iterated Conditional Modes (ICM) algorithm. However, even though the approach has presented promising results, due to the dimension of the target problem, the algorithm presented high computational cost. Considering this limitation, an adaptation of the Wiener filter for the super-resolution reconstruction problem was considered. Inspired by two methods available in the literature, three approaches were proposed: the statistical interpolation, the multi-temporal approach, and the adaptive Wiener filter. In all cases, a separable Markovian model and an isotropic model were compared in the characterization of the spatial correlation structures. These models were used to characterize the correlation and cross correlation of observations for the statistical interpolation and the multi-temporal approach. On the other hand, for the adaptive Wiener filter, these models were used to characterize the a priori spatial correlation. According to the conducted experiments, the isotropic model outperformed the separable Markovian model. Besides, considering all Wiener filter-based approaches and the initial approach based on the GIMLL model, the adaptive Wiener filter outperformed all other approaches and was also faster than a single iteration of the GIMLL-based approach.A síntese articulatória procura produzir a fala através de modelos do trato vocal e dos processos articulatórios envolvidos. Os avanços no imageamento por ressonância magnética, permitiram que resultados importantes fossem alcançados com relação à fala e à forma do trato vocal. Entretanto um dos principais desafios ainda é a aquisição rápida e de alta qualidade das sequências de imagens. Além da opção de se utilizar meios de aquisição cada vez mais potentes, o que pode ser financeiramente inviável, abordagens propostas na literatura procuram aumentar a resolução modificando o processo de aquisição. Este trabalho propõe o aumento de resolução espaço-temporal das sequências adquiridas utilizando apenas técnicas de processamento de imagens digitais. A abordagem proposta é formada por duas etapas: o aumento de resolução temporal por meio de uma técnica de interpolação por compensação de movimento; e o aumento de resolução espacial por meio de uma técnica de reconstrução de imagens por super resolução. Com relação ao aumento de resolução temporal, dois métodos de interpolação são comparados: interpolação linear considerando duas imagens adjacentes e interpolação por splines cúbicas considerando quatro imagens consecutivas. Como, de acordo com os experimentos desenvolvidos, não existe diferença significativa entre esses dois métodos, a interpolação linear foi adotada por ser um procedimento mais simples e, consequentemente, apresentar menor custo computacional. O objetivo inicial para o aumento de resolução espacial das imagens observadas foi a extensão da abordagem proposta pela aluna em seu projeto de mestrado. Adotando uma abordagem de máxima probabilidade a posteriori (MAP), as imagens de alta resolução foram modeladas utilizando o modelo de campos aleatórios de Markov (MRF) Generalized Isotropic Multi-Level Logistic (GIMLL) e o algoritmo Iterated Conditional Modes (ICM) foi utilizado para maximizar as probabilidades condicionais locais sequencialmente. Entretanto, apesar de ter apresentado resultados promissores, devido à dimensão do problema tratado, o algoritmo ICM apresentou alto custo computacional. Considerando as limitações de performance desse algoritmo, decidiu-se adaptar o filtro de Wiener para o problema da reconstrução por super resolução. Utilizando dois trabalhos encontrados na literatura como inspiração, foram desenvolvidas três abordagens denominadas interpolação estatística, abordagem multitemporal e filtro de Wiener adaptativo. Em todos os casos, um modelo Markoviano separável e um modelo isotrópico foram comparados na caracterização das estruturas de correlação espacial. No caso da interpolação estatística e da abordagem multitemporal esses modelos foram utilizados para caracterizar as estruturas de correlação das observações e cruzada. Por outro lado, no caso da abordagem denominada filtro de Wiener adaptativo, esses modelos foram utilizados para caracterizar as estruturas de correlação espaciais a priori. De acordo com os experimentos desenvolvidos, o modelo isotrópico apresentou desempenho superior quando comparado ao modelo Markoviano separável. Além disso, considerando todas as propostas baseadas no filtro de Wiener e a proposta inicial baseada no modelo de Markov GIMLL, o filtro de Wiener adaptativo apresentou os melhores resultados e se mostrou mais rápido do que apenas uma iteração da abordagem baseada no modelo GIMLL.Universidade Federal de Minas Geraisapplication/pdfporUniversidade Federal de São CarlosPrograma de Pós-Graduação em Biotecnologia - PPGBiotecUFSCarBRBiotecnologiaProcessamento de imagens - técnicas digitaisRestauração de imagensReconstrução por super resoluçãoCIENCIAS EXATAS E DA TERRAAumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatóriainfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesis-1-1787a11d3-939f-471e-8064-0e22da9d895finfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da UFSCARinstname:Universidade Federal de São Carlos (UFSCAR)instacron:UFSCARORIGINAL4013.pdfapplication/pdf3833898https://repositorio.ufscar.br/bitstream/ufscar/257/1/4013.pdf7a490bae6746f0e2b3c0c3472c9fd2b4MD51TEXT4013.pdf.txt4013.pdf.txtExtracted texttext/plain244288https://repositorio.ufscar.br/bitstream/ufscar/257/2/4013.pdf.txtbd987b748044f088564e0c508238e923MD52THUMBNAIL4013.pdf.jpg4013.pdf.jpgIM Thumbnailimage/jpeg6184https://repositorio.ufscar.br/bitstream/ufscar/257/3/4013.pdf.jpgafa63600d1d5d838fc173ca2e737f3b0MD53ufscar/2572023-09-18 18:30:37.232oai:repositorio.ufscar.br:ufscar/257Repositório InstitucionalPUBhttps://repositorio.ufscar.br/oai/requestopendoar:43222023-09-18T18:30:37Repositório Institucional da UFSCAR - Universidade Federal de São Carlos (UFSCAR)false
dc.title.por.fl_str_mv	Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória
title	Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória
spellingShingle	Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória Martins, Ana Luísa Dine Biotecnologia Processamento de imagens - técnicas digitais Restauração de imagens Reconstrução por super resolução CIENCIAS EXATAS E DA TERRA
title_short	Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória
title_full	Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória
title_fullStr	Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória
title_full_unstemmed	Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória
title_sort	Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória
author	Martins, Ana Luísa Dine
author_facet	Martins, Ana Luísa Dine
author_role	author
dc.contributor.authorlattes.por.fl_str_mv	http://lattes.cnpq.br/9249940379843542
dc.contributor.author.fl_str_mv	Martins, Ana Luísa Dine
dc.contributor.advisor1.fl_str_mv	Mascarenhas, Nelson Delfino d'Ávila
dc.contributor.advisor1Lattes.fl_str_mv	http://lattes.cnpq.br/0557976975338451
dc.contributor.authorID.fl_str_mv	a0e87a30-e22a-4048-9fc9-30b5964d1658
contributor_str_mv	Mascarenhas, Nelson Delfino d'Ávila
dc.subject.por.fl_str_mv	Biotecnologia Processamento de imagens - técnicas digitais Restauração de imagens Reconstrução por super resolução
topic	Biotecnologia Processamento de imagens - técnicas digitais Restauração de imagens Reconstrução por super resolução CIENCIAS EXATAS E DA TERRA
dc.subject.cnpq.fl_str_mv	CIENCIAS EXATAS E DA TERRA
description	Articulatory Synthesis consists in reproducing speech by means of models of the vocal tract and of articulatory processes. Recent advances in Magnetic Resonance Imaging (MRI) allowed for important improvements with respect to the speech comprehension and the forms taken by the vocal tract. However, one of the main challenges in the field is the fast and at the same time high-quality acquisition of image sequences. Since adopting more powerful acquisition devices might be financially inviable, a more feasible solution proposed in the literature is the resolution enhancement of the images by changes introduced in the acquisition model. This dissertation proposes a method for the spatio-temporal resolution enhancement of the obtained sequences using only digital image processing techniques. The approach involves two stages: (1) the temporal resolution enhancement by means of a motion compensated interpolation technique; and (2) the spatial resolution enhancement by means of a super resolution image reconstruction technique. With respect to the temporal resolution enhancement, two interpolation models are compared: linear interpolation considering two adjacent images and cubic splines interpolation considering four contiguous images. Since both models performed equally in the experiments, the linear interpolation was adopted, for its simplicity and lower computational cost. The initial goal of the spatial resolution enhancement was an extension of the candidate s approach proposed in her master s thesis. Adopting a maximum a posteriori probability approach (MAP), the high-resolution images were modeled using the Markov Random Fields (MRF) Generalized Isotropic Multi-Level Logistic (GIMLL) model and the Iterated Conditional Modes (ICM) algorithm. However, even though the approach has presented promising results, due to the dimension of the target problem, the algorithm presented high computational cost. Considering this limitation, an adaptation of the Wiener filter for the super-resolution reconstruction problem was considered. Inspired by two methods available in the literature, three approaches were proposed: the statistical interpolation, the multi-temporal approach, and the adaptive Wiener filter. In all cases, a separable Markovian model and an isotropic model were compared in the characterization of the spatial correlation structures. These models were used to characterize the correlation and cross correlation of observations for the statistical interpolation and the multi-temporal approach. On the other hand, for the adaptive Wiener filter, these models were used to characterize the a priori spatial correlation. According to the conducted experiments, the isotropic model outperformed the separable Markovian model. Besides, considering all Wiener filter-based approaches and the initial approach based on the GIMLL model, the adaptive Wiener filter outperformed all other approaches and was also faster than a single iteration of the GIMLL-based approach.
publishDate	2011
dc.date.issued.fl_str_mv	2011-10-31
dc.date.available.fl_str_mv	2012-01-11 2016-06-02T19:02:41Z
dc.date.accessioned.fl_str_mv	2016-06-02T19:02:41Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/doctoralThesis
format	doctoralThesis
status_str	publishedVersion
dc.identifier.citation.fl_str_mv	MARTINS-LEMOS, Ana Luísa Dine. Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória. 2011. 122 f. Tese (Doutorado em Multidisciplinar) - Universidade Federal de São Carlos, São Carlos, 2011.
dc.identifier.uri.fl_str_mv	https://repositorio.ufscar.br/handle/ufscar/257
identifier_str_mv	MARTINS-LEMOS, Ana Luísa Dine. Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória. 2011. 122 f. Tese (Doutorado em Multidisciplinar) - Universidade Federal de São Carlos, São Carlos, 2011.
url	https://repositorio.ufscar.br/handle/ufscar/257
dc.language.iso.fl_str_mv	por
language	por
dc.relation.confidence.fl_str_mv	-1 -1
dc.relation.authority.fl_str_mv	787a11d3-939f-471e-8064-0e22da9d895f
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Universidade Federal de São Carlos
dc.publisher.program.fl_str_mv	Programa de Pós-Graduação em Biotecnologia - PPGBiotec
dc.publisher.initials.fl_str_mv	UFSCar
dc.publisher.country.fl_str_mv	BR
publisher.none.fl_str_mv	Universidade Federal de São Carlos
dc.source.none.fl_str_mv	reponame:Repositório Institucional da UFSCAR instname:Universidade Federal de São Carlos (UFSCAR) instacron:UFSCAR
instname_str	Universidade Federal de São Carlos (UFSCAR)
instacron_str	UFSCAR
institution	UFSCAR
reponame_str	Repositório Institucional da UFSCAR
collection	Repositório Institucional da UFSCAR
bitstream.url.fl_str_mv	https://repositorio.ufscar.br/bitstream/ufscar/257/1/4013.pdf https://repositorio.ufscar.br/bitstream/ufscar/257/2/4013.pdf.txt https://repositorio.ufscar.br/bitstream/ufscar/257/3/4013.pdf.jpg
bitstream.checksum.fl_str_mv	7a490bae6746f0e2b3c0c3472c9fd2b4 bd987b748044f088564e0c508238e923 afa63600d1d5d838fc173ca2e737f3b0
bitstream.checksumAlgorithm.fl_str_mv	MD5 MD5 MD5
repository.name.fl_str_mv	Repositório Institucional da UFSCAR - Universidade Federal de São Carlos (UFSCAR)
repository.mail.fl_str_mv
_version_	1802136243368099840

Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória

Registros relacionados