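Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória (Resolution enhancement of magnetic resonance images of the vocal tract used in articulatory synthesis models)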

Bibliographic details
Main author: Martins, Ana Luísa Dine
Publication date: 2011
Document type: Thesis
Language: Portuguese (por)
Source title: Repositório Institucional da UFSCAR
Full text: https://repositorio.ufscar.br/handle/ufscar/257
Abstract: Articulatory synthesis consists of reproducing speech by means of models of the vocal tract and of the articulatory processes involved. Recent advances in Magnetic Resonance Imaging (MRI) have enabled important progress in the understanding of speech and of the shapes taken by the vocal tract. However, one of the main challenges in the field remains the fast and, at the same time, high-quality acquisition of image sequences. Since adopting more powerful acquisition devices may be financially unviable, a more feasible solution proposed in the literature is to enhance image resolution by modifying the acquisition process. This thesis proposes a method for the spatio-temporal resolution enhancement of the acquired sequences using only digital image processing techniques. The approach involves two stages: (1) temporal resolution enhancement by means of a motion-compensated interpolation technique; and (2) spatial resolution enhancement by means of a super-resolution image reconstruction technique. For the temporal resolution enhancement, two interpolation models are compared: linear interpolation over two adjacent images and cubic spline interpolation over four consecutive images. Since the experiments showed no significant difference between the two models, linear interpolation was adopted for its simplicity and lower computational cost. The initial goal for the spatial resolution enhancement was to extend the approach the candidate proposed in her master's thesis. Adopting a maximum a posteriori (MAP) approach, the high-resolution images were modeled using the Generalized Isotropic Multi-Level Logistic (GIMLL) Markov Random Field (MRF) model, and the Iterated Conditional Modes (ICM) algorithm was used to sequentially maximize the local conditional probabilities. However, although the approach produced promising results, the ICM algorithm had a high computational cost because of the dimension of the target problem.
Given this limitation, the Wiener filter was adapted to the super-resolution reconstruction problem. Inspired by two methods available in the literature, three approaches were developed: statistical interpolation, a multi-temporal approach, and an adaptive Wiener filter. In all cases, a separable Markovian model and an isotropic model were compared for characterizing the spatial correlation structures. For the statistical interpolation and the multi-temporal approach, these models were used to characterize the correlation of the observations and the cross-correlation. For the adaptive Wiener filter, in contrast, they were used to characterize the a priori spatial correlation. In the experiments, the isotropic model outperformed the separable Markovian model. Moreover, among all the Wiener filter-based approaches and the initial GIMLL-based approach, the adaptive Wiener filter gave the best results and was also faster than a single iteration of the GIMLL-based approach.
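As a rough illustration of the two spatial correlation models compared in the abstract, the sketch below computes Wiener interpolation weights for one high-resolution sample from four neighbouring observations. The parametric forms (`rho ** |dx| * rho ** |dy|` for the separable Markovian model, `rho ** distance` for the isotropic model), the values of `rho` and `noise_var`, and all function names are assumptions made for illustration; this record does not give the formulation actually used in the thesis.

```python
import math


def separable_markov(dx, dy, rho=0.9, var=1.0):
    """Assumed separable first-order Markov model: var * rho^|dx| * rho^|dy|."""
    return var * rho ** abs(dx) * rho ** abs(dy)


def isotropic(dx, dy, rho=0.9, var=1.0):
    """Assumed isotropic model: var * rho^(Euclidean distance)."""
    return var * rho ** math.hypot(dx, dy)


def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def wiener_weights(obs, target, corr, noise_var=0.01):
    """Weights w solving (R + noise_var * I) w = p.

    R holds the correlations between observation positions and p the
    cross-correlations between each observation and the target position.
    """
    R = [[corr(xi - xj, yi - yj) + (noise_var if i == j else 0.0)
          for j, (xj, yj) in enumerate(obs)]
         for i, (xi, yi) in enumerate(obs)]
    p = [corr(xi - target[0], yi - target[1]) for xi, yi in obs]
    return solve(R, p)


# Four low-resolution samples on a unit square; estimate the centre sample.
obs = [(0, 0), (1, 0), (0, 1), (1, 1)]
target = (0.5, 0.5)
w_sep = wiener_weights(obs, target, separable_markov)
w_iso = wiener_weights(obs, target, isotropic)
print("separable:", w_sep)
print("isotropic:", w_iso)
```

Under these assumed forms the two models agree along the grid axes and differ on diagonals, where the isotropic model decays with Euclidean distance rather than with the sum of the axis offsets.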
id SCAR_ef2e8de4a5991d2f9baa5fce2652c554
oai_identifier_str oai:repositorio.ufscar.br:ufscar/257
network_acronym_str SCAR
network_name_str Repositório Institucional da UFSCAR
repository_id_str 4322
spelling Martins, Ana Luísa Dine
Mascarenhas, Nelson Delfino d'Ávila
http://lattes.cnpq.br/0557976975338451
http://lattes.cnpq.br/9249940379843542
a0e87a30-e22a-4048-9fc9-30b5964d1658
2016-06-02T19:02:41Z
2012-01-11
2016-06-02T19:02:41Z
2011-10-31
MARTINS-LEMOS, Ana Luísa Dine. Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória. 2011. 122 f. Tese (Doutorado em Multidisciplinar) - Universidade Federal de São Carlos, São Carlos, 2011.
https://repositorio.ufscar.br/handle/ufscar/257
Universidade Federal de Minas Gerais
application/pdf
por
Universidade Federal de São Carlos
Programa de Pós-Graduação em Biotecnologia - PPGBiotec
UFSCar
BR
Biotecnologia
Processamento de imagens - técnicas digitais
Restauração de imagens
Reconstrução por super resolução
CIENCIAS EXATAS E DA TERRA
Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória
info:eu-repo/semantics/publishedVersion
info:eu-repo/semantics/doctoralThesis
-1
-1
787a11d3-939f-471e-8064-0e22da9d895f
info:eu-repo/semantics/openAccess
reponame:Repositório Institucional da UFSCAR
instname:Universidade Federal de São Carlos (UFSCAR)
instacron:UFSCAR
ORIGINAL
4013.pdf
application/pdf
3833898
https://repositorio.ufscar.br/bitstream/ufscar/257/1/4013.pdf
7a490bae6746f0e2b3c0c3472c9fd2b4
MD5
1
TEXT
4013.pdf.txt
4013.pdf.txt
Extracted text
text/plain
244288
https://repositorio.ufscar.br/bitstream/ufscar/257/2/4013.pdf.txt
bd987b748044f088564e0c508238e923
MD5
2
THUMBNAIL
4013.pdf.jpg
4013.pdf.jpg
IM Thumbnail
image/jpeg
6184
https://repositorio.ufscar.br/bitstream/ufscar/257/3/4013.pdf.jpg
afa63600d1d5d838fc173ca2e737f3b0
MD5
3
ufscar/257
2023-09-18 18:30:37.232
oai:repositorio.ufscar.br:ufscar/257
Repositório Institucional
PUB
https://repositorio.ufscar.br/oai/request
opendoar:4322
2023-09-18T18:30:37
Repositório Institucional da UFSCAR - Universidade Federal de São Carlos (UFSCAR)
false
dc.title.por.fl_str_mv Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória
title Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória
spellingShingle Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória
Martins, Ana Luísa Dine
Biotecnologia
Processamento de imagens - técnicas digitais
Restauração de imagens
Reconstrução por super resolução
CIENCIAS EXATAS E DA TERRA
author Martins, Ana Luísa Dine
author_facet Martins, Ana Luísa Dine
author_role author
dc.contributor.authorlattes.por.fl_str_mv http://lattes.cnpq.br/9249940379843542
dc.contributor.author.fl_str_mv Martins, Ana Luísa Dine
dc.contributor.advisor1.fl_str_mv Mascarenhas, Nelson Delfino d'Ávila
dc.contributor.advisor1Lattes.fl_str_mv http://lattes.cnpq.br/0557976975338451
dc.contributor.authorID.fl_str_mv a0e87a30-e22a-4048-9fc9-30b5964d1658
contributor_str_mv Mascarenhas, Nelson Delfino d'Ávila
dc.subject.por.fl_str_mv Biotecnologia
Processamento de imagens - técnicas digitais
Restauração de imagens
Reconstrução por super resolução
topic Biotecnologia
Processamento de imagens - técnicas digitais
Restauração de imagens
Reconstrução por super resolução
CIENCIAS EXATAS E DA TERRA
dc.subject.cnpq.fl_str_mv CIENCIAS EXATAS E DA TERRA
publishDate 2011
dc.date.issued.fl_str_mv 2011-10-31
dc.date.available.fl_str_mv 2012-01-11
2016-06-02T19:02:41Z
dc.date.accessioned.fl_str_mv 2016-06-02T19:02:41Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/doctoralThesis
format doctoralThesis
status_str publishedVersion
dc.identifier.citation.fl_str_mv MARTINS-LEMOS, Ana Luísa Dine. Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória. 2011. 122 f. Tese (Doutorado em Multidisciplinar) - Universidade Federal de São Carlos, São Carlos, 2011.
dc.identifier.uri.fl_str_mv https://repositorio.ufscar.br/handle/ufscar/257
identifier_str_mv MARTINS-LEMOS, Ana Luísa Dine. Aumento de resolução de imagens de ressonância magnética do trato vocal utilizadas em modelos de síntese articulatória. 2011. 122 f. Tese (Doutorado em Multidisciplinar) - Universidade Federal de São Carlos, São Carlos, 2011.
url https://repositorio.ufscar.br/handle/ufscar/257
dc.language.iso.fl_str_mv por
language por
dc.relation.confidence.fl_str_mv -1
-1
dc.relation.authority.fl_str_mv 787a11d3-939f-471e-8064-0e22da9d895f
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade Federal de São Carlos
dc.publisher.program.fl_str_mv Programa de Pós-Graduação em Biotecnologia - PPGBiotec
dc.publisher.initials.fl_str_mv UFSCar
dc.publisher.country.fl_str_mv BR
publisher.none.fl_str_mv Universidade Federal de São Carlos
dc.source.none.fl_str_mv reponame:Repositório Institucional da UFSCAR
instname:Universidade Federal de São Carlos (UFSCAR)
instacron:UFSCAR
instname_str Universidade Federal de São Carlos (UFSCAR)
instacron_str UFSCAR
institution UFSCAR
reponame_str Repositório Institucional da UFSCAR
collection Repositório Institucional da UFSCAR
bitstream.url.fl_str_mv https://repositorio.ufscar.br/bitstream/ufscar/257/1/4013.pdf
https://repositorio.ufscar.br/bitstream/ufscar/257/2/4013.pdf.txt
https://repositorio.ufscar.br/bitstream/ufscar/257/3/4013.pdf.jpg
bitstream.checksum.fl_str_mv 7a490bae6746f0e2b3c0c3472c9fd2b4
bd987b748044f088564e0c508238e923
afa63600d1d5d838fc173ca2e737f3b0
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
repository.name.fl_str_mv Repositório Institucional da UFSCAR - Universidade Federal de São Carlos (UFSCAR)
repository.mail.fl_str_mv
_version_ 1802136243368099840