Deep Learning Techniques for Content-based Medical Image Retrieval

Detalhes bibliográficos
Autor(a) principal: Motta, Cezanne Alves Mendes
Data de Publicação: 2022
Tipo de documento: Dissertação
Idioma: eng
Título da fonte: Biblioteca Digital de Teses e Dissertações da USP
Texto Completo: https://www.teses.usp.br/teses/disponiveis/55/55134/tde-15092022-105141/
Resumo: In the context of Computer-Aided Diagnosis, it is often not enough for the system to produce correct predictions. When physicians are certain about the diagnosis of a given case, they can accept or disregard the systems prediction according to their own conclusions. However, in cases where they are uncertain, the physicians may not trust the system prediction without an explanation for it. In the medical domain, where the users are ethically and legally responsible for their decisions, the system should be able to articulate the reasons for its prediction in some way. One strategy that has been suggested to provide this support for decision is to retrieve images from similar cases that were already diagnosed. The physicians can then compare the retrieved cases to the one under consideration and decide if such diagnosis apply. Traditionally, Content-Based Medical Image Retrieval (CBMIR) has been done with hand-crafted features. Despite showing significant improvements in many other medical images analysis tasks, Deep Learning is not frequently used in CBMIR. Most current approaches to integrate Deep Learning into CBMIR use features obtained from models trained to classify images. These models tend to learn features that are correlated with the classes and ignore the ones that are not. Despite being useful to categorize images, the features learned from such models ignore intra-class variations that may be relevant to finding visually similar images. The ideal would be to retrieve the most visually similar case possible, not just one that belongs to the same class, so that the physicians can have more confidence in their decision. Autoencoders, on the other hand, are Deep Learning models that aim to learn features that describe the intrinsic factors of variations of a dataset. In this work, we investigate and discuss the use of Deep Learning for medical image retrieval, presenting the theoretical foundation and a critical analysis of current approaches found in literature. We also propose an approach for CBMIR based on Variational Autoencoders and show that this approach can yield better results than the ones based solely on classification and can even be used in combination to improve the results of the latter.
id USP_4d0f08aa640a42fab8fd12e29a989e30
oai_identifier_str oai:teses.usp.br:tde-15092022-105141
network_acronym_str USP
network_name_str Biblioteca Digital de Teses e Dissertações da USP
repository_id_str 2721
spelling Deep Learning Techniques for Content-based Medical Image RetrievalTécnicas de aprendizado profundo para busca de imagens médicas por conteúdoAprendizado profundoBusca por imagem baseada em conteúdoCBIRContent-based image retrievalDeep LearningImagens médicasMedical imagesVariational autoencoderIn the context of Computer-Aided Diagnosis, it is often not enough for the system to produce correct predictions. When physicians are certain about the diagnosis of a given case, they can accept or disregard the systems prediction according to their own conclusions. However, in cases where they are uncertain, the physicians may not trust the system prediction without an explanation for it. In the medical domain, where the users are ethically and legally responsible for their decisions, the system should be able to articulate the reasons for its prediction in some way. One strategy that has been suggested to provide this support for decision is to retrieve images from similar cases that were already diagnosed. The physicians can then compare the retrieved cases to the one under consideration and decide if such diagnosis apply. Traditionally, Content-Based Medical Image Retrieval (CBMIR) has been done with hand-crafted features. Despite showing significant improvements in many other medical images analysis tasks, Deep Learning is not frequently used in CBMIR. Most current approaches to integrate Deep Learning into CBMIR use features obtained from models trained to classify images. These models tend to learn features that are correlated with the classes and ignore the ones that are not. Despite being useful to categorize images, the features learned from such models ignore intra-class variations that may be relevant to finding visually similar images. The ideal would be to retrieve the most visually similar case possible, not just one that belongs to the same class, so that the physicians can have more confidence in their decision. Autoencoders, on the other hand, are Deep Learning models that aim to learn features that describe the intrinsic factors of variations of a dataset. In this work, we investigate and discuss the use of Deep Learning for medical image retrieval, presenting the theoretical foundation and a critical analysis of current approaches found in literature. We also propose an approach for CBMIR based on Variational Autoencoders and show that this approach can yield better results than the ones based solely on classification and can even be used in combination to improve the results of the latter.No contexto de Diagnóstico Assistido por Computador (CAD), às vezes não é suficiente que o sistema produza predições corretas. Quando os médicos estão seguros a respeito do diagnóstico de um caso em particular, eles podem aceitar ou desprezar a predição do sistema de acordo com suas próprias conclusões. Mas em casos em que os médicos estejam inseguros, eles podem não confiar na predição do sistema sem que haja alguma explicação para ela. No domínio médico, onde os usuários são ética e legalmente responsáveis por suas decisões, o sistema deve ser capaz de articular de alguma forma as razões de suas decisões. Uma estratégia que tem sido sugerida para prover esse suporte à decisão é a de recuperar imagens similares que já foram diagnosticadas. Dessa forma, os médicos podem, então, comparar os casos retornados ao caso em consideração e decidir se os diagnósticos desses se aplicam. Tradicionalmente, Recuperação de Imagens Médicas por Conteúdo (CBMIR) tem sido feita com representações projetadas manualmente. Apesar de demonstrar melhorias significativas em muitas outras tarefas de análise de imagens médicas, Deep Learning (DL) não é frequentemente utilizado em CBMIR. A maioria das abordagens atuais para integrar DL em CBMIR utiliza representações obtidas de modelos treinados para classificar imagens. Esses modelos tendem a aprender representações que captu- ram características que são correlacionadas com as classes e ignoram as características que não o são. Apesar de serem úteis parar classificar imagens, essas representações ignoram variações intra-classe que podem ser relevantes para encontrar imagens visualmente semelhantes. O ideal seria retornar casos com a maior similaridade visual possível, para que médicos possam ter mais confiança nas suas decisões, e não somente casos que pertençam à mesma classe. Autoencoders, por outro lado, são modelos de DL que visam aprender representações que descrevam fatores de variação intrínsecos do conjunto de dados. Este trabalho visa investigar e discutir o uso de DL para busca de imagens médicas, apresentando a fundamentação teórica e análise crítica das abordagens atuais encontradas na literatura. Também é apresentada uma abordagem para CBMIR baseada em Variational Autoencoders e mostrado que essa abordagem pode produzir resultados superiores àquelas baseadas puramente em classificação, e pode inclusive ser utilizada em conjunto com estas.Biblioteca Digitais de Teses e Dissertações da USPTraina, Agma Juci MachadoMotta, Cezanne Alves Mendes2022-07-08info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://www.teses.usp.br/teses/disponiveis/55/55134/tde-15092022-105141/reponame:Biblioteca Digital de Teses e Dissertações da USPinstname:Universidade de São Paulo (USP)instacron:USPLiberar o conteúdo para acesso público.info:eu-repo/semantics/openAccesseng2022-11-21T11:38:07Zoai:teses.usp.br:tde-15092022-105141Biblioteca Digital de Teses e Dissertaçõeshttp://www.teses.usp.br/PUBhttp://www.teses.usp.br/cgi-bin/mtd2br.plvirginia@if.usp.br|| atendimento@aguia.usp.br||virginia@if.usp.bropendoar:27212022-11-21T11:38:07Biblioteca Digital de Teses e Dissertações da USP - Universidade de São Paulo (USP)false
dc.title.none.fl_str_mv Deep Learning Techniques for Content-based Medical Image Retrieval
Técnicas de aprendizado profundo para busca de imagens médicas por conteúdo
title Deep Learning Techniques for Content-based Medical Image Retrieval
spellingShingle Deep Learning Techniques for Content-based Medical Image Retrieval
Motta, Cezanne Alves Mendes
Aprendizado profundo
Busca por imagem baseada em conteúdo
CBIR
Content-based image retrieval
Deep Learning
Imagens médicas
Medical images
Variational autoencoder
title_short Deep Learning Techniques for Content-based Medical Image Retrieval
title_full Deep Learning Techniques for Content-based Medical Image Retrieval
title_fullStr Deep Learning Techniques for Content-based Medical Image Retrieval
title_full_unstemmed Deep Learning Techniques for Content-based Medical Image Retrieval
title_sort Deep Learning Techniques for Content-based Medical Image Retrieval
author Motta, Cezanne Alves Mendes
author_facet Motta, Cezanne Alves Mendes
author_role author
dc.contributor.none.fl_str_mv Traina, Agma Juci Machado
dc.contributor.author.fl_str_mv Motta, Cezanne Alves Mendes
dc.subject.por.fl_str_mv Aprendizado profundo
Busca por imagem baseada em conteúdo
CBIR
Content-based image retrieval
Deep Learning
Imagens médicas
Medical images
Variational autoencoder
topic Aprendizado profundo
Busca por imagem baseada em conteúdo
CBIR
Content-based image retrieval
Deep Learning
Imagens médicas
Medical images
Variational autoencoder
description In the context of Computer-Aided Diagnosis, it is often not enough for the system to produce correct predictions. When physicians are certain about the diagnosis of a given case, they can accept or disregard the systems prediction according to their own conclusions. However, in cases where they are uncertain, the physicians may not trust the system prediction without an explanation for it. In the medical domain, where the users are ethically and legally responsible for their decisions, the system should be able to articulate the reasons for its prediction in some way. One strategy that has been suggested to provide this support for decision is to retrieve images from similar cases that were already diagnosed. The physicians can then compare the retrieved cases to the one under consideration and decide if such diagnosis apply. Traditionally, Content-Based Medical Image Retrieval (CBMIR) has been done with hand-crafted features. Despite showing significant improvements in many other medical images analysis tasks, Deep Learning is not frequently used in CBMIR. Most current approaches to integrate Deep Learning into CBMIR use features obtained from models trained to classify images. These models tend to learn features that are correlated with the classes and ignore the ones that are not. Despite being useful to categorize images, the features learned from such models ignore intra-class variations that may be relevant to finding visually similar images. The ideal would be to retrieve the most visually similar case possible, not just one that belongs to the same class, so that the physicians can have more confidence in their decision. Autoencoders, on the other hand, are Deep Learning models that aim to learn features that describe the intrinsic factors of variations of a dataset. In this work, we investigate and discuss the use of Deep Learning for medical image retrieval, presenting the theoretical foundation and a critical analysis of current approaches found in literature. We also propose an approach for CBMIR based on Variational Autoencoders and show that this approach can yield better results than the ones based solely on classification and can even be used in combination to improve the results of the latter.
publishDate 2022
dc.date.none.fl_str_mv 2022-07-08
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://www.teses.usp.br/teses/disponiveis/55/55134/tde-15092022-105141/
url https://www.teses.usp.br/teses/disponiveis/55/55134/tde-15092022-105141/
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv
dc.rights.driver.fl_str_mv Liberar o conteúdo para acesso público.
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Liberar o conteúdo para acesso público.
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.coverage.none.fl_str_mv
dc.publisher.none.fl_str_mv Biblioteca Digitais de Teses e Dissertações da USP
publisher.none.fl_str_mv Biblioteca Digitais de Teses e Dissertações da USP
dc.source.none.fl_str_mv
reponame:Biblioteca Digital de Teses e Dissertações da USP
instname:Universidade de São Paulo (USP)
instacron:USP
instname_str Universidade de São Paulo (USP)
instacron_str USP
institution USP
reponame_str Biblioteca Digital de Teses e Dissertações da USP
collection Biblioteca Digital de Teses e Dissertações da USP
repository.name.fl_str_mv Biblioteca Digital de Teses e Dissertações da USP - Universidade de São Paulo (USP)
repository.mail.fl_str_mv virginia@if.usp.br|| atendimento@aguia.usp.br||virginia@if.usp.br
_version_ 1809090751949176832