Generative models for open set image recognition

Marcos Felipe Vendramini Carvalho

Generative models for open set image recognition

Detalhes bibliográficos
Autor(a) principal:	Marcos Felipe Vendramini Carvalho
Data de Publicação:	2021
Tipo de documento:	Dissertação
Idioma:	por
Título da fonte:	Repositório Institucional da UFMG
Texto Completo:	http://hdl.handle.net/1843/49031
Resumo:	Image classification methods are usually trained to perform predictions taking into account a predefined group of known classes. Real-world problems, however, may not allow for a full knowledge of the input and label spaces, making failures in recognition a hazard to deep visual learning. Open set recognition methods are characterized by the ability to correctly identify inputs of known and unknown classes. In this context, we propose GeMOS: simple and plug-and-play open set recognition modules that can be attached to pre-trained Deep Neural Networks for visual recognition. The GeMOS framework pairs pre-trained Convolutional Neural Networks with generative models for open set recognition to extract open set scores for each sample, allowing for failure recognition in object recognition tasks. We conduct a thorough evaluation of the proposed method against state-of-the-art open set algorithms. In these tests, different datasets were used, such as in and out of distribution, with the MNIST as in distribution, we reached the F1-score of 0.91 while the best baseline of the referent test reached 0.85, and, for the CIFAR10 as in distribution, we reached the F1-score of 0.93 while the best baseline of the benchmark test reached 0.81. Tests were also performed using the same dataset as in and out of distribution, a more complex case that showed the dependence of the method on the accuracy of pre-trained networks. The results showed that GeMOS competes with more complex and expensive models and in many cases outperforms them. For future work, we initially propose to apply the method to other domains and real-world problems, and to modify the method for other computer vision tasks.

Metadados do item

id	UFMG_f1bb81d87af9f6c5f5e369afaf453a60
oai_identifier_str	oai:repositorio.ufmg.br:1843/49031
network_acronym_str	UFMG
network_name_str	Repositório Institucional da UFMG
repository_id_str
spelling	Jefersson Alex Dos Santoshttp://lattes.cnpq.br/2171782600728348Alexei Manso Correa MachadoHugo Neves de OliveiraMatheus Pinheiro FerreiraFabrício Murai Ferreirahttp://lattes.cnpq.br/8206412711416236Marcos Felipe Vendramini Carvalho2023-01-20T14:02:44Z2023-01-20T14:02:44Z2021-12-10http://hdl.handle.net/1843/49031Image classification methods are usually trained to perform predictions taking into account a predefined group of known classes. Real-world problems, however, may not allow for a full knowledge of the input and label spaces, making failures in recognition a hazard to deep visual learning. Open set recognition methods are characterized by the ability to correctly identify inputs of known and unknown classes. In this context, we propose GeMOS: simple and plug-and-play open set recognition modules that can be attached to pre-trained Deep Neural Networks for visual recognition. The GeMOS framework pairs pre-trained Convolutional Neural Networks with generative models for open set recognition to extract open set scores for each sample, allowing for failure recognition in object recognition tasks. We conduct a thorough evaluation of the proposed method against state-of-the-art open set algorithms. In these tests, different datasets were used, such as in and out of distribution, with the MNIST as in distribution, we reached the F1-score of 0.91 while the best baseline of the referent test reached 0.85, and, for the CIFAR10 as in distribution, we reached the F1-score of 0.93 while the best baseline of the benchmark test reached 0.81. Tests were also performed using the same dataset as in and out of distribution, a more complex case that showed the dependence of the method on the accuracy of pre-trained networks. The results showed that GeMOS competes with more complex and expensive models and in many cases outperforms them. For future work, we initially propose to apply the method to other domains and real-world problems, and to modify the method for other computer vision tasks.Os métodos de classificação de imagens geralmente são treinados para realizar previsões levando em consideração um grupo predefinido de classes conhecidas. Problemas do mundo real, no entanto, podem não permitir um conhecimento completo de todas as entrada e rótulos do espaço, fazendo com que as falhas no reconhecimento seja um problema para o aprendizado visual profundo. Os métodos de reconhecimento de conjunto aberto são caracterizados pela capacidade de identificar corretamente as entradas de classes conhecidas e desconhecidas. Neste contexto, propomos GeMOS: módulos de reconhecimento de conjunto aberto simples que podem ser anexados a Redes Neurais Profundas pré-treinadas para reconhecimento visual. O framework GeMOS emparelha redes neurais convolucionais pré-treinadas com modelos generativos para introduzir o reconhecimento de conjunto aberto através da extração de pontuações para cada amostra, permitindo o reconhecimento de falha em tarefas de reconhecimento de objeto. Conduzimos uma avaliação completa do método proposto em comparação com algoritmos do estado-da-arte de conjunto aberto. Nesses testes foram utilizados diferentes datasets como dentro e fora da distribuição, onde, com o MNIST dentro da distribuição, atingimos o F1-score de 0.91 enquanto o melhor baseline do teste referente atingiu 0.85, e, para o CIFAR10 dentro da distribuição, atingimos o F1-score de 0.93 enquanto o melhor baseline do teste referente atingiu 0.81. Também foram realizados teste utilizando um mesmo dataset como dentro e fora da distribuição, um caso mais complexo que mostrou a dependência do método a acurácia das redes pré treinadas. Os resultados mostraram que o GeMOS compete com modelos mais complexos e caros e em muitos casos os superam. Para os trabalhos futuros propomos inicialmente aplicar o método a outros domínios e a problemas do mundo real, e modificar o método para outras tarefas de visão computacional.FAPEMIG - Fundação de Amparo à Pesquisa do Estado de Minas GeraisporUniversidade Federal de Minas GeraisPrograma de Pós-Graduação em Ciência da ComputaçãoUFMGBrasilICX - DEPARTAMENTO DE CIÊNCIA DA COMPUTAÇÃOComputação – TesesVisão computacional –TesesRedes neurais (Computação) – TesesReconhecimento de conjunto aberto –Teses.Computer visionNeural networksOpen set recognitionGenerative models for open set image recognitionModelos generativos para reconhecimento de imagens em conjunto abertoinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisinfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da UFMGinstname:Universidade Federal de Minas Gerais (UFMG)instacron:UFMGLICENSElicense.txtlicense.txttext/plain; charset=utf-82118https://repositorio.ufmg.br/bitstream/1843/49031/4/license.txtcda590c95a0b51b4d15f60c9642ca272MD54ORIGINALMarcos Felipe Vendramini - Revisado 08dez.pdfMarcos Felipe Vendramini - Revisado 08dez.pdfapplication/pdf4410946https://repositorio.ufmg.br/bitstream/1843/49031/3/Marcos%20Felipe%20Vendramini%20-%20Revisado%2008dez.pdfb6f2e82c2f978816c35a1608d11b5e8bMD531843/490312023-01-20 11:02:44.581oai:repositorio.ufmg.br:1843/49031TElDRU7Dh0EgREUgRElTVFJJQlVJw4fDg08gTsODTy1FWENMVVNJVkEgRE8gUkVQT1NJVMOTUklPIElOU1RJVFVDSU9OQUwgREEgVUZNRwoKQ29tIGEgYXByZXNlbnRhw6fDo28gZGVzdGEgbGljZW7Dp2EsIHZvY8OqIChvIGF1dG9yIChlcykgb3UgbyB0aXR1bGFyIGRvcyBkaXJlaXRvcyBkZSBhdXRvcikgY29uY2VkZSBhbyBSZXBvc2l0w7NyaW8gSW5zdGl0dWNpb25hbCBkYSBVRk1HIChSSS1VRk1HKSBvIGRpcmVpdG8gbsOjbyBleGNsdXNpdm8gZSBpcnJldm9nw6F2ZWwgZGUgcmVwcm9kdXppciBlL291IGRpc3RyaWJ1aXIgYSBzdWEgcHVibGljYcOnw6NvIChpbmNsdWluZG8gbyByZXN1bW8pIHBvciB0b2RvIG8gbXVuZG8gbm8gZm9ybWF0byBpbXByZXNzbyBlIGVsZXRyw7RuaWNvIGUgZW0gcXVhbHF1ZXIgbWVpbywgaW5jbHVpbmRvIG9zIGZvcm1hdG9zIMOhdWRpbyBvdSB2w61kZW8uCgpWb2PDqiBkZWNsYXJhIHF1ZSBjb25oZWNlIGEgcG9sw610aWNhIGRlIGNvcHlyaWdodCBkYSBlZGl0b3JhIGRvIHNldSBkb2N1bWVudG8gZSBxdWUgY29uaGVjZSBlIGFjZWl0YSBhcyBEaXJldHJpemVzIGRvIFJJLVVGTUcuCgpWb2PDqiBjb25jb3JkYSBxdWUgbyBSZXBvc2l0w7NyaW8gSW5zdGl0dWNpb25hbCBkYSBVRk1HIHBvZGUsIHNlbSBhbHRlcmFyIG8gY29udGXDumRvLCB0cmFuc3BvciBhIHN1YSBwdWJsaWNhw6fDo28gcGFyYSBxdWFscXVlciBtZWlvIG91IGZvcm1hdG8gcGFyYSBmaW5zIGRlIHByZXNlcnZhw6fDo28uCgpWb2PDqiB0YW1iw6ltIGNvbmNvcmRhIHF1ZSBvIFJlcG9zaXTDs3JpbyBJbnN0aXR1Y2lvbmFsIGRhIFVGTUcgcG9kZSBtYW50ZXIgbWFpcyBkZSB1bWEgY8OzcGlhIGRlIHN1YSBwdWJsaWNhw6fDo28gcGFyYSBmaW5zIGRlIHNlZ3VyYW7Dp2EsIGJhY2stdXAgZSBwcmVzZXJ2YcOnw6NvLgoKVm9jw6ogZGVjbGFyYSBxdWUgYSBzdWEgcHVibGljYcOnw6NvIMOpIG9yaWdpbmFsIGUgcXVlIHZvY8OqIHRlbSBvIHBvZGVyIGRlIGNvbmNlZGVyIG9zIGRpcmVpdG9zIGNvbnRpZG9zIG5lc3RhIGxpY2Vuw6dhLiBWb2PDqiB0YW1iw6ltIGRlY2xhcmEgcXVlIG8gZGVww7NzaXRvIGRlIHN1YSBwdWJsaWNhw6fDo28gbsOjbywgcXVlIHNlamEgZGUgc2V1IGNvbmhlY2ltZW50bywgaW5mcmluZ2UgZGlyZWl0b3MgYXV0b3JhaXMgZGUgbmluZ3XDqW0uCgpDYXNvIGEgc3VhIHB1YmxpY2HDp8OjbyBjb250ZW5oYSBtYXRlcmlhbCBxdWUgdm9jw6ogbsOjbyBwb3NzdWkgYSB0aXR1bGFyaWRhZGUgZG9zIGRpcmVpdG9zIGF1dG9yYWlzLCB2b2PDqiBkZWNsYXJhIHF1ZSBvYnRldmUgYSBwZXJtaXNzw6NvIGlycmVzdHJpdGEgZG8gZGV0ZW50b3IgZG9zIGRpcmVpdG9zIGF1dG9yYWlzIHBhcmEgY29uY2VkZXIgYW8gUmVwb3NpdMOzcmlvIEluc3RpdHVjaW9uYWwgZGEgVUZNRyBvcyBkaXJlaXRvcyBhcHJlc2VudGFkb3MgbmVzdGEgbGljZW7Dp2EsIGUgcXVlIGVzc2UgbWF0ZXJpYWwgZGUgcHJvcHJpZWRhZGUgZGUgdGVyY2Vpcm9zIGVzdMOhIGNsYXJhbWVudGUgaWRlbnRpZmljYWRvIGUgcmVjb25oZWNpZG8gbm8gdGV4dG8gb3Ugbm8gY29udGXDumRvIGRhIHB1YmxpY2HDp8OjbyBvcmEgZGVwb3NpdGFkYS4KCkNBU08gQSBQVUJMSUNBw4fDg08gT1JBIERFUE9TSVRBREEgVEVOSEEgU0lETyBSRVNVTFRBRE8gREUgVU0gUEFUUk9Dw41OSU8gT1UgQVBPSU8gREUgVU1BIEFHw4pOQ0lBIERFIEZPTUVOVE8gT1UgT1VUUk8gT1JHQU5JU01PLCBWT0PDiiBERUNMQVJBIFFVRSBSRVNQRUlUT1UgVE9ET1MgRSBRVUFJU1FVRVIgRElSRUlUT1MgREUgUkVWSVPDg08gQ09NTyBUQU1Cw4lNIEFTIERFTUFJUyBPQlJJR0HDh8OVRVMgRVhJR0lEQVMgUE9SIENPTlRSQVRPIE9VIEFDT1JETy4KCk8gUmVwb3NpdMOzcmlvIEluc3RpdHVjaW9uYWwgZGEgVUZNRyBzZSBjb21wcm9tZXRlIGEgaWRlbnRpZmljYXIgY2xhcmFtZW50ZSBvIHNldSBub21lKHMpIG91IG8ocykgbm9tZXMocykgZG8ocykgZGV0ZW50b3IoZXMpIGRvcyBkaXJlaXRvcyBhdXRvcmFpcyBkYSBwdWJsaWNhw6fDo28sIGUgbsOjbyBmYXLDoSBxdWFscXVlciBhbHRlcmHDp8OjbywgYWzDqW0gZGFxdWVsYXMgY29uY2VkaWRhcyBwb3IgZXN0YSBsaWNlbsOnYS4KRepositório de PublicaçõesPUBhttps://repositorio.ufmg.br/oaiopendoar:2023-01-20T14:02:44Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)false
dc.title.pt_BR.fl_str_mv	Generative models for open set image recognition
dc.title.alternative.pt_BR.fl_str_mv	Modelos generativos para reconhecimento de imagens em conjunto aberto
title	Generative models for open set image recognition
spellingShingle	Generative models for open set image recognition Marcos Felipe Vendramini Carvalho Computer vision Neural networks Open set recognition Computação – Teses Visão computacional –Teses Redes neurais (Computação) – Teses Reconhecimento de conjunto aberto –Teses.
title_short	Generative models for open set image recognition
title_full	Generative models for open set image recognition
title_fullStr	Generative models for open set image recognition
title_full_unstemmed	Generative models for open set image recognition
title_sort	Generative models for open set image recognition
author	Marcos Felipe Vendramini Carvalho
author_facet	Marcos Felipe Vendramini Carvalho
author_role	author
dc.contributor.advisor1.fl_str_mv	Jefersson Alex Dos Santos
dc.contributor.advisor1Lattes.fl_str_mv	http://lattes.cnpq.br/2171782600728348
dc.contributor.advisor-co1.fl_str_mv	Alexei Manso Correa Machado
dc.contributor.referee1.fl_str_mv	Hugo Neves de Oliveira
dc.contributor.referee2.fl_str_mv	Matheus Pinheiro Ferreira
dc.contributor.referee3.fl_str_mv	Fabrício Murai Ferreira
dc.contributor.authorLattes.fl_str_mv	http://lattes.cnpq.br/8206412711416236
dc.contributor.author.fl_str_mv	Marcos Felipe Vendramini Carvalho
contributor_str_mv	Jefersson Alex Dos Santos Alexei Manso Correa Machado Hugo Neves de Oliveira Matheus Pinheiro Ferreira Fabrício Murai Ferreira
dc.subject.por.fl_str_mv	Computer vision Neural networks Open set recognition
topic	Computer vision Neural networks Open set recognition Computação – Teses Visão computacional –Teses Redes neurais (Computação) – Teses Reconhecimento de conjunto aberto –Teses.
dc.subject.other.pt_BR.fl_str_mv	Computação – Teses Visão computacional –Teses Redes neurais (Computação) – Teses Reconhecimento de conjunto aberto –Teses.
description	Image classification methods are usually trained to perform predictions taking into account a predefined group of known classes. Real-world problems, however, may not allow for a full knowledge of the input and label spaces, making failures in recognition a hazard to deep visual learning. Open set recognition methods are characterized by the ability to correctly identify inputs of known and unknown classes. In this context, we propose GeMOS: simple and plug-and-play open set recognition modules that can be attached to pre-trained Deep Neural Networks for visual recognition. The GeMOS framework pairs pre-trained Convolutional Neural Networks with generative models for open set recognition to extract open set scores for each sample, allowing for failure recognition in object recognition tasks. We conduct a thorough evaluation of the proposed method against state-of-the-art open set algorithms. In these tests, different datasets were used, such as in and out of distribution, with the MNIST as in distribution, we reached the F1-score of 0.91 while the best baseline of the referent test reached 0.85, and, for the CIFAR10 as in distribution, we reached the F1-score of 0.93 while the best baseline of the benchmark test reached 0.81. Tests were also performed using the same dataset as in and out of distribution, a more complex case that showed the dependence of the method on the accuracy of pre-trained networks. The results showed that GeMOS competes with more complex and expensive models and in many cases outperforms them. For future work, we initially propose to apply the method to other domains and real-world problems, and to modify the method for other computer vision tasks.
publishDate	2021
dc.date.issued.fl_str_mv	2021-12-10
dc.date.accessioned.fl_str_mv	2023-01-20T14:02:44Z
dc.date.available.fl_str_mv	2023-01-20T14:02:44Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://hdl.handle.net/1843/49031
url	http://hdl.handle.net/1843/49031
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.publisher.none.fl_str_mv	Universidade Federal de Minas Gerais
dc.publisher.program.fl_str_mv	Programa de Pós-Graduação em Ciência da Computação
dc.publisher.initials.fl_str_mv	UFMG
dc.publisher.country.fl_str_mv	Brasil
dc.publisher.department.fl_str_mv	ICX - DEPARTAMENTO DE CIÊNCIA DA COMPUTAÇÃO
publisher.none.fl_str_mv	Universidade Federal de Minas Gerais
dc.source.none.fl_str_mv	reponame:Repositório Institucional da UFMG instname:Universidade Federal de Minas Gerais (UFMG) instacron:UFMG
instname_str	Universidade Federal de Minas Gerais (UFMG)
instacron_str	UFMG
institution	UFMG
reponame_str	Repositório Institucional da UFMG
collection	Repositório Institucional da UFMG
bitstream.url.fl_str_mv	https://repositorio.ufmg.br/bitstream/1843/49031/4/license.txt https://repositorio.ufmg.br/bitstream/1843/49031/3/Marcos%20Felipe%20Vendramini%20-%20Revisado%2008dez.pdf
bitstream.checksum.fl_str_mv	cda590c95a0b51b4d15f60c9642ca272 b6f2e82c2f978816c35a1608d11b5e8b
bitstream.checksumAlgorithm.fl_str_mv	MD5 MD5
repository.name.fl_str_mv	Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)
repository.mail.fl_str_mv
_version_	1803589495104208896

Generative models for open set image recognition

Registros relacionados