Patterns and randomness in networks for computer vision: from graphs to neural networks

Scabini, Leonardo Felipe dos Santos

Patterns and randomness in networks for computer vision: from graphs to neural networks

Detalhes bibliográficos
Autor(a) principal:	Scabini, Leonardo Felipe dos Santos
Data de Publicação:	2023
Tipo de documento:	Tese
Idioma:	eng
Título da fonte:	Biblioteca Digital de Teses e Dissertações da USP
Texto Completo:	https://www.teses.usp.br/teses/disponiveis/76/76132/tde-28092023-095319/
Resumo:	Complex networks pervade various aspects of nature, society, and science. One of the most discussed types of network is a neural network, particularly in the last decade with the advances in artificial intelligence (AI). However, little is known about the structure of artificial neural networks (ANNs) in terms of topology and dynamics, from a network science point of view. Moreover, these models are known for being black-box systems, and may also exhibit unexpected behavior. This thesis focuses on AI networks, neural or not, for computer vision (CV) - the sub-field of AI that deals with visual information. Our study explores several aspects of network structure, patterns, and randomness, and their potential for understanding, enhancing, and developing new network-based CV systems. Firstly, we propose a novel network science-based framework for examining ANNs, focusing on their neuronal centrality. This approach, named Bag-of-Neurons (BoN), uncovers the relationship between structural patterns within trained ANNs and their performance and proved to be a promising approach for understanding these systems. These findings led to a new ANN random initialization technique, named Preferential Attachment Rewiring (PARw), which enhances performance and accelerates the training process of shallow and deep ANNs. By leveraging network science, we also develop CV techniques for texture analysis problems, ranging from pure texture images to texture in the wild. We introduce the Spatio-Spectral Network (SSN), a method for image modeling using a graph of pixels, which achieves state-of-the-art (SOTA) results in benchmark datasets. Another new proposal (SSR) couples SSNs with small randomized neural networks, improving performance without a substantial increase in computational costs. The thesis further presents the Randomized encoding of Aggregated Deep Activation Maps (RADAM), a transfer learning method for deep convolutional networks (CNNs) that offers remarkable performance in all CV tasks that were evaluated. We also explore the potential of fully randomized deep CNNs (FR-DCNN) coupled with PARw and RADAM, which prove to be robust texture feature extractors. Lastly, we apply our methods to real-world tasks, including prostate cancer and COVID-19 diagnosis, environmental biosensors using plants, and plant species identification. Our methods, particularly RADAM, consistently achieved SOTA results in these tasks. In conclusion, this thesis introduces six network-based methods for ANNs and CV, and the results show that they consistently improve the SOTA on various image classification problems.

Metadados do item

id	USP_7e2551f26aefe7cf4c97620dcffe0fbf
oai_identifier_str	oai:teses.usp.br:tde-28092023-095319
network_acronym_str	USP
network_name_str	Biblioteca Digital de Teses e Dissertações da USP
repository_id_str	2721
spelling	Patterns and randomness in networks for computer vision: from graphs to neural networksPadrões e aleatoriedade em redes para visão computacional: de grafos à redes neuraisAprendizado de máquinaAprendizagem profundaArtificial neural networksCiência das redesComputer visionDeep learningMachine learningNetwork scienceRedes neurais artificiaisVisão computacionalComplex networks pervade various aspects of nature, society, and science. One of the most discussed types of network is a neural network, particularly in the last decade with the advances in artificial intelligence (AI). However, little is known about the structure of artificial neural networks (ANNs) in terms of topology and dynamics, from a network science point of view. Moreover, these models are known for being black-box systems, and may also exhibit unexpected behavior. This thesis focuses on AI networks, neural or not, for computer vision (CV) - the sub-field of AI that deals with visual information. Our study explores several aspects of network structure, patterns, and randomness, and their potential for understanding, enhancing, and developing new network-based CV systems. Firstly, we propose a novel network science-based framework for examining ANNs, focusing on their neuronal centrality. This approach, named Bag-of-Neurons (BoN), uncovers the relationship between structural patterns within trained ANNs and their performance and proved to be a promising approach for understanding these systems. These findings led to a new ANN random initialization technique, named Preferential Attachment Rewiring (PARw), which enhances performance and accelerates the training process of shallow and deep ANNs. By leveraging network science, we also develop CV techniques for texture analysis problems, ranging from pure texture images to texture in the wild. We introduce the Spatio-Spectral Network (SSN), a method for image modeling using a graph of pixels, which achieves state-of-the-art (SOTA) results in benchmark datasets. Another new proposal (SSR) couples SSNs with small randomized neural networks, improving performance without a substantial increase in computational costs. The thesis further presents the Randomized encoding of Aggregated Deep Activation Maps (RADAM), a transfer learning method for deep convolutional networks (CNNs) that offers remarkable performance in all CV tasks that were evaluated. We also explore the potential of fully randomized deep CNNs (FR-DCNN) coupled with PARw and RADAM, which prove to be robust texture feature extractors. Lastly, we apply our methods to real-world tasks, including prostate cancer and COVID-19 diagnosis, environmental biosensors using plants, and plant species identification. Our methods, particularly RADAM, consistently achieved SOTA results in these tasks. In conclusion, this thesis introduces six network-based methods for ANNs and CV, and the results show that they consistently improve the SOTA on various image classification problems.As redes complexas permeiam vários aspectos da natureza, da sociedade e da ciência. Um dos tipos de rede mais discutidos é a rede neural, principalmente na última década, com os avanços da inteligência artificial (AI). No entanto, pouco se sabe sobre a estrutura de redes neurais artificiais (ANNs) em termos de topologia e dinâmica, do ponto de vista de ciência das redes. Além disso, esses modelos são conhecidos por serem sistemas caixa-preta e também podem apresentar comportamentos inesperados. Esta tese foca em redes de AI, neurais ou não, para visão computacional (CV) - o subcampo da AI que lida com informações visuais. Nosso estudo explora vários aspectos da estrutura, padrões e aleatoriedade em redes, e seu potencial para entender, aprimorar e desenvolver novos sistemas de CV. Em primeiro lugar, propomos uma nova metodologia baseada em ciência das redes para examinar ANNs, com foco em sua centralidade neuronal. Esta abordagem (BoN) revela a relação entre padrões estruturais dentro de ANNs treinadas e seu desempenho, provando ser promissora para entender esses sistemas. Essas descobertas levaram a uma nova técnica de inicialização aleatória de ANN (PARw), que melhora o desempenho e acelera o processo de treinamento de redes rasas e profundas. Também desenvolvemos técnicas de CV para problemas de análise de textura utilizando ciência das redes, focando entre imagens de textura pura ou sem controle. Apresentamos um método (SSN) de modelagem de imagem usando um grafo de pixels, que alcança resultados estado-da-arte em bases de dados de referência. Outra nova proposta (SSR) acopla SSNs à pequenas redes neurais aleatórias, melhorando o desempenho sem um aumento substancial nos custos computacionais. A tese apresenta ainda RADAM, um método de transferência de aprendizado para redes convolucionais profundas que também demonstra desempenho notável em CV. Também exploramos o potencial de redes profundas totalmente randomizadas (FR-DCNN), que combinadas com PARw e RADAM provaram ser extratores robustos de características. Por fim, aplicamos nossos métodos em tarefas reais, incluindo diagnóstico de câncer de próstata e de COVID-19, uso folhas como biossensores, e identificação de espécies de planta. Nossos métodos, particularmente o RADAM, alcançaram os melhores resultados nessas tarefas. Em conclusão, esta tese apresenta seis métodos baseados em redes para ANNs e CV, e os resultados mostram que eles melhoram consistentemente o estado-da-arte em vários problemas de classificação de imagens.Biblioteca Digitais de Teses e Dissertações da USPBaets, Bernard DeBruno, Odemir MartinezScabini, Leonardo Felipe dos Santos2023-07-24info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesisapplication/pdfhttps://www.teses.usp.br/teses/disponiveis/76/76132/tde-28092023-095319/reponame:Biblioteca Digital de Teses e Dissertações da USPinstname:Universidade de São Paulo (USP)instacron:USPLiberar o conteúdo para acesso público.info:eu-repo/semantics/openAccesseng2023-09-29T13:51:03Zoai:teses.usp.br:tde-28092023-095319Biblioteca Digital de Teses e Dissertaçõeshttp://www.teses.usp.br/PUBhttp://www.teses.usp.br/cgi-bin/mtd2br.plvirginia@if.usp.br\|\| atendimento@aguia.usp.br\|\|virginia@if.usp.bropendoar:27212023-09-29T13:51:03Biblioteca Digital de Teses e Dissertações da USP - Universidade de São Paulo (USP)false
dc.title.none.fl_str_mv	Patterns and randomness in networks for computer vision: from graphs to neural networks Padrões e aleatoriedade em redes para visão computacional: de grafos à redes neurais
title	Patterns and randomness in networks for computer vision: from graphs to neural networks
spellingShingle	Patterns and randomness in networks for computer vision: from graphs to neural networks Scabini, Leonardo Felipe dos Santos Aprendizado de máquina Aprendizagem profunda Artificial neural networks Ciência das redes Computer vision Deep learning Machine learning Network science Redes neurais artificiais Visão computacional
title_short	Patterns and randomness in networks for computer vision: from graphs to neural networks
title_full	Patterns and randomness in networks for computer vision: from graphs to neural networks
title_fullStr	Patterns and randomness in networks for computer vision: from graphs to neural networks
title_full_unstemmed	Patterns and randomness in networks for computer vision: from graphs to neural networks
title_sort	Patterns and randomness in networks for computer vision: from graphs to neural networks
author	Scabini, Leonardo Felipe dos Santos
author_facet	Scabini, Leonardo Felipe dos Santos
author_role	author
dc.contributor.none.fl_str_mv	Baets, Bernard De Bruno, Odemir Martinez
dc.contributor.author.fl_str_mv	Scabini, Leonardo Felipe dos Santos
dc.subject.por.fl_str_mv	Aprendizado de máquina Aprendizagem profunda Artificial neural networks Ciência das redes Computer vision Deep learning Machine learning Network science Redes neurais artificiais Visão computacional
topic	Aprendizado de máquina Aprendizagem profunda Artificial neural networks Ciência das redes Computer vision Deep learning Machine learning Network science Redes neurais artificiais Visão computacional
description	Complex networks pervade various aspects of nature, society, and science. One of the most discussed types of network is a neural network, particularly in the last decade with the advances in artificial intelligence (AI). However, little is known about the structure of artificial neural networks (ANNs) in terms of topology and dynamics, from a network science point of view. Moreover, these models are known for being black-box systems, and may also exhibit unexpected behavior. This thesis focuses on AI networks, neural or not, for computer vision (CV) - the sub-field of AI that deals with visual information. Our study explores several aspects of network structure, patterns, and randomness, and their potential for understanding, enhancing, and developing new network-based CV systems. Firstly, we propose a novel network science-based framework for examining ANNs, focusing on their neuronal centrality. This approach, named Bag-of-Neurons (BoN), uncovers the relationship between structural patterns within trained ANNs and their performance and proved to be a promising approach for understanding these systems. These findings led to a new ANN random initialization technique, named Preferential Attachment Rewiring (PARw), which enhances performance and accelerates the training process of shallow and deep ANNs. By leveraging network science, we also develop CV techniques for texture analysis problems, ranging from pure texture images to texture in the wild. We introduce the Spatio-Spectral Network (SSN), a method for image modeling using a graph of pixels, which achieves state-of-the-art (SOTA) results in benchmark datasets. Another new proposal (SSR) couples SSNs with small randomized neural networks, improving performance without a substantial increase in computational costs. The thesis further presents the Randomized encoding of Aggregated Deep Activation Maps (RADAM), a transfer learning method for deep convolutional networks (CNNs) that offers remarkable performance in all CV tasks that were evaluated. We also explore the potential of fully randomized deep CNNs (FR-DCNN) coupled with PARw and RADAM, which prove to be robust texture feature extractors. Lastly, we apply our methods to real-world tasks, including prostate cancer and COVID-19 diagnosis, environmental biosensors using plants, and plant species identification. Our methods, particularly RADAM, consistently achieved SOTA results in these tasks. In conclusion, this thesis introduces six network-based methods for ANNs and CV, and the results show that they consistently improve the SOTA on various image classification problems.
publishDate	2023
dc.date.none.fl_str_mv	2023-07-24
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/doctoralThesis
format	doctoralThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	https://www.teses.usp.br/teses/disponiveis/76/76132/tde-28092023-095319/
url	https://www.teses.usp.br/teses/disponiveis/76/76132/tde-28092023-095319/
dc.language.iso.fl_str_mv	eng
language	eng
dc.relation.none.fl_str_mv
dc.rights.driver.fl_str_mv	Liberar o conteúdo para acesso público. info:eu-repo/semantics/openAccess
rights_invalid_str_mv	Liberar o conteúdo para acesso público.
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.coverage.none.fl_str_mv
dc.publisher.none.fl_str_mv	Biblioteca Digitais de Teses e Dissertações da USP
publisher.none.fl_str_mv	Biblioteca Digitais de Teses e Dissertações da USP
dc.source.none.fl_str_mv	reponame:Biblioteca Digital de Teses e Dissertações da USP instname:Universidade de São Paulo (USP) instacron:USP
instname_str	Universidade de São Paulo (USP)
instacron_str	USP
institution	USP
reponame_str	Biblioteca Digital de Teses e Dissertações da USP
collection	Biblioteca Digital de Teses e Dissertações da USP
repository.name.fl_str_mv	Biblioteca Digital de Teses e Dissertações da USP - Universidade de São Paulo (USP)
repository.mail.fl_str_mv	virginia@if.usp.br\|\| atendimento@aguia.usp.br\|\|virginia@if.usp.br
_version_	1809091160104239104

Patterns and randomness in networks for computer vision: from graphs to neural networks

Registros relacionados