Patterns and randomness in networks for computer vision: from graphs to neural networks

Detalhes bibliográficos
Autor(a) principal: Scabini, Leonardo Felipe dos Santos
Data de Publicação: 2023
Tipo de documento: Tese
Idioma: eng
Título da fonte: Biblioteca Digital de Teses e Dissertações da USP
Texto Completo: https://www.teses.usp.br/teses/disponiveis/76/76132/tde-28092023-095319/
Resumo: Complex networks pervade various aspects of nature, society, and science. One of the most discussed types of network is a neural network, particularly in the last decade with the advances in artificial intelligence (AI). However, little is known about the structure of artificial neural networks (ANNs) in terms of topology and dynamics, from a network science point of view. Moreover, these models are known for being black-box systems, and may also exhibit unexpected behavior. This thesis focuses on AI networks, neural or not, for computer vision (CV) - the sub-field of AI that deals with visual information. Our study explores several aspects of network structure, patterns, and randomness, and their potential for understanding, enhancing, and developing new network-based CV systems. Firstly, we propose a novel network science-based framework for examining ANNs, focusing on their neuronal centrality. This approach, named Bag-of-Neurons (BoN), uncovers the relationship between structural patterns within trained ANNs and their performance and proved to be a promising approach for understanding these systems. These findings led to a new ANN random initialization technique, named Preferential Attachment Rewiring (PARw), which enhances performance and accelerates the training process of shallow and deep ANNs. By leveraging network science, we also develop CV techniques for texture analysis problems, ranging from pure texture images to texture in the wild. We introduce the Spatio-Spectral Network (SSN), a method for image modeling using a graph of pixels, which achieves state-of-the-art (SOTA) results in benchmark datasets. Another new proposal (SSR) couples SSNs with small randomized neural networks, improving performance without a substantial increase in computational costs. The thesis further presents the Randomized encoding of Aggregated Deep Activation Maps (RADAM), a transfer learning method for deep convolutional networks (CNNs) that offers remarkable performance in all CV tasks that were evaluated. We also explore the potential of fully randomized deep CNNs (FR-DCNN) coupled with PARw and RADAM, which prove to be robust texture feature extractors. Lastly, we apply our methods to real-world tasks, including prostate cancer and COVID-19 diagnosis, environmental biosensors using plants, and plant species identification. Our methods, particularly RADAM, consistently achieved SOTA results in these tasks. In conclusion, this thesis introduces six network-based methods for ANNs and CV, and the results show that they consistently improve the SOTA on various image classification problems.
id USP_7e2551f26aefe7cf4c97620dcffe0fbf
oai_identifier_str oai:teses.usp.br:tde-28092023-095319
network_acronym_str USP
network_name_str Biblioteca Digital de Teses e Dissertações da USP
repository_id_str 2721
spelling Patterns and randomness in networks for computer vision: from graphs to neural networksPadrões e aleatoriedade em redes para visão computacional: de grafos à redes neuraisAprendizado de máquinaAprendizagem profundaArtificial neural networksCiência das redesComputer visionDeep learningMachine learningNetwork scienceRedes neurais artificiaisVisão computacionalComplex networks pervade various aspects of nature, society, and science. One of the most discussed types of network is a neural network, particularly in the last decade with the advances in artificial intelligence (AI). However, little is known about the structure of artificial neural networks (ANNs) in terms of topology and dynamics, from a network science point of view. Moreover, these models are known for being black-box systems, and may also exhibit unexpected behavior. This thesis focuses on AI networks, neural or not, for computer vision (CV) - the sub-field of AI that deals with visual information. Our study explores several aspects of network structure, patterns, and randomness, and their potential for understanding, enhancing, and developing new network-based CV systems. Firstly, we propose a novel network science-based framework for examining ANNs, focusing on their neuronal centrality. This approach, named Bag-of-Neurons (BoN), uncovers the relationship between structural patterns within trained ANNs and their performance and proved to be a promising approach for understanding these systems. These findings led to a new ANN random initialization technique, named Preferential Attachment Rewiring (PARw), which enhances performance and accelerates the training process of shallow and deep ANNs. By leveraging network science, we also develop CV techniques for texture analysis problems, ranging from pure texture images to texture in the wild. We introduce the Spatio-Spectral Network (SSN), a method for image modeling using a graph of pixels, which achieves state-of-the-art (SOTA) results in benchmark datasets. Another new proposal (SSR) couples SSNs with small randomized neural networks, improving performance without a substantial increase in computational costs. The thesis further presents the Randomized encoding of Aggregated Deep Activation Maps (RADAM), a transfer learning method for deep convolutional networks (CNNs) that offers remarkable performance in all CV tasks that were evaluated. We also explore the potential of fully randomized deep CNNs (FR-DCNN) coupled with PARw and RADAM, which prove to be robust texture feature extractors. Lastly, we apply our methods to real-world tasks, including prostate cancer and COVID-19 diagnosis, environmental biosensors using plants, and plant species identification. Our methods, particularly RADAM, consistently achieved SOTA results in these tasks. In conclusion, this thesis introduces six network-based methods for ANNs and CV, and the results show that they consistently improve the SOTA on various image classification problems.As redes complexas permeiam vários aspectos da natureza, da sociedade e da ciência. Um dos tipos de rede mais discutidos é a rede neural, principalmente na última década, com os avanços da inteligência artificial (AI). No entanto, pouco se sabe sobre a estrutura de redes neurais artificiais (ANNs) em termos de topologia e dinâmica, do ponto de vista de ciência das redes. Além disso, esses modelos são conhecidos por serem sistemas caixa-preta e também podem apresentar comportamentos inesperados. Esta tese foca em redes de AI, neurais ou não, para visão computacional (CV) - o subcampo da AI que lida com informações visuais. Nosso estudo explora vários aspectos da estrutura, padrões e aleatoriedade em redes, e seu potencial para entender, aprimorar e desenvolver novos sistemas de CV. Em primeiro lugar, propomos uma nova metodologia baseada em ciência das redes para examinar ANNs, com foco em sua centralidade neuronal. Esta abordagem (BoN) revela a relação entre padrões estruturais dentro de ANNs treinadas e seu desempenho, provando ser promissora para entender esses sistemas. Essas descobertas levaram a uma nova técnica de inicialização aleatória de ANN (PARw), que melhora o desempenho e acelera o processo de treinamento de redes rasas e profundas. Também desenvolvemos técnicas de CV para problemas de análise de textura utilizando ciência das redes, focando entre imagens de textura pura ou sem controle. Apresentamos um método (SSN) de modelagem de imagem usando um grafo de pixels, que alcança resultados estado-da-arte em bases de dados de referência. Outra nova proposta (SSR) acopla SSNs à pequenas redes neurais aleatórias, melhorando o desempenho sem um aumento substancial nos custos computacionais. A tese apresenta ainda RADAM, um método de transferência de aprendizado para redes convolucionais profundas que também demonstra desempenho notável em CV. Também exploramos o potencial de redes profundas totalmente randomizadas (FR-DCNN), que combinadas com PARw e RADAM provaram ser extratores robustos de características. Por fim, aplicamos nossos métodos em tarefas reais, incluindo diagnóstico de câncer de próstata e de COVID-19, uso folhas como biossensores, e identificação de espécies de planta. Nossos métodos, particularmente o RADAM, alcançaram os melhores resultados nessas tarefas. Em conclusão, esta tese apresenta seis métodos baseados em redes para ANNs e CV, e os resultados mostram que eles melhoram consistentemente o estado-da-arte em vários problemas de classificação de imagens.Biblioteca Digitais de Teses e Dissertações da USPBaets, Bernard DeBruno, Odemir MartinezScabini, Leonardo Felipe dos Santos2023-07-24info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesisapplication/pdfhttps://www.teses.usp.br/teses/disponiveis/76/76132/tde-28092023-095319/reponame:Biblioteca Digital de Teses e Dissertações da USPinstname:Universidade de São Paulo (USP)instacron:USPLiberar o conteúdo para acesso público.info:eu-repo/semantics/openAccesseng2023-09-29T13:51:03Zoai:teses.usp.br:tde-28092023-095319Biblioteca Digital de Teses e Dissertaçõeshttp://www.teses.usp.br/PUBhttp://www.teses.usp.br/cgi-bin/mtd2br.plvirginia@if.usp.br|| atendimento@aguia.usp.br||virginia@if.usp.bropendoar:27212023-09-29T13:51:03Biblioteca Digital de Teses e Dissertações da USP - Universidade de São Paulo (USP)false
dc.title.none.fl_str_mv Patterns and randomness in networks for computer vision: from graphs to neural networks
Padrões e aleatoriedade em redes para visão computacional: de grafos à redes neurais
title Patterns and randomness in networks for computer vision: from graphs to neural networks
spellingShingle Patterns and randomness in networks for computer vision: from graphs to neural networks
Scabini, Leonardo Felipe dos Santos
Aprendizado de máquina
Aprendizagem profunda
Artificial neural networks
Ciência das redes
Computer vision
Deep learning
Machine learning
Network science
Redes neurais artificiais
Visão computacional
title_short Patterns and randomness in networks for computer vision: from graphs to neural networks
title_full Patterns and randomness in networks for computer vision: from graphs to neural networks
title_fullStr Patterns and randomness in networks for computer vision: from graphs to neural networks
title_full_unstemmed Patterns and randomness in networks for computer vision: from graphs to neural networks
title_sort Patterns and randomness in networks for computer vision: from graphs to neural networks
author Scabini, Leonardo Felipe dos Santos
author_facet Scabini, Leonardo Felipe dos Santos
author_role author
dc.contributor.none.fl_str_mv Baets, Bernard De
Bruno, Odemir Martinez
dc.contributor.author.fl_str_mv Scabini, Leonardo Felipe dos Santos
dc.subject.por.fl_str_mv Aprendizado de máquina
Aprendizagem profunda
Artificial neural networks
Ciência das redes
Computer vision
Deep learning
Machine learning
Network science
Redes neurais artificiais
Visão computacional
topic Aprendizado de máquina
Aprendizagem profunda
Artificial neural networks
Ciência das redes
Computer vision
Deep learning
Machine learning
Network science
Redes neurais artificiais
Visão computacional
description Complex networks pervade various aspects of nature, society, and science. One of the most discussed types of network is a neural network, particularly in the last decade with the advances in artificial intelligence (AI). However, little is known about the structure of artificial neural networks (ANNs) in terms of topology and dynamics, from a network science point of view. Moreover, these models are known for being black-box systems, and may also exhibit unexpected behavior. This thesis focuses on AI networks, neural or not, for computer vision (CV) - the sub-field of AI that deals with visual information. Our study explores several aspects of network structure, patterns, and randomness, and their potential for understanding, enhancing, and developing new network-based CV systems. Firstly, we propose a novel network science-based framework for examining ANNs, focusing on their neuronal centrality. This approach, named Bag-of-Neurons (BoN), uncovers the relationship between structural patterns within trained ANNs and their performance and proved to be a promising approach for understanding these systems. These findings led to a new ANN random initialization technique, named Preferential Attachment Rewiring (PARw), which enhances performance and accelerates the training process of shallow and deep ANNs. By leveraging network science, we also develop CV techniques for texture analysis problems, ranging from pure texture images to texture in the wild. We introduce the Spatio-Spectral Network (SSN), a method for image modeling using a graph of pixels, which achieves state-of-the-art (SOTA) results in benchmark datasets. Another new proposal (SSR) couples SSNs with small randomized neural networks, improving performance without a substantial increase in computational costs. The thesis further presents the Randomized encoding of Aggregated Deep Activation Maps (RADAM), a transfer learning method for deep convolutional networks (CNNs) that offers remarkable performance in all CV tasks that were evaluated. We also explore the potential of fully randomized deep CNNs (FR-DCNN) coupled with PARw and RADAM, which prove to be robust texture feature extractors. Lastly, we apply our methods to real-world tasks, including prostate cancer and COVID-19 diagnosis, environmental biosensors using plants, and plant species identification. Our methods, particularly RADAM, consistently achieved SOTA results in these tasks. In conclusion, this thesis introduces six network-based methods for ANNs and CV, and the results show that they consistently improve the SOTA on various image classification problems.
publishDate 2023
dc.date.none.fl_str_mv 2023-07-24
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/doctoralThesis
format doctoralThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://www.teses.usp.br/teses/disponiveis/76/76132/tde-28092023-095319/
url https://www.teses.usp.br/teses/disponiveis/76/76132/tde-28092023-095319/
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv
dc.rights.driver.fl_str_mv Liberar o conteúdo para acesso público.
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Liberar o conteúdo para acesso público.
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.coverage.none.fl_str_mv
dc.publisher.none.fl_str_mv Biblioteca Digitais de Teses e Dissertações da USP
publisher.none.fl_str_mv Biblioteca Digitais de Teses e Dissertações da USP
dc.source.none.fl_str_mv
reponame:Biblioteca Digital de Teses e Dissertações da USP
instname:Universidade de São Paulo (USP)
instacron:USP
instname_str Universidade de São Paulo (USP)
instacron_str USP
institution USP
reponame_str Biblioteca Digital de Teses e Dissertações da USP
collection Biblioteca Digital de Teses e Dissertações da USP
repository.name.fl_str_mv Biblioteca Digital de Teses e Dissertações da USP - Universidade de São Paulo (USP)
repository.mail.fl_str_mv virginia@if.usp.br|| atendimento@aguia.usp.br||virginia@if.usp.br
_version_ 1809091160104239104