Modular dynamic neural network: a continual learning architecture

Turner, Daniel; Cardoso, Pedro; Rodrigues, João

Modular dynamic neural network: a continual learning architecture

Detalhes bibliográficos
Autor(a) principal:	Turner, Daniel
Data de Publicação:	2021
Outros Autores:	Cardoso, Pedro, Rodrigues, João
Tipo de documento:	Artigo
Idioma:	eng
Título da fonte:	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo:	http://hdl.handle.net/10400.1/17460
Resumo:	Learning to recognize a new object after having learned to recognize other objects may be a simple task for a human, but not for machines. The present go-to approaches for teaching a machine to recognize a set of objects are based on the use of deep neural networks (DNN). So, intuitively, the solution for teaching new objects on the fly to a machine should be DNN. The problem is that the trained DNN weights used to classify the initial set of objects are extremely fragile, meaning that any change to those weights can severely damage the capacity to perform the initial recognitions; this phenomenon is known as catastrophic forgetting (CF). This paper presents a new (DNN) continual learning (CL) architecture that can deal with CF, the modular dynamic neural network (MDNN). The presented architecture consists of two main components: (a) the ResNet50-based feature extraction component as the backbone; and (b) the modular dynamic classification component, which consists of multiple sub-networks and progressively builds itself up in a tree-like structure that rearranges itself as it learns over time in such a way that each sub-network can function independently. The main contribution of the paper is a new architecture that is strongly based on its modular dynamic training feature. This modular structure allows for new classes to be added while only altering specific sub-networks in such a way that previously known classes are not forgotten. Tests on the CORe50 dataset showed results above the state of the art for CL architectures.

Metadados do item

id	RCAP_49abd75a8479e66d3e8fd3e5f7d37cdc
oai_identifier_str	oai:sapientia.ualg.pt:10400.1/17460
network_acronym_str	RCAP
network_name_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str	7160
spelling	Modular dynamic neural network: a continual learning architectureContinual learningNeural networksCatastrophic forgettingObject recognitionLearning to recognize a new object after having learned to recognize other objects may be a simple task for a human, but not for machines. The present go-to approaches for teaching a machine to recognize a set of objects are based on the use of deep neural networks (DNN). So, intuitively, the solution for teaching new objects on the fly to a machine should be DNN. The problem is that the trained DNN weights used to classify the initial set of objects are extremely fragile, meaning that any change to those weights can severely damage the capacity to perform the initial recognitions; this phenomenon is known as catastrophic forgetting (CF). This paper presents a new (DNN) continual learning (CL) architecture that can deal with CF, the modular dynamic neural network (MDNN). The presented architecture consists of two main components: (a) the ResNet50-based feature extraction component as the backbone; and (b) the modular dynamic classification component, which consists of multiple sub-networks and progressively builds itself up in a tree-like structure that rearranges itself as it learns over time in such a way that each sub-network can function independently. The main contribution of the paper is a new architecture that is strongly based on its modular dynamic training feature. This modular structure allows for new classes to be added while only altering specific sub-networks in such a way that previously known classes are not forgotten. Tests on the CORe50 dataset showed results above the state of the art for CL architectures.MDPISapientiaTurner, DanielCardoso, PedroRodrigues, João2022-01-10T15:13:00Z2021-12-182021-12-23T15:06:58Z2021-12-18T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.1/17460engApplied Sciences 11 (24): 12078 (2021)2076-341710.3390/app112412078info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-07-24T10:29:34Zoai:sapientia.ualg.pt:10400.1/17460Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T20:07:23.678789Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv	Modular dynamic neural network: a continual learning architecture
title	Modular dynamic neural network: a continual learning architecture
spellingShingle	Modular dynamic neural network: a continual learning architecture Turner, Daniel Continual learning Neural networks Catastrophic forgetting Object recognition
title_short	Modular dynamic neural network: a continual learning architecture
title_full	Modular dynamic neural network: a continual learning architecture
title_fullStr	Modular dynamic neural network: a continual learning architecture
title_full_unstemmed	Modular dynamic neural network: a continual learning architecture
title_sort	Modular dynamic neural network: a continual learning architecture
author	Turner, Daniel
author_facet	Turner, Daniel Cardoso, Pedro Rodrigues, João
author_role	author
author2	Cardoso, Pedro Rodrigues, João
author2_role	author author
dc.contributor.none.fl_str_mv	Sapientia
dc.contributor.author.fl_str_mv	Turner, Daniel Cardoso, Pedro Rodrigues, João
dc.subject.por.fl_str_mv	Continual learning Neural networks Catastrophic forgetting Object recognition
topic	Continual learning Neural networks Catastrophic forgetting Object recognition
description	Learning to recognize a new object after having learned to recognize other objects may be a simple task for a human, but not for machines. The present go-to approaches for teaching a machine to recognize a set of objects are based on the use of deep neural networks (DNN). So, intuitively, the solution for teaching new objects on the fly to a machine should be DNN. The problem is that the trained DNN weights used to classify the initial set of objects are extremely fragile, meaning that any change to those weights can severely damage the capacity to perform the initial recognitions; this phenomenon is known as catastrophic forgetting (CF). This paper presents a new (DNN) continual learning (CL) architecture that can deal with CF, the modular dynamic neural network (MDNN). The presented architecture consists of two main components: (a) the ResNet50-based feature extraction component as the backbone; and (b) the modular dynamic classification component, which consists of multiple sub-networks and progressively builds itself up in a tree-like structure that rearranges itself as it learns over time in such a way that each sub-network can function independently. The main contribution of the paper is a new architecture that is strongly based on its modular dynamic training feature. This modular structure allows for new classes to be added while only altering specific sub-networks in such a way that previously known classes are not forgotten. Tests on the CORe50 dataset showed results above the state of the art for CL architectures.
publishDate	2021
dc.date.none.fl_str_mv	2021-12-18 2021-12-23T15:06:58Z 2021-12-18T00:00:00Z 2022-01-10T15:13:00Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/article
format	article
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://hdl.handle.net/10400.1/17460
url	http://hdl.handle.net/10400.1/17460
dc.language.iso.fl_str_mv	eng
language	eng
dc.relation.none.fl_str_mv	Applied Sciences 11 (24): 12078 (2021) 2076-3417 10.3390/app112412078
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	MDPI
publisher.none.fl_str_mv	MDPI
dc.source.none.fl_str_mv	reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP
instname_str	Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str	RCAAP
institution	RCAAP
reponame_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_	1799133318585778176

Modular dynamic neural network: a continual learning architecture

Registros relacionados