Fast convolutional neural networks in low density FPGAs using zero-skipping and weight pruning

Véstias, Mário; Duarte, Rui Policarpo; De Sousa, Jose; Cláudio de Campos Neto, Horácio

Fast convolutional neural networks in low density FPGAs using zero-skipping and weight pruning

Detalhes bibliográficos
Autor(a) principal:	Véstias, Mário
Data de Publicação:	2019
Outros Autores:	Duarte, Rui Policarpo, De Sousa, Jose, Cláudio de Campos Neto, Horácio
Tipo de documento:	Artigo
Idioma:	eng
Título da fonte:	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo:	http://hdl.handle.net/10400.21/10735
Resumo:	Edge devices are becoming smarter with the integration of machine learning methods, such as deep learning, and are therefore used in many application domains where decisions have to be made without human intervention. Deep learning and, in particular, convolutional neural networks (CNN) are more efﬁcient than previous algorithms for several computer vision applications such as security and surveillance, where image and video analysis are required. This better efﬁciency comes with a cost of high computation and memory requirements. Hence, running CNNs in embedded computing devices is a challenge for both algorithm and hardware designers. New processing devices, dedicated system architectures and optimization of the networks have been researched to deal with these computation requirements. In this paper, we improve the inference execution times of CNNs in low density FPGAs (Field-Programmable Gate Arrays) using ﬁxed-point arithmetic, zero-skipping and weight pruning. The developed architecture supports the execution of large CNNs in FPGA devices with reduced on-chip memory and computing resources. With the proposed architecture, it is possible to infer an image in AlexNet in 2.9 ms in a ZYNQ7020 and 1.0 ms in a ZYNQ7045 with less than 1% accuracy degradation. These results improve previous state-of-the-art architectures for CNN inference.

Metadados do item

id	RCAP_412607c31530d381a97105f0710eb021
oai_identifier_str	oai:repositorio.ipl.pt:10400.21/10735
network_acronym_str	RCAP
network_name_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str	7160
spelling	Fast convolutional neural networks in low density FPGAs using zero-skipping and weight pruningDeep learningConvolutional neural networkSmart edge devicesZero-skippingPruningFPGAEdge devices are becoming smarter with the integration of machine learning methods, such as deep learning, and are therefore used in many application domains where decisions have to be made without human intervention. Deep learning and, in particular, convolutional neural networks (CNN) are more efﬁcient than previous algorithms for several computer vision applications such as security and surveillance, where image and video analysis are required. This better efﬁciency comes with a cost of high computation and memory requirements. Hence, running CNNs in embedded computing devices is a challenge for both algorithm and hardware designers. New processing devices, dedicated system architectures and optimization of the networks have been researched to deal with these computation requirements. In this paper, we improve the inference execution times of CNNs in low density FPGAs (Field-Programmable Gate Arrays) using ﬁxed-point arithmetic, zero-skipping and weight pruning. The developed architecture supports the execution of large CNNs in FPGA devices with reduced on-chip memory and computing resources. With the proposed architecture, it is possible to infer an image in AlexNet in 2.9 ms in a ZYNQ7020 and 1.0 ms in a ZYNQ7045 with less than 1% accuracy degradation. These results improve previous state-of-the-art architectures for CNN inference.MDPIRCIPLVéstias, MárioDuarte, Rui PolicarpoDe Sousa, JoseCláudio de Campos Neto, Horácio2019-11-25T11:15:23Z2019-11-092019-11-09T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.21/10735engVÉSTIAS, Mário P.; [et al] – Fast convolutional neural networks in low density FPGAs using zero-skipping and weight pruning. Electronics. ISSN 2079-9292. Vol. 8, N.º 11 (2019), pp. 1-242079-929210.3390/electronics8111321info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-08-03T10:01:07Zoai:repositorio.ipl.pt:10400.21/10735Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T20:19:05.647785Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv	Fast convolutional neural networks in low density FPGAs using zero-skipping and weight pruning
title	Fast convolutional neural networks in low density FPGAs using zero-skipping and weight pruning
spellingShingle	Fast convolutional neural networks in low density FPGAs using zero-skipping and weight pruning Véstias, Mário Deep learning Convolutional neural network Smart edge devices Zero-skipping Pruning FPGA
title_short	Fast convolutional neural networks in low density FPGAs using zero-skipping and weight pruning
title_full	Fast convolutional neural networks in low density FPGAs using zero-skipping and weight pruning
title_fullStr	Fast convolutional neural networks in low density FPGAs using zero-skipping and weight pruning
title_full_unstemmed	Fast convolutional neural networks in low density FPGAs using zero-skipping and weight pruning
title_sort	Fast convolutional neural networks in low density FPGAs using zero-skipping and weight pruning
author	Véstias, Mário
author_facet	Véstias, Mário Duarte, Rui Policarpo De Sousa, Jose Cláudio de Campos Neto, Horácio
author_role	author
author2	Duarte, Rui Policarpo De Sousa, Jose Cláudio de Campos Neto, Horácio
author2_role	author author author
dc.contributor.none.fl_str_mv	RCIPL
dc.contributor.author.fl_str_mv	Véstias, Mário Duarte, Rui Policarpo De Sousa, Jose Cláudio de Campos Neto, Horácio
dc.subject.por.fl_str_mv	Deep learning Convolutional neural network Smart edge devices Zero-skipping Pruning FPGA
topic	Deep learning Convolutional neural network Smart edge devices Zero-skipping Pruning FPGA
description	Edge devices are becoming smarter with the integration of machine learning methods, such as deep learning, and are therefore used in many application domains where decisions have to be made without human intervention. Deep learning and, in particular, convolutional neural networks (CNN) are more efﬁcient than previous algorithms for several computer vision applications such as security and surveillance, where image and video analysis are required. This better efﬁciency comes with a cost of high computation and memory requirements. Hence, running CNNs in embedded computing devices is a challenge for both algorithm and hardware designers. New processing devices, dedicated system architectures and optimization of the networks have been researched to deal with these computation requirements. In this paper, we improve the inference execution times of CNNs in low density FPGAs (Field-Programmable Gate Arrays) using ﬁxed-point arithmetic, zero-skipping and weight pruning. The developed architecture supports the execution of large CNNs in FPGA devices with reduced on-chip memory and computing resources. With the proposed architecture, it is possible to infer an image in AlexNet in 2.9 ms in a ZYNQ7020 and 1.0 ms in a ZYNQ7045 with less than 1% accuracy degradation. These results improve previous state-of-the-art architectures for CNN inference.
publishDate	2019
dc.date.none.fl_str_mv	2019-11-25T11:15:23Z 2019-11-09 2019-11-09T00:00:00Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/article
format	article
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://hdl.handle.net/10400.21/10735
url	http://hdl.handle.net/10400.21/10735
dc.language.iso.fl_str_mv	eng
language	eng
dc.relation.none.fl_str_mv	VÉSTIAS, Mário P.; [et al] – Fast convolutional neural networks in low density FPGAs using zero-skipping and weight pruning. Electronics. ISSN 2079-9292. Vol. 8, N.º 11 (2019), pp. 1-24 2079-9292 10.3390/electronics8111321
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	MDPI
publisher.none.fl_str_mv	MDPI
dc.source.none.fl_str_mv	reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP
instname_str	Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str	RCAAP
institution	RCAAP
reponame_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_	1799133456702111744

Fast convolutional neural networks in low density FPGAs using zero-skipping and weight pruning

Registros relacionados