A fast and scalable architecture to run convolutional neural networks in low density FPGAs
Autor(a) principal: | |
---|---|
Data de Publicação: | 2020 |
Outros Autores: | , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10400.21/12279 |
Resumo: | Este trabalho foi financiado pelo Concurso Anual para Projetos de Investigação, Desenvolvimento, Inovação e Criação Artística (IDI&CA) 2016 do Instituto Politécnico de Lisboa. Código de referência IPL/2019/inCNeuraINet_ISEL |
id |
RCAP_634c261c91ef5e99af2975e0dd856e03 |
---|---|
oai_identifier_str |
oai:repositorio.ipl.pt:10400.21/12279 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
A fast and scalable architecture to run convolutional neural networks in low density FPGAsDeep learningConvolutional neural networkSmart edge devicesFPGAEste trabalho foi financiado pelo Concurso Anual para Projetos de Investigação, Desenvolvimento, Inovação e Criação Artística (IDI&CA) 2016 do Instituto Politécnico de Lisboa. Código de referência IPL/2019/inCNeuraINet_ISELDeep learning and, in particular, convolutional neural networks (CNN) achieve very good results on several computer vision applications like security and surveillance, where image and video analysis are required. These networks are quite demanding in terms of computation and memory and therefore are usually implemented in high-performance computing platforms or devices. Running CNNs in embedded platforms or devices with low computational and memory resources requires a careful optimization of system architectures and algorithms to obtain very efficient designs. In this context, Field Programmable Gate Arrays (FPGA) can achieve this efficiency since the programmable hardware fabric can be tailored for each specific network. In this paper, a very efficient configurable architecture for CNN inference targeting any density FPGAs is described. The architecture considers fixed-point arithmetic and image batch to reduce computational, memory and memory bandwidth requirements without compromising network accuracy. The developed architecture supports the execution of large CNNs in any FPGA devices including those with small on-chip memory size and logic resources. With the proposed architecture, it is possible to infer an image in AlexNet in 4.3 ms in a ZYNQ7020 and 1.2 ms in a ZYNQ7045.ElsevierRCIPLVéstias, MárioDuarte, RuiDe Sousa, JoseNeto, Horácio C2020-10-07T14:07:34Z2020-092020-09-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.21/12279engVÉSTIAS, Mário P.; [et al] – A fast and scalable architecture to run convolutional neural networks in low density FPGAs. Microprocessors and Microsystems. ISSN 0141-9331. Vol. 77 (2020), pp. 1-150141-933110.1016/j.micpro.2020.103136metadata only accessinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-08-03T10:04:53Zoai:repositorio.ipl.pt:10400.21/12279Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T20:20:24.013490Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
A fast and scalable architecture to run convolutional neural networks in low density FPGAs |
title |
A fast and scalable architecture to run convolutional neural networks in low density FPGAs |
spellingShingle |
A fast and scalable architecture to run convolutional neural networks in low density FPGAs Véstias, Mário Deep learning Convolutional neural network Smart edge devices FPGA |
title_short |
A fast and scalable architecture to run convolutional neural networks in low density FPGAs |
title_full |
A fast and scalable architecture to run convolutional neural networks in low density FPGAs |
title_fullStr |
A fast and scalable architecture to run convolutional neural networks in low density FPGAs |
title_full_unstemmed |
A fast and scalable architecture to run convolutional neural networks in low density FPGAs |
title_sort |
A fast and scalable architecture to run convolutional neural networks in low density FPGAs |
author |
Véstias, Mário |
author_facet |
Véstias, Mário Duarte, Rui De Sousa, Jose Neto, Horácio C |
author_role |
author |
author2 |
Duarte, Rui De Sousa, Jose Neto, Horácio C |
author2_role |
author author author |
dc.contributor.none.fl_str_mv |
RCIPL |
dc.contributor.author.fl_str_mv |
Véstias, Mário Duarte, Rui De Sousa, Jose Neto, Horácio C |
dc.subject.por.fl_str_mv |
Deep learning Convolutional neural network Smart edge devices FPGA |
topic |
Deep learning Convolutional neural network Smart edge devices FPGA |
description |
Este trabalho foi financiado pelo Concurso Anual para Projetos de Investigação, Desenvolvimento, Inovação e Criação Artística (IDI&CA) 2016 do Instituto Politécnico de Lisboa. Código de referência IPL/2019/inCNeuraINet_ISEL |
publishDate |
2020 |
dc.date.none.fl_str_mv |
2020-10-07T14:07:34Z 2020-09 2020-09-01T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10400.21/12279 |
url |
http://hdl.handle.net/10400.21/12279 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
VÉSTIAS, Mário P.; [et al] – A fast and scalable architecture to run convolutional neural networks in low density FPGAs. Microprocessors and Microsystems. ISSN 0141-9331. Vol. 77 (2020), pp. 1-15 0141-9331 10.1016/j.micpro.2020.103136 |
dc.rights.driver.fl_str_mv |
metadata only access info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
metadata only access |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Elsevier |
publisher.none.fl_str_mv |
Elsevier |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799133472499957760 |