Efficient Memory Organization for DNN Hardware Accelerator Implementation on PSoC

Rios-Navarro, Antonio and Gutierrez-Galan, Daniel and Dominguez-Morales, Juan Pedro and Piñero-Fuentes, Enrique and Duran-Lopez, Lourdes and Tapiador-Morales, Ricardo and Dominguez-Morales, Manuel Jesús (2021) Efficient Memory Organization for DNN Hardware Accelerator Implementation on PSoC. Electronics, 10 (1). p. 94. ISSN 2079-9292

[thumbnail of electronics-10-00094-v4.pdf] Text
electronics-10-00094-v4.pdf - Published Version

Download (361kB)

Abstract

The use of deep learning solutions in different disciplines is increasing and their algorithms are computationally expensive in most cases. For this reason, numerous hardware accelerators have appeared to compute their operations efficiently in parallel, achieving higher performance and lower latency. These algorithms need large amounts of data to feed each of their computing layers, which makes it necessary to efficiently handle the data transfers that feed and collect the information to and from the accelerators. For the implementation of these accelerators, hybrid devices are widely used, which have an embedded computer, where an operating system can be run, and a field-programmable gate array (FPGA), where the accelerator can be deployed. In this work, we present a software API that efficiently organizes the memory, preventing reallocating data from one memory area to another, which improves the native Linux driver with a 85% speed-up and reduces the frame computing time by 28% in a real application.

Item Type: Article
Uncontrolled Keywords: deep learning; embedded systems; PSoC; memory organization; FPGA; hardware accelerator
Subjects: STM Repository > Engineering
Depositing User: Managing Editor
Date Deposited: 05 Jun 2023 04:29
Last Modified: 06 Mar 2024 04:16
URI: http://classical.goforpromo.com/id/eprint/689

Actions (login required)

View Item
View Item