LAMPS: A Layer-wised Mixed-Precision-and-Sparsity Accelerator for NAS-Optimized CNNs on FPGA
S Yang, C Ding, M Huang, K Li, C Li… - 2024 IEEE 32nd …, 2024 - ieeexplore.ieee.org
The increasing model size and computational load of convolutional neural networks (CNNs) pose a grand challenge to deploying CNN models on edge computing devices. To further improve performance without significant accuracy loss, this paper develops a neural architecture search (NAS) method that yields a layer-wise mixed-precision-and-sparsity (LAMPS) CNN. However, this optimization cannot be fully exploited by, or directly mapped to, existing AI accelerators because of the irregular computation over sparse, multi-precision data. To tackle this challenge, this work proposes a LAMPS vector systolic accelerator and demonstrates state-of-the-art results. Experimental results show that the LAMPS accelerator on a Xilinx ZCU102 achieves an average performance of 756.83 GOPS and 470.25 GOPS when accelerating the NAS-optimized VGG16 and ResNet18, respectively, yielding a 1.3-6.0x speed-up over state-of-the-art FPGA accelerators.
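
To make the "layer-wise mixed-precision-and-sparsity" idea concrete, below is a minimal Python sketch of what such a per-layer configuration could look like. It is purely illustrative: the helper functions (quantize_symmetric, prune_magnitude) and the per-layer bitwidths and sparsity targets in layer_config are invented for this sketch, not the configurations found by the paper's NAS, and the paper's actual search algorithm and accelerator mapping are not reproduced here.

import numpy as np

def quantize_symmetric(w: np.ndarray, bits: int) -> np.ndarray:
    """Symmetric uniform quantization of weights to a signed integer grid."""
    qmax = 2 ** (bits - 1) - 1
    scale = max(float(np.abs(w).max()), 1e-12) / qmax
    q = np.clip(np.round(w / scale), -qmax, qmax)
    return q * scale  # dequantized ("fake-quantized") weights

def prune_magnitude(w: np.ndarray, sparsity: float) -> np.ndarray:
    """Zero out the smallest-magnitude fraction of weights."""
    k = int(sparsity * w.size)
    if k == 0:
        return w
    threshold = np.partition(np.abs(w).ravel(), k - 1)[k - 1]
    return np.where(np.abs(w) <= threshold, 0.0, w)

# Hypothetical per-layer configuration that a NAS search might emit:
# each layer gets its own weight bitwidth and sparsity target.
layer_config = {
    "conv1": {"bits": 8, "sparsity": 0.0},
    "conv2": {"bits": 4, "sparsity": 0.5},
    "fc":    {"bits": 4, "sparsity": 0.75},
}

rng = np.random.default_rng(0)
weights = {name: rng.standard_normal((64, 64)) for name in layer_config}

for name, cfg in layer_config.items():
    w = prune_magnitude(weights[name], cfg["sparsity"])  # prune first...
    w = quantize_symmetric(w, cfg["bits"])               # ...then quantize (zeros stay zero)
    actual_sparsity = float((w == 0).mean())
    print(f"{name}: {cfg['bits']}-bit, target sparsity {cfg['sparsity']:.2f}, "
          f"actual {actual_sparsity:.2f}")

Note that the resulting tensors mix different bitwidths and sparsity levels across layers, which is exactly the irregularity the abstract says a fixed-precision, dense accelerator cannot exploit and the proposed vector systolic design is built to handle.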