ARES: A Mapping Framework of DNNs Towards Diverse PIMs with General Abstractions

X Cui, S Zheng, T Jia, L Ye… - 2023 IEEE/ACM …, 2023 - ieeexplore.ieee.org
X Cui, S Zheng, T Jia, L Ye, Y Liang
2023 IEEE/ACM International Conference on Computer Aided Design …, 2023ieeexplore.ieee.org
Numerous architectures based on processing-in-memory (PIM) have recently emerged,
exhibiting diversity in memory types, compute functions, memory mapping constraints, etc.
To effectively utilize PIM hardware for deploying deep neural networks (DNNs),
programmers face the challenge of mapping computations and data across multiple memory
arrays, scheduling computation and data transfers, while adhering to various hardware
constraints. Existing mapping approaches, however, are tailored to specific architectures …
Numerous architectures based on processing-in-memory (PIM) have recently emerged, exhibiting diversity in memory types, compute functions, memory mapping constraints, etc. To effectively utilize PIM hardware for deploying deep neural networks (DNNs), programmers face the challenge of mapping computations and data across multiple memory arrays, scheduling computation and data transfers, while adhering to various hardware constraints. Existing mapping approaches, however, are tailored to specific architectures and lack a general formulation for mapping optimization, limiting their applicability and performance. In this paper, we present ARES, a comprehensive mapping framework designed for diverse PIM architectures. The core of the framework is hardware abstractions for PIMs, which is inspired by the fact that DNNs on PIM hardware can be represented by a tensorized compute function and data layout constraints in the memory array. This abstraction forms the basis for constructing a mapping space that encompasses both compute and memory constraints. Through exploration of this mapping space, we derive efficient mapping strategies tailored to different PIM hardware configurations. Experimental evaluation conducted on four distinct hardware architectures demonstrates that compared to state-of-the-art mapping methods, ARES yields up to a 70% speed improvement for single operator mapping and a 50% speedup for overall network mapping.
ieeexplore.ieee.org