End-to-End DNN Inference on a Massively Parallel Analog In Memory Computing Architecture

Bruschi, Nazareno; Tagliavini, Giuseppe; Garofalo, Angelo; Conti, Francesco; Boybat, Irem; Benini, Luca; Rossi, Davide

Computer Science > Distributed, Parallel, and Cluster Computing

arXiv:2211.12877 (cs)

[Submitted on 23 Nov 2022]

Title:End-to-End DNN Inference on a Massively Parallel Analog In Memory Computing Architecture

Authors:Nazareno Bruschi, Giuseppe Tagliavini, Angelo Garofalo, Francesco Conti, Irem Boybat, Luca Benini, Davide Rossi

View PDF

Abstract:The demand for computation resources and energy efficiency of Convolutional Neural Networks (CNN) applications requires a new paradigm to overcome the "Memory Wall". Analog In-Memory Computing (AIMC) is a promising paradigm since it performs matrix-vector multiplications, the critical kernel of many ML applications, in-place in the analog domain within memory arrays structured as crossbars of memory cells. However, several factors limit the full exploitation of this technology, including the physical fabrication of the crossbar devices, which constrain the memory capacity of a single array. Multi-AIMC architectures have been proposed to overcome this limitation, but they have been demonstrated only for tiny and custom CNNs or performing some layers off-chip. In this work, we present the full inference of an end-to-end ResNet-18 DNN on a 512-cluster heterogeneous architecture coupling a mix of AIMC cores and digital RISC-V cores, achieving up to 20.2 TOPS. Moreover, we analyze the mapping of the network on the available non-volatile cells, compare it with state-of-the-art models, and derive guidelines for next-generation many-core architectures based on AIMC devices.

Subjects:	Distributed, Parallel, and Cluster Computing (cs.DC)
Cite as:	arXiv:2211.12877 [cs.DC]
	(or arXiv:2211.12877v1 [cs.DC] for this version)
	https://doi.org/10.48550/arXiv.2211.12877

Submission history

From: Nazareno Bruschi [view email]
[v1] Wed, 23 Nov 2022 11:32:07 UTC (646 KB)

Computer Science > Distributed, Parallel, and Cluster Computing

Title:End-to-End DNN Inference on a Massively Parallel Analog In Memory Computing Architecture

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Distributed, Parallel, and Cluster Computing

Title:End-to-End DNN Inference on a Massively Parallel Analog In Memory Computing Architecture

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators