A Data-Driven Approach to Dataflow-Aware Online Scheduling for Graph Neural Network Inference

Puigdemont, Pol; Russo, Enrico; Wassington, Axel; Das, Abhijit; Abadal, Sergi; Palesi, Maurizio

doi:10.1145/3658617.3697660


arXiv:2411.16342 (cs)

[Submitted on 25 Nov 2024]



Abstract: Graph Neural Networks (GNNs) have shown significant promise in various domains, such as recommendation systems, bioinformatics, and network analysis. However, the irregularity of graph data poses unique challenges for efficient computation, leading to the development of specialized GNN accelerator architectures that surpass traditional CPU and GPU performance. Despite this, the structural diversity of input graphs results in varying performance across different GNN accelerators, depending on their dataflows. This variability in performance due to differing dataflows and graph properties remains largely unexplored, limiting the adaptability of GNN accelerators. To address this, we propose a data-driven framework for dataflow-aware latency prediction in GNN inference. Our approach involves training regressors to predict the latency of executing specific graphs on particular dataflows, using simulations on synthetic graphs. Experimental results indicate that our regressors can predict the optimal dataflow for a given graph with up to 91.28% accuracy and a Mean Absolute Percentage Error (MAPE) of 3.78%. Additionally, we introduce an online scheduling algorithm that uses these regressors to enhance scheduling decisions. Our experiments demonstrate that this algorithm achieves up to $3.17\times$ speedup in mean completion time and $6.26\times$ speedup in mean execution time compared to the best feasible baseline across all datasets.
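As a rough illustration of the ideas named in the abstract, the sketch below shows how MAPE is computed, how per-dataflow latency predictions can be used to pick a dataflow, and how such predictions could drive an online scheduler. It is not the authors' algorithm: the abstract does not specify the scheduling rule, so a simple greedy earliest-predicted-completion rule is assumed here, and the names `mape`, `best_dataflow`, `schedule`, the accelerator names, and the dataflow labels `"A"` and `"B"` are all hypothetical. The predicted latencies stand in for the output of trained regressors.

```python
def mape(actual, predicted):
    """Mean Absolute Percentage Error, in percent."""
    errors = [abs(a - p) / abs(a) for a, p in zip(actual, predicted)]
    return 100.0 * sum(errors) / len(errors)


def best_dataflow(predicted_latency):
    """Return the dataflow with the lowest predicted latency.

    predicted_latency maps a dataflow label to a predicted latency
    (in this sketch, a stand-in for a regressor's output).
    """
    return min(predicted_latency, key=predicted_latency.get)


def schedule(jobs, accelerators):
    """Greedy online scheduling (an assumed rule, not the paper's).

    jobs: list of (arrival_time, {dataflow: predicted_latency}),
          processed in arrival order.
    accelerators: dict mapping accelerator name -> dataflow label.
    Each job goes to the accelerator with the earliest predicted
    completion time, given when that accelerator becomes free.
    Returns a list of (accelerator, predicted_completion_time).
    """
    free_at = {name: 0.0 for name in accelerators}
    plan = []
    for arrival, latency in jobs:
        def finish(name):
            start = max(free_at[name], arrival)
            return start + latency[accelerators[name]]

        chosen = min(accelerators, key=finish)
        free_at[chosen] = finish(chosen)
        plan.append((chosen, free_at[chosen]))
    return plan


# Hypothetical setup: two accelerators, each with a fixed dataflow.
accelerators = {"acc0": "A", "acc1": "B"}
jobs = [
    (0.0, {"A": 5.0, "B": 3.0}),
    (1.0, {"A": 2.0, "B": 4.0}),
    (1.0, {"A": 6.0, "B": 1.0}),
]
plan = schedule(jobs, accelerators)
```

The scheduler accounts for queueing as well as raw latency, so a job may be placed on an accelerator whose dataflow is not the fastest for that graph if the fastest one is busy.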

Comments:	Accepted for ASP-DAC 2025
Subjects:	Machine Learning (cs.LG); Hardware Architecture (cs.AR)
Cite as:	arXiv:2411.16342 [cs.LG]
	(or arXiv:2411.16342v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2411.16342
Related DOI:	https://doi.org/10.1145/3658617.3697660

Submission history

From: Pol Puigdemont
[v1] Mon, 25 Nov 2024 12:38:59 UTC (318 KB)
