RAF: Holistic compilation for deep learning model training

CH Yu, H Fan, G Huang, Z Jia, Y Liu, J Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
As deep learning is pervasive in modern applications, many deep learning frameworks are
presented for deep learning practitioners to develop and train DNN models rapidly.
Meanwhile, as training large deep learning models becomes a trend in recent years, the
training throughput and memory footprint are getting crucial. Accordingly, optimizing training
workloads with compiler optimizations is inevitable and getting more and more attentions.
However, existing deep learning compilers (DLCs) mainly target inference and do not …

RAF: Holistic Compilation for Deep Learning Model Training

C Hao Yu, H Fan, G Huang, Z Jia, Y Liu… - arXiv e …, 2023 - ui.adsabs.harvard.edu
As deep learning is pervasive in modern applications, many deep learning frameworks are
presented for deep learning practitioners to develop and train DNN models rapidly.
Meanwhile, as training large deep learning models becomes a trend in recent years, the
training throughput and memory footprint are getting crucial. Accordingly, optimizing training
workloads with compiler optimizations is inevitable and getting more and more attentions.
However, existing deep learning compilers (DLCs) mainly target inference and do not …
Showing the best results for this search. See all results