Pruning in time (PIT): a lightweight network architecture optimizer for temporal convolutional networks

M Risso, A Burrello, DJ Pagliari, F Conti… - 2021 58th ACM/IEEE …, 2021 - ieeexplore.ieee.org
2021 58th ACM/IEEE Design Automation Conference (DAC), 2021ieeexplore.ieee.org
Temporal Convolutional Networks (TCNs) are promising Deep Learning models for time-
series processing tasks. One key feature of TCNs is time-dilated convolution, whose
optimization requires extensive experimentation. We propose an automatic dilation
optimizer, which tackles the problem as a weight pruning on the time-axis, and learns
dilation factors together with weights, in a single training. Our method reduces the model
size and inference latency on a real SoC hardware target by up to 7.4× and 3×, respectively …
Temporal Convolutional Networks (TCNs) are promising Deep Learning models for time-series processing tasks. One key feature of TCNs is time-dilated convolution, whose optimization requires extensive experimentation. We propose an automatic dilation optimizer, which tackles the problem as a weight pruning on the time-axis, and learns dilation factors together with weights, in a single training. Our method reduces the model size and inference latency on a real SoC hardware target by up to 7.4× and 3×, respectively with no accuracy drop compared to a network without dilation. It also yields a rich set of Pareto-optimal TCNs starting from a single model, outperforming hand-designed solutions in both size and accuracy.
ieeexplore.ieee.org
Showing the best result for this search. See all results