A Count-sketch to Reduce Memory Consumption when Training a Model with Gradient Descent | IEEE Conference Publication | IEEE Xplore