Implementing the Himeno benchmark with CUDA on GPU clusters | IEEE Conference Publication | IEEE Xplore