Interpretable Video Captioning via Trajectory Structured Localization | IEEE Conference Publication | IEEE Xplore