Learning Edit Machines for Robust Multimodal Understanding | IEEE Conference Publication | IEEE Xplore