Learning rich features for image manipulation detection

P Zhou, X Han, VI Morariu… - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
Proceedings of the IEEE conference on computer vision and …, 2018openaccess.thecvf.com
Image manipulation detection is different from traditional semantic object detection because
it pays more attention to tampering artifacts than to image content, which suggests that richer
features need to be learned. We propose a two-stream Faster R-CNN network and train it
end-to-end to detect the tampered regions given a manipulated image. One of the two
streams is an RGB stream whose purpose is to extract features from the RGB image input to
find tampering artifacts like strong contrast difference, unnatural tampered boundaries, and …
Abstract
Image manipulation detection is different from traditional semantic object detection because it pays more attention to tampering artifacts than to image content, which suggests that richer features need to be learned. We propose a two-stream Faster R-CNN network and train it end-to-end to detect the tampered regions given a manipulated image. One of the two streams is an RGB stream whose purpose is to extract features from the RGB image input to find tampering artifacts like strong contrast difference, unnatural tampered boundaries, and so on. The other is a noise stream that leverages the noise features extracted from a steganalysis rich model filter layer to discover the noise inconsistency between authentic and tampered regions. We then fuse features from the two streams through a bilinear pooling layer to further incorporate spatial co-occurrence of these two modalities. Experiments on four standard image manipulation datasets demonstrate that our two-stream framework outperforms each individual stream, and also achieves state-of-the-art performance compared to alternative methods with robustness to resizing and compression.
openaccess.thecvf.com
Showing the best result for this search. See all results