Deep Bilateral Learning for Real-Time Image Enhancement

Gharbi, Michaël; Chen, Jiawen; Barron, Jonathan T.; Hasinoff, Samuel W.; Durand, Frédo

doi:10.1145/3072959.3073592

arXiv:1707.02880 (cs)

[Submitted on 10 Jul 2017 (v1), last revised 22 Aug 2017 (this version, v2)]

Abstract: Performance is a critical challenge in mobile image processing. Given a reference imaging pipeline, or even human-adjusted pairs of images, we seek to reproduce the enhancements and enable real-time evaluation. For this, we introduce a new neural network architecture inspired by bilateral grid processing and local affine color transforms. Using pairs of input/output images, we train a convolutional neural network to predict the coefficients of a locally-affine model in bilateral space. Our architecture learns to make local, global, and content-dependent decisions to approximate the desired image transformation. At runtime, the neural network consumes a low-resolution version of the input image, produces a set of affine transformations in bilateral space, upsamples those transformations in an edge-preserving fashion using a new slicing node, and then applies those upsampled transformations to the full-resolution image. Our algorithm processes high-resolution images on a smartphone in milliseconds, provides a real-time viewfinder at 1080p resolution, and matches the quality of state-of-the-art approximation techniques on a large class of image operators. Unlike previous work, our model is trained off-line from data and therefore does not require access to the original operator at runtime. This allows our model to learn complex, scene-dependent transformations for which no reference implementation is available, such as the photographic edits of a human retoucher.
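
The runtime steps described in the abstract (upsample the affine transformations from bilateral space, then apply them to the full-resolution image) can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function name `slice_and_apply`, the grid layout `(gh, gw, gd, 3, 4)`, the use of plain trilinear interpolation, and the caller-supplied `guide` map are all assumptions made for illustration, and the network that would predict the grid from the low-resolution input is omitted entirely.

```python
import numpy as np


def slice_and_apply(grid, image, guide):
    """Sample a per-pixel 3x4 affine color transform from a bilateral grid
    and apply it to the full-resolution image.

    grid:  (gh, gw, gd, 3, 4) affine coefficients (hypothetical layout)
    image: (H, W, 3) full-resolution RGB values
    guide: (H, W) guide values in [0, 1] that select the grid depth
    """
    gh, gw, gd = grid.shape[:3]
    H, W = guide.shape

    # Continuous grid coordinates for every full-resolution pixel.
    ys = np.repeat(np.linspace(0, gh - 1, H)[:, None], W, axis=1)
    xs = np.repeat(np.linspace(0, gw - 1, W)[None, :], H, axis=0)
    zs = np.clip(guide, 0.0, 1.0) * (gd - 1)

    # Integer corners and fractional offsets for trilinear interpolation.
    y0, x0, z0 = (np.floor(c).astype(int) for c in (ys, xs, zs))
    y1 = np.minimum(y0 + 1, gh - 1)
    x1 = np.minimum(x0 + 1, gw - 1)
    z1 = np.minimum(z0 + 1, gd - 1)
    fy, fx, fz = ys - y0, xs - x0, zs - z0

    # Blend the eight neighboring grid cells into one transform per pixel.
    coeffs = np.zeros((H, W, 3, 4))
    for yi, wy in ((y0, 1 - fy), (y1, fy)):
        for xi, wx in ((x0, 1 - fx), (x1, fx)):
            for zi, wz in ((z0, 1 - fz), (z1, fz)):
                weight = (wy * wx * wz)[..., None, None]
                coeffs += weight * grid[yi, xi, zi]

    # Apply each affine transform to its pixel in homogeneous coordinates.
    homogeneous = np.concatenate([image, np.ones((H, W, 1))], axis=2)
    return np.einsum("hwij,hwj->hwi", coeffs, homogeneous)
```

Because the depth coordinate comes from the guide map, pixels on opposite sides of an intensity edge read from different grid cells even when they are spatially adjacent, which is what makes the upsampling edge-preserving. In this sketch the grid would stay small while `image` and `guide` are at full resolution, so the per-pixel work is limited to an interpolation and one 3x4 matrix product.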

Comments:	12 pages, 14 figures, SIGGRAPH 2017
Subjects:	Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1707.02880 [cs.GR]
	(or arXiv:1707.02880v2 [cs.GR] for this version)
	https://doi.org/10.48550/arXiv.1707.02880
Journal reference:	ACM Trans. Graph. 36, 4, Article 118 (2017)
Related DOI:	https://doi.org/10.1145/3072959.3073592

Submission history

From: Michael Gharbi
[v1] Mon, 10 Jul 2017 14:34:06 UTC (5,496 KB)
[v2] Tue, 22 Aug 2017 19:26:08 UTC (5,496 KB)
