The Helmholtz Method: Using Perceptual Compression to Reduce Machine Learning Complexity

Friedland, Gerald; Wang, Jingkang; Jia, Ruoxi; Li, Bo

Computer Science > Computer Vision and Pattern Recognition

arXiv:1807.10569 (cs)

[Submitted on 10 Jul 2018]

Title:The Helmholtz Method: Using Perceptual Compression to Reduce Machine Learning Complexity

Authors:Gerald Friedland, Jingkang Wang, Ruoxi Jia, Bo Li

View PDF

Abstract:This paper proposes a fundamental answer to a frequently asked question in multimedia computing and machine learning: Do artifacts from perceptual compression contribute to error in the machine learning process and if so, how much? Our approach to the problem is a reinterpretation of the Helmholtz Free Energy formula from physics to explain the relationship between content and noise when using sensors (such as cameras or microphones) to capture multimedia data. The reinterpretation allows a bit-measurement of the noise contained in images, audio, and video by combining a classifier with perceptual compression, such as JPEG or MP3. Our experiments on CIFAR-10 as well as Fraunhofer's IDMT-SMT-Audio-Effects dataset indicate that, at the right quality level, perceptual compression is actually not harmful but contributes to a significant reduction of complexity of the machine learning process. That is, our noise quantification method can be used to speed up the training of deep learning classifiers significantly while maintaining, or sometimes even improving, overall classification accuracy. Moreover, our results provide insights into the reasons for the success of deep learning.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Applied Physics (physics.app-ph)
Cite as:	arXiv:1807.10569 [cs.CV]
	(or arXiv:1807.10569v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1807.10569

Submission history

From: Gerald Friedland [view email]
[v1] Tue, 10 Jul 2018 01:49:50 UTC (725 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:The Helmholtz Method: Using Perceptual Compression to Reduce Machine Learning Complexity

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:The Helmholtz Method: Using Perceptual Compression to Reduce Machine Learning Complexity

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators