Paper
17 March 2015 Learning visual balance from large-scale datasets of aesthetically highly rated images
Ali Jahanian, S.V.N. Vishwanathan, Jan P. Allebach
Author Affiliations +
Proceedings Volume 9394, Human Vision and Electronic Imaging XX; 93940Y (2015) https://doi.org/10.1117/12.2084548
Event: SPIE/IS&T Electronic Imaging, 2015, San Francisco, California, United States
Abstract
The concept of visual balance is innate for humans, and influences how we perceive visual aesthetics and cognize harmony. Although visual balance is a vital principle of design and taught in schools of designs, it is barely quantified. On the other hand, with emergence of automantic/semi-automatic visual designs for self-publishing, learning visual balance and computationally modeling it, may escalate aesthetics of such designs. In this paper, we present how questing for understanding visual balance inspired us to revisit one of the well-known theories in visual arts, the so called theory of “visual rightness”, elucidated by Arnheim. We define Arnheim’s hypothesis as a design mining problem with the goal of learning visual balance from work of professionals. We collected a dataset of 120K images that are aesthetically highly rated, from a professional photography website. We then computed factors that contribute to visual balance based on the notion of visual saliency. We fitted a mixture of Gaussians to the saliency maps of the images, and obtained the hotspots of the images. Our inferred Gaussians align with Arnheim’s hotspots, and confirm his theory. Moreover, the results support the viability of the center of mass, symmetry, as well as the Rule of Thirds in our dataset.
© (2015) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Ali Jahanian, S.V.N. Vishwanathan, and Jan P. Allebach "Learning visual balance from large-scale datasets of aesthetically highly rated images", Proc. SPIE 9394, Human Vision and Electronic Imaging XX, 93940Y (17 March 2015); https://doi.org/10.1117/12.2084548
Lens.org Logo
CITATIONS
Cited by 9 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Visualization

Photography

Expectation maximization algorithms

Visual process modeling

Mining

Alternate lighting of surfaces

Parallel computing

Back to Top