Where to Look Next: Unsupervised Active Visual Exploration on 360{\deg} Input

Seifi, Soroush; Tuytelaars, Tinne

Computer Science > Computer Vision and Pattern Recognition

arXiv:1909.10304v1 (cs)

[Submitted on 23 Sep 2019 (this version), latest version 28 Nov 2019 (v2)]

Title:Where to Look Next: Unsupervised Active Visual Exploration on 360° Input

Authors:Soroush Seifi, Tinne Tuytelaars

View PDF

Abstract:We address the problem of active visual exploration of large 360° inputs. In our setting an active agent with a limited camera bandwidth explores its 360° environment by changing its viewing direction at limited discrete time steps. As such, it observes the world as a sequence of narrow field-of-view 'glimpses', deciding for itself where to look next. Our proposed method exceeds previous works' performance by a significant margin without the need for deep reinforcement learning or training separate networks as sidekicks. A key component of our system are the spatial memory maps that make the system aware of the glimpses' orientations (locations in the 360° image). Further, we stress the advantages of retina-like glimpses when the agent's sensor bandwidth and time-steps are limited. Finally, we use our trained model to do classification of the whole scene using only the information observed in the glimpses.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:1909.10304 [cs.CV]
	(or arXiv:1909.10304v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1909.10304

Submission history

From: Soroush Seifi [view email]
[v1] Mon, 23 Sep 2019 11:50:46 UTC (1,761 KB)
[v2] Thu, 28 Nov 2019 10:38:02 UTC (1,761 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2019-09

Change to browse by:

cs
cs.AI
cs.LG
cs.RO

References & Citations

DBLP - CS Bibliography

listing | bibtex

Tinne Tuytelaars

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Where to Look Next: Unsupervised Active Visual Exploration on 360° Input

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Where to Look Next: Unsupervised Active Visual Exploration on 360° Input

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators