Authors:
Francesco Ragusa
1
;
Antonino Furnari
2
;
Sebastiano Battiato
2
;
Giovanni Maria Signorello
3
and
Giovanni Maria Farinella
4
Affiliations:
1
DMI-IPLab, University of Catania, Italy, Xenia Gestione Documentale s.r.l. - Xenia Progetti s.r.l., Acicastello, Catania and Italy
;
2
DMI-IPLab, University of Catania and Italy
;
3
CUTGANA, University of Catania and Italy
;
4
DMI-IPLab, University of Catania, Italy, CUTGANA, University of Catania and Italy
Keyword(s):
Egocentric Vision, First Person Vision, Object Detection, Object Recognition.
Related
Ontology
Subjects/Areas/Topics:
Applications and Services
;
Computer Vision, Visualization and Computer Graphics
;
Imaging for Cultural Heritage (Modeling/Simulation, Virtual Restoration)
Abstract:
We consider the problem of the detection and recognition of points of interest in cultural sites. We observe that a “point of interest” in a cultural site may be either an object or an environment and highlight that the use of an object detector is beneficial to recognize points of interest which occupy a small part of the frame. To study the role of objects in the recognition of points of interest, we augment the labelling of the UNICT-VEDI dataset to include bounding box annotations for 57 points of interest. We hence compare two approaches to perform the recognition of points of interest. The first method is based on the processing of the whole frame during recognition. The second method employs a YOLO object detector and a selection procedure to determine the currently observed point of interest. Our experiments suggest that further improvements on point of interest recognition can be achieved fusing the two methodologies. Indeed, the results show the complementarity of the two a
pproaches on the UNICT-VEDI dataset.
(More)