On-line and unsupervised learning for codebook based visual recognition

Xu, Jie

doi:10.26190/unsworks/15152

On-line and unsupervised learning for codebook based visual recognition

Download files

Access & Terms of Use

open access
Copyright: Xu, Jie

CC BY-NC-ND 3.0

Abstract

In this thesis we develop unsupervised and on-line learning algorithms for codebook based visual recognition tasks. First, we study the Prob- abilistic Latent Semantic Analysis (PLSA), which is one instance of codebook based recognition models. It has been successfully applied to visual recognition tasks, such as image categorization, action recog- nition, etc. However it has been learned mainly in batch mode, and therefore it cannot handle the data that arrives sequentially. We pro- pose a novel on-line learning algorithm for learning the parameters of the PLSA under that situation. Our contributions are two-fold: (i) an on-line learning algorithm that learns the parameters of the PLSA model from incoming data; (ii) a codebook adaptation algorithm that can capture the full characteristics of all features during the learn- ing. Experimental results demonstrate that the proposed algorithm can handle sequentially arriving data that the batch PLSA learning cannot cope with. We then look at the Implicit Shape Model (ISM) for object detec- tion. ISM is a codebook based model in which object information is retained in codebooks. Existing ISM based methods require manual labeling of training data. We propose an algorithm that can label the training data automatically. We also propose a method for identify- ing moving edges in video frames so that object hypotheses can be generated only from the moving edges. We compare the proposed al- gorithm with a background subtraction based moving object detection algorithm. The experimental results demonstrate that the proposed algorithm achieves comparable performance to the background sub- traction based counterpart, and it even outperforms the counterpart in complex situations. We then extend the aforementioned batch algorithm for on-line learn- ing. We propose an on-line training data collection algorithm and also an on-line codebook based object detector. We evaluate the algorithm on three video datasets. The experimental results demonstrate that our algorithm outperforms the state-of-the-art on-line conservative learning algorithm.

Persistent link to this record

http://hdl.handle.net/1959.4/51513

DOI

https://doi.org/10.26190/unsworks/15152

Author(s)

Xu, Jie

Supervisor(s)

Wang, Yang

Wang, Wei

Ye, Getian

Publication Year

2011

Resource Type

Thesis

Degree Type

PhD Doctorate

UNSW Faculty

Files

whole.pdf

1.7 MB

Adobe Portable Document Format

View full record Show statistics

Library

On-line and unsupervised learning for codebook based visual recognition

Access & Terms of Use

Altmetric

Abstract

Persistent link to this record

DOI

Link to Publisher Version

Link to Open Access Version

Additional Link

Author(s)

Supervisor(s)

Creator(s)

Editor(s)

Translator(s)

Curator(s)

Designer(s)

Arranger(s)

Composer(s)

Recordist(s)

Conference Proceedings Editor(s)

Other Contributor(s)

Corporate/Industry Contributor(s)

Publication Year

Resource Type

Degree Type

UNSW Faculty

Files

Related dataset(s)