Multilingual scene character recognition with co-occurrence of histogram of oriented gradients
Abstract
Automatic machine reading of text in scenes is largely restricted by poor character recognition accuracy. In this paper, we extend the Histogram of Oriented Gradients (HOG) and propose two new feature descriptors, Co-occurrence HOG (Co-HOG) and Convolutional Co-HOG (ConvCo-HOG), for accurate recognition of scene text in different languages. Compared with HOG, which counts the orientation frequency of each single pixel, Co-HOG encodes more spatial contextual information by capturing the co-occurrence of orientation pairs of neighboring pixels. ConvCo-HOG additionally extracts Co-HOG features exhaustively from every possible image patch within a character image to capture more spatial information. The two features have been evaluated extensively on five scene character datasets in three languages: three sets in English, one in Chinese, and one in Bengali. Experiments show that the proposed techniques provide superior scene character recognition accuracy and are capable of recognizing scene text in different scripts and languages.
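The two descriptors can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract does not give the number of orientation bins, the neighbor offsets, any magnitude weighting, the patch size, or how patch features are pooled, so `n_bins`, `offsets`, `patch`, `stride`, and the unweighted counting below are all assumptions chosen for illustration.

```python
import numpy as np


def quantized_orientations(img, n_bins=8):
    """Return a per-pixel gradient-orientation bin index (n_bins is assumed)."""
    img = np.asarray(img, dtype=float)
    gy, gx = np.gradient(img)               # derivatives along rows, then columns
    ang = np.arctan2(gy, gx) % (2 * np.pi)  # orientation in [0, 2*pi)
    bins = np.floor(ang / (2 * np.pi) * n_bins).astype(int)
    return bins % n_bins                    # guard against a bin index of n_bins


def cohog(img, offsets=((0, 1), (1, 0), (1, 1)), n_bins=8):
    """Count co-occurring orientation pairs of neighboring pixels.

    For each (dy, dx) offset (assumed, non-negative in this sketch) an
    n_bins x n_bins matrix counts how often orientation i at a pixel
    appears together with orientation j at the offset neighbor.
    """
    bins = quantized_orientations(img, n_bins)
    h, w = bins.shape
    feats = []
    for dy, dx in offsets:
        a = bins[:h - dy, :w - dx]          # reference pixels
        b = bins[dy:, dx:]                  # their offset neighbors
        hist = np.zeros((n_bins, n_bins))
        np.add.at(hist, (a.ravel(), b.ravel()), 1.0)  # unweighted pair counts
        feats.append(hist.ravel())
    return np.concatenate(feats)


def conv_cohog(img, patch=8, stride=1, **kwargs):
    """Extract Co-HOG from every patch position (patch size is assumed).

    stride=1 visits every possible patch of the given size; how the
    per-patch features are pooled afterwards is left open here.
    """
    img = np.asarray(img, dtype=float)
    h, w = img.shape
    feats = [cohog(img[y:y + patch, x:x + patch], **kwargs)
             for y in range(0, h - patch + 1, stride)
             for x in range(0, w - patch + 1, stride)]
    return np.stack(feats)


# Usage on a synthetic 16 x 16 "character image".
rng = np.random.default_rng(0)
image = rng.random((16, 16))
single = cohog(image)        # one Co-HOG vector for the whole image
dense = conv_cohog(image)    # one Co-HOG vector per 8 x 8 patch
```

With three offsets and eight bins, each Co-HOG vector has 3 x 8 x 8 = 192 entries, against 8 for a plain orientation histogram; the extra entries are what carry the pairwise spatial context the abstract describes.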
Elsevier