Paper
19 January 2009 Simultaneous segmentation and recognition of Arabic printed text using linguistic concepts of vocabulary
Mohamed Ben Halima, Adel M. Alimi
Author Affiliations +
Proceedings Volume 7247, Document Recognition and Retrieval XVI; 72470T (2009) https://doi.org/10.1117/12.805617
Event: IS&T/SPIE Electronic Imaging, 2009, San Jose, California, United States
Abstract
In this paper, we propose a new approach to Arabic printed text analysis and recognition. This approach is based on linguistic concepts of Arabic vocabulary. For the text, we allow to categorize the words in decomposable words (derived from a root) and indecomposable words (not derived from a root) and to put forth morpho-syntactic characterization hypotheses for each word. For the decomposable words, we attempt to recognize word basic morphemes: antefix, prefix, infix, suffix, postfix and root contrary to existing approaches which are usually based on recognition of word entity by holistic approach.
© (2009) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Mohamed Ben Halima and Adel M. Alimi "Simultaneous segmentation and recognition of Arabic printed text using linguistic concepts of vocabulary", Proc. SPIE 7247, Document Recognition and Retrieval XVI, 72470T (19 January 2009); https://doi.org/10.1117/12.805617
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Associative arrays

Chlorine

Systems modeling

Barium

Data mining

Databases

Image processing

Back to Top