Paper
13 January 2003 Document structure analysis algorithms: a literature survey
Song Mao, Azriel Rosenfeld, Tapas Kanungo
Author Affiliations +
Proceedings Volume 5010, Document Recognition and Retrieval X; (2003) https://doi.org/10.1117/12.476326
Event: Electronic Imaging 2003, 2003, Santa Clara, CA, United States
Abstract
Document structure analysis can be regarded as a syntactic analysis problem. The order and containment relations among the physical or logical components of a document page can be described by an ordered tree structure and can be modeled by a tree grammar which describes the page at the component level in terms of regions or blocks. This paper provides a detailed survey of past work on document structure analysis algorithms and summarize the limitations of past approaches. In particular, we survey past work on document physical layout representations and algorithms, document logical structure representations and algorithms, and performance evaluation of document structure analysis algorithms. In the last section, we summarize this work and point out its limitations.
© (2003) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Song Mao, Azriel Rosenfeld, and Tapas Kanungo "Document structure analysis algorithms: a literature survey", Proc. SPIE 5010, Document Recognition and Retrieval X, (13 January 2003); https://doi.org/10.1117/12.476326
Lens.org Logo
CITATIONS
Cited by 198 scholarly publications and 1 patent.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image segmentation

Detection and tracking algorithms

Optical character recognition

Error analysis

Image processing algorithms and systems

Analytical research

Stochastic processes

RELATED CONTENT

Non-Manhattan layout extraction algorithm
Proceedings of SPIE (March 21 2013)
Locally adaptive document skew detection
Proceedings of SPIE (April 03 1997)
Benchmarking system for document analysis algorithms
Proceedings of SPIE (April 01 1998)

Back to Top