Automatic document processing: A survey
Tang Y.Y.4; Lee S.-W.2; Suen C.Y.1
Source PublicationPattern Recognition
AbstractSurveys of the basic concepts and underlying techniques are presented in this paper. A basic model for document processing is described. In this model, document processing can be divided into two phases: document analysts and document understanding. A document has two structures: geometric (layout) structure and logical structure. Extraction of the geometric structure from a document refers to document analysis; mapping the geometric structure into logical structure deals with document understanding. Both types of document structures and the two areas of document processing are discussed. Two categories of methods have been used in document analysis, namely, (1) hierarchical methods including top-down and bottom-up approaches, (2) no-hierarchical methods including modified fractal signature. Tree transform, formatting knowledge and description language approaches have been used in document understanding. A particular case of form document processing is discussed. Form description and form registration approaches are presented. A form processing system is also introduced. Finally, many techniques, such as skew detection, Hough transform, Gabor filters, projection, crossing counts, form definition language, etc. which have been used in these approaches are discussed. Copyright © 1996 Pattern Recognition Society. Published by Elsevier Science Ltd.
KeywordDescription languages Document analysis and understanding Document processing Formatting knowledge Geometric and logical structures Hierarchical and no-hierarchical methods Texture analysis Tree transform
URLView the original
Fulltext Access
Citation statistics
Cited Times [WOS]:72   [WOS Record]     [Related Records in WOS]
Document TypeJournal article
CollectionUniversity of Macau
Affiliation1.Universite Concordia
2.Korea University
3.Delft University of Technology
4.Hong Kong Baptist University
5.Chungbuk National University
Recommended Citation
GB/T 7714
Tang Y.Y.,Lee S.-W.,Suen C.Y.. Automatic document processing: A survey[J]. Pattern Recognition,1996,29(12):1931-1952.
APA Tang Y.Y.,Lee S.-W.,&Suen C.Y..(1996).Automatic document processing: A survey.Pattern Recognition,29(12),1931-1952.
MLA Tang Y.Y.,et al."Automatic document processing: A survey".Pattern Recognition 29.12(1996):1931-1952.
Related Services
Recommend this item
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Tang Y.Y.]'s Articles
[Lee S.-W.]'s Articles
[Suen C.Y.]'s Articles
Baidu academic
Similar articles in Baidu academic
[Tang Y.Y.]'s Articles
[Lee S.-W.]'s Articles
[Suen C.Y.]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Tang Y.Y.]'s Articles
[Lee S.-W.]'s Articles
[Suen C.Y.]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.