Document Processing for Automatic Knowledge Acquisition
Tang Y.Y.; De Yan C.; Suen C.Y.
Source Publication: IEEE Transactions on Knowledge and Data Engineering
Abstract: The knowledge acquisition bottleneck has become the major impediment to the development and application of effective information systems. To remove this bottleneck, new document processing techniques must be introduced to automatically acquire knowledge from various types of documents. By presenting a survey of the techniques and problems involved, this paper aims to serve as a catalyst to stimulate research in automatic knowledge acquisition through document processing. In this study, a document is considered to have two structures: geometric structure and logical structure. These play a key role in knowledge acquisition, which can be viewed as a process of acquiring these two structures. Extracting the geometric structure from a document is referred to as document analysis; mapping the geometric structure into the logical structure is regarded as document understanding. Both areas are described in this paper, and the basic concept of document structure and its measurement based on entropy analysis is introduced. Logical structure and geometric models are proposed. Both top-down and bottom-up approaches and their entropy analyses are presented. Different techniques are discussed with practical examples. Mapping methods, such as tree transformation, document formatting knowledge and document format description language, are also described. © 1994 IEEE
Keywords: Automatic knowledge acquisition; document understanding; document analysis; entropy analysis; geometric model; geometric structure; logical model; logical structure
Document Type: Journal article
Collection: University of Macau
Affiliation: Universite Concordia
Recommended Citation
GB/T 7714: Tang Y.Y., De Yan C., Suen C.Y. Document Processing for Automatic Knowledge Acquisition[J]. IEEE Transactions on Knowledge and Data Engineering, 1994, 6(1): 3-21.
APA: Tang Y.Y., De Yan C., & Suen C.Y. (1994). Document Processing for Automatic Knowledge Acquisition. IEEE Transactions on Knowledge and Data Engineering, 6(1), 3-21.
MLA: Tang Y.Y., et al. "Document Processing for Automatic Knowledge Acquisition." IEEE Transactions on Knowledge and Data Engineering 6.1 (1994): 3-21.
