NOTICE: Due to a lapse in annual appropriations, most of this website is not being updated. Learn more.
Form submissions will still be accepted but will not receive responses at this time. Sections of this site for programs using non-appropriated funds (such as NVLAP) or those that are excepted from the shutdown (such as CHIPS and NVD) will continue to be updated.
An official website of the United States government
Here’s how you know
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
Secure .gov websites use HTTPS
A lock (
) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.
K H. Lee, Y C. Choy, S B. Cho, Xiao Tang, V R. Mccrary
Abstract
¿¿¿With the widespread of XML documents on the Web, there is a growing interest in transforming paper-based documents into XML representations. In this paper, we present a syntactic method for logical structure analysis of documents with multiple pages and hierarchical structure. To generate a logical structure more accurately and quickly than previous works of which the basic units are text lines, the proposed method takes text regions with hierarchical structure as input. Furthermore, we define a document model that is able to represent explicit knowledge about geometric characteristics and logical structure information of documents efficiently. Experimental results with 372 images scanned from technical journal documents show that the method has performed logical structure analysis successfully. Particularly, the method generates XML documents as the result of structural analysis, so that it enhances the reusability of documents.
Proceedings Title
Lecture Notes in Computer Science (Proceedings of International Workshop on Multimedia Data and Document Engineering
Conference Location
, 1
Pub Type
Conferences
Citation
Lee, K.
, Choy, Y.
, Cho, S.
, Tang, X.
and Mccrary, V.
(2021),
Content Migration: From Paper to XML Documents, Lecture Notes in Computer Science (Proceedings of International Workshop on Multimedia Data and Document Engineering, , 1
(Accessed October 7, 2025)