Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Document Image Coding for Processing and Retrieval

Published

Author(s)

O E. Kia, D Doermann

Abstract

Document images belong to a unique class of images where the information content is contained in the language represented by a series of symbols on the page rather than in the visual objects themselves. From this, a new image coding strategy can be designed so as to address some compression and retrieval issues. In this paper we describe a coding methodology that not only exploits component-level redundancy to reduce code length but also expedites efficient data access. The approach uses an image pattern approach that captures image redundancy while providing a natural information index. The approach identifies patterns which appear repeatedly, represents similar patterns with a single prototype, stores the location of pattern instances, and codes the residuals between the prototypes and the pattern instances. Compression results are somewhat competitive but compressed-domain access is clearly superior to competing methods. Furthermore, applications to network-related problems have been considered which show favorable results.
Citation
Journal of Vlsi Signal Processing
Volume
20
Issue
1-2

Keywords

compression, document image coding, transmission

Citation

Kia, O. and Doermann, D. (1998), Document Image Coding for Processing and Retrieval, Journal of Vlsi Signal Processing, [online], https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=150716 (Accessed February 21, 2024)
Created October 30, 1998, Updated February 17, 2017