Skip to main content

NOTICE: Due to a lapse in annual appropriations, most of this website is not being updated. Learn more.

Form submissions will still be accepted but will not receive responses at this time. Sections of this site for programs using non-appropriated funds (such as NVLAP) or those that are excepted from the shutdown (such as CHIPS and NVD) will continue to be updated.

U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Document Image Coding for Processing and Retrieval

Published

Author(s)

O E. Kia, D Doermann

Abstract

Document images belong to a unique class of images where the information content is contained in the language represented by a series of symbols on the page rather than in the visual objects themselves. From this, a new image coding strategy can be designed so as to address some compression and retrieval issues. In this paper we describe a coding methodology that not only exploits component-level redundancy to reduce code length but also expedites efficient data access. The approach uses an image pattern approach that captures image redundancy while providing a natural information index. The approach identifies patterns which appear repeatedly, represents similar patterns with a single prototype, stores the location of pattern instances, and codes the residuals between the prototypes and the pattern instances. Compression results are somewhat competitive but compressed-domain access is clearly superior to competing methods. Furthermore, applications to network-related problems have been considered which show favorable results.
Citation
Journal of Vlsi Signal Processing
Volume
20
Issue
1-2

Keywords

compression, document image coding, transmission

Citation

Kia, O. and Doermann, D. (1998), Document Image Coding for Processing and Retrieval, Journal of Vlsi Signal Processing, [online], https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=150716 (Accessed October 26, 2025)

Issues

If you have any questions about this publication or are having problems accessing it, please contact [email protected].

Created October 30, 1998, Updated February 17, 2017
Was this page helpful?