Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

NIST Special Database 8. NIST Machine-Print Database of Gray Scale and Binary Images (MPDB)

Published

Author(s)

Michael Garris

Abstract

This database is a valuable tool for measurement and comparison of system performance on machine-print pages.The NIST machine-printed database contains gray scale and binary images of machine printed pages.There are 360 digitized pages on three CD-ROM discs. There are a total of 3,063,168 characters in the set which is an average of 8509 characters per page. A reference file is included for each page. These reference files are the ASCII text pages that were used to generate the original hardcopy that was digitized.This database is being distributed for use in the development and testing of Optical Character Recognition (OCR) systems on a common set of images. This allows vendors to report results with respect to this common image set.
Citation
World Wide Web-Internet and Web Information Systems

Keywords

ASCII reference, automated character recognition, automated data capture, binary, grayscale image database, machine print, OCR, optical character recognition, style

Citation

Garris, M. (2008), NIST Special Database 8. NIST Machine-Print Database of Gray Scale and Binary Images (MPDB), World Wide Web-Internet and Web Information Systems (Accessed May 5, 2024)
Created October 16, 2008