An official website of the United States government
Here’s how you know
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
Secure .gov websites use HTTPS
A lock (
) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.
NIST Special Database 8. NIST Machine-Print Database of Gray Scale and Binary Images (MPDB)
Published
Author(s)
Michael Garris
Abstract
This database is a valuable tool for measurement and comparison of system performance on machine-print pages.The NIST machine-printed database contains gray scale and binary images of machine printed pages.There are 360 digitized pages on three CD-ROM discs. There are a total of 3,063,168 characters in the set which is an average of 8509 characters per page. A reference file is included for each page. These reference files are the ASCII text pages that were used to generate the original hardcopy that was digitized.This database is being distributed for use in the development and testing of Optical Character Recognition (OCR) systems on a common set of images. This allows vendors to report results with respect to this common image set.
Citation
World Wide Web-Internet and Web Information Systems
Pub Type
Journals
Keywords
ASCII reference, automated character recognition, automated data capture, binary, grayscale image database, machine print, OCR, optical character recognition, style
Citation
Garris, M.
(2008),
NIST Special Database 8. NIST Machine-Print Database of Gray Scale and Binary Images (MPDB), World Wide Web-Internet and Web Information Systems
(Accessed December 9, 2024)