NIST Special Database 8. NIST Machine-Print Database of Gray Scale and Binary Images (MPDB)
This database is a valuable tool for measurement and comparison of system performance on machine-print pages.The NIST machine-printed database contains gray scale and binary images of machine printed pages.There are 360 digitized pages on three CD-ROM discs. There are a total of 3,063,168 characters in the set which is an average of 8509 characters per page. A reference file is included for each page. These reference files are the ASCII text pages that were used to generate the original hardcopy that was digitized.This database is being distributed for use in the development and testing of Optical Character Recognition (OCR) systems on a common set of images. This allows vendors to report results with respect to this common image set.
World Wide Web-Internet and Web Information Systems
ASCII reference, automated character recognition, automated data capture, binary, grayscale image database, machine print, OCR, optical character recognition, style
NIST Special Database 8. NIST Machine-Print Database of Gray Scale and Binary Images (MPDB), World Wide Web-Internet and Web Information Systems
(Accessed December 3, 2023)