NIST Special Database 2. NIST Structured Forms Reference Set of Binary Images (SFRS)
The NIST Structured Forms Database consists of 5,590 pages of binary, black-and-white images of synthesized documents.This database is a valuable tool for measurement of system performance and system comparison on complex forms.The documents in this database are 12 different tax forms from the IRS 1040 Package X for the year 1988. These include Forms 1040, 2106, 2441, 4562, and 6251 together with Schedules A, B, C, D, E, F, and SE. Eight of these forms contain two pages or form faces; therefore, there are 20 different form faces represented in the database. The document images in this database appear to be real forms prepared by individuals, but the images havebeen automatically derived and synthesized using a computer. There are 900 simulated tax submissions represented in the database averaging 6.2 form faces per submission. This significant new database totals approximately 5.9 gigabytes of uncompressed image data including image format documentation and example software.
World Wide Web-Internet and Web Information Systems
ASCII reference, automated character recognition, binary image database, forms identification, image format documentation, IRS, OCR, printed characters, synthesized documents
NIST Special Database 2. NIST Structured Forms Reference Set of Binary Images (SFRS), World Wide Web-Internet and Web Information Systems
(Accessed June 5, 2023)