NOTICE: Due to a lapse in annual appropriations, most of this website is not being updated. Learn more.
Form submissions will still be accepted but will not receive responses at this time. Sections of this site for programs using non-appropriated funds (such as NVLAP) or those that are excepted from the shutdown (such as CHIPS and NVD) will continue to be updated.
An official website of the United States government
Here’s how you know
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
Secure .gov websites use HTTPS
A lock (
) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.
Document Image Collection Using Amazon s Mechanical Turk
Published
Author(s)
Audrey N. Tong, Mark A. Przybocki
Abstract
We present findings from a collaborative effort aimed at testing the feasibility of using Amazon s Mechanical Turk as a data collection platform to build a corpus of document images. Experimental design and implementation workflow are described. Preliminary findings and directions for future work are also discussed.
Proceedings Title
Proceedings of the 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics
Conference Dates
June 1-6, 2010
Conference Location
Los Angeles, CA
Conference Title
The 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics
Tong, A.
and Przybocki, M.
(2010),
Document Image Collection Using Amazon s Mechanical Turk, Proceedings of the 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics , Los Angeles, CA, [online], https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=906834
(Accessed October 15, 2025)