Take a sneak peek at the new NIST.gov and let us know what you think!
(Please note: some content may not be complete on the beta site.).
NIST Authors in Bold
|Author(s):||Peter Bajcsy; Antoine Vandecreme; Julien M. Amelot; Phuong T. Nguyen; Joe Chalfoun; Mary C. Brady;|
|Title:||Terabyte Size Image Computations on Hadoop Cluster Platforms|
|Published:||October 07, 2013|
|Abstract:||We present a characterization of four basic terabyte size image computations on a Hadoop cluster in terms of their relative efficiency according to the modified Amdahl‰s law. The work is motivated by the fact that there is a lack of standard benchmarks and stress tests for big image processing operations on a Hadoop computer cluster platform. Our benchmark design and evaluations were performed on one of the three microscopy image sets, each consisting of about a half of a terabyte size image volume. All image processing benchmarks executed on the NIST Raritan cluster with Hadoop were compared against baseline measurements, such as the Tera-Sort/Tera-Gen designed for Hadoop testing previously, image processing executions on a multiprocessor desktop and on NIST Raritan cluster using Java Remote Method Invocation (RMI) with multiple configurations. By applying our methodology to assessing efficiencies of computations on computer cluster configurations, we could rank computation configurations and aid scientists in measuring the benefits of running image processing on a cluster.|
|Proceedings:||2013 IEEE International Conference on Big Data|
|Location:||San Diego, CA|
|Dates:||October 6-9, 2013|
|Keywords:||Big Data Industry Standards, Big Data Open Platform, Big Data Applications and Infrastructure|
|Research Areas:||Data and Informatics, Information Processing Systems, Imaging|
|PDF version:||Click here to retrieve PDF version of paper (533KB)|