Back to Search Start Over

Performance Impact of Batching Web-Application Requests Using Hot-Spot Processing on GPUs

Authors :
Tobias Fjalling
Per Stenström
Source :
IPDPS
Publication Year :
2015
Publisher :
IEEE, 2015.

Abstract

Web applications are a good fit for many-core servers because of their inherent high-degree of request-level parallelism. Yet, processing-intensive web-server requests can lead to low quality-of-service due to hot-spots, which calls for methods that can improve single-thread performance. This paper explores how to use off-chip GPUs to speed up web application hot-spots written in productivity-friendly environments (e.g. C#). First, we apply a number of straightforward optimizations through refactoring of a commercial-strength, web application code. This yields a speedup of 7.6 in a CPU multi-threaded, and multi-core test. Second, we then gather similar requests from different threads of the optimized code, by applying a technique called batching, to exploit SIMD parallelism provided by GPUs. Surprisingly, there is ample parallelism to be exploited from the already optimized code yielding a speedup of a factor between 2x to 3x compared to the best optimized CPU version.

Details

Database :
OpenAIRE
Journal :
2015 IEEE International Parallel and Distributed Processing Symposium
Accession number :
edsair.doi...........41d29f2e5b174d80029ff2f8ddc8b1ff
Full Text :
https://doi.org/10.1109/ipdps.2015.64