Performance Impact of Batching Web-Application Requests Using Hot-Spot Processing on GPUs

Authors :: Tobias Fjalling
Per Stenström
Source :: IPDPS
Publication Year :: 2015
Publisher :: IEEE, 2015.
Abstract: Web applications are a good fit for many-core servers because of their inherently high degree of request-level parallelism. Yet processing-intensive web-server requests can lead to low quality of service due to hot-spots, which calls for methods that improve single-thread performance. This paper explores how to use off-chip GPUs to speed up web-application hot-spots written in productivity-friendly environments (e.g., C#). First, we apply a number of straightforward optimizations by refactoring commercial-strength web-application code. This yields a speedup of 7.6x in a multi-threaded, multi-core CPU test. Second, we gather similar requests from different threads of the optimized code, by applying a technique called batching, to exploit the SIMD parallelism provided by GPUs. Surprisingly, there is ample parallelism to be exploited from the already optimized code, yielding a speedup of between 2x and 3x compared to the best optimized CPU version.

Subjects :: Web server
Speedup
Data parallelism
Computer science
Task parallelism
Parallel computing
Thread (computing)
Program optimization
Code refactoring
Server
Web application
SIMD

Database :: OpenAIRE
Journal :: 2015 IEEE International Parallel and Distributed Processing Symposium
Accession number :: edsair.doi...........41d29f2e5b174d80029ff2f8ddc8b1ff
Full Text :: https://doi.org/10.1109/ipdps.2015.64
