Back to Search
Start Over
Queue-length-aware dispatching in large-scale heterogeneous systems.
- Source :
-
Queueing Systems . Oct2024, Vol. 108 Issue 1/2, p125-184. 60p. - Publication Year :
- 2024
-
Abstract
- One dominant approach for reducing response times in large-scale systems is Join-the-Shortest-Queue-d: whenever a job arrives, the dispatcher queries d servers at random and then assigns the job to the queried server with the shortest queue. While JSQ -d is known to perform quite well in systems where all servers run at the same speed, this is not the case in systems that exhibit heterogeneity with respect to server speeds. Unfortunately, there is no straightforward way to extend JSQ -d (or other so-called power-of-d policies) to heterogeneous systems. Should a job be assigned to the queried server with the shortest queue even if much faster servers were among those queried? Should a job be assigned to the queried server where it is expected to complete the soonest even if there is an idle, albeit slower, server available among those queried? And for that matter, should we query faster servers more often than their slower counterparts? Recent work has introduced a framework for developing strong dispatching policies by pairing suitably chosen querying and assignment rules. Within this framework, prior work has focused on finding strong-performing dispatching policies that use only the idle/busy statuses of the queried servers, rather than detailed queue length information. In this paper, we overcome the challenge of evaluating the performance of—and finding strong-performing—general scalable dispatching policies that make use of both server speed and detailed queue length information, through a combination of mean field analysis and a sequence of modified optimization problems. We find that well-designed length-aware dispatching policies can significantly outperform their idleness-based counterparts in large-scale heterogeneous systems. While the best policies of this kind are often complicated to describe, we find that in the vast majority of cases the relatively simple Shortest Expected Wait policy performs nearly as well, without the need for an especially sophisticated and time-consuming optimization procedure. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 02570130
- Volume :
- 108
- Issue :
- 1/2
- Database :
- Academic Search Index
- Journal :
- Queueing Systems
- Publication Type :
- Academic Journal
- Accession number :
- 179234981
- Full Text :
- https://doi.org/10.1007/s11134-024-09918-x