1. Adaptive Asynchronous Work-Stealing for distributed load-balancing in heterogeneous systems
- Author
- Fernandes, João B.; de Assis, Ítalo A. S.; Martins, Idalmis M. S.; Barros, Tiago; and Xavier-de-Souza, Samuel
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing
- Abstract
Supercomputers have revolutionized how industries and scientific fields process large amounts of data. These machines group hundreds or thousands of computing nodes working together to execute time-consuming programs that require a large amount of computational resources. Over the years, supercomputers have expanded to include new and different technologies characterizing them as heterogeneous. However, executing a program in a heterogeneous environment requires attention to a specific aspect of performance degradation: load imbalance. In this research, we address the challenges associated with load imbalance when scheduling many homogeneous tasks in a heterogeneous environment. To address this issue, we introduce the concept of adaptive asynchronous work-stealing. This approach collects information about the nodes and utilizes it to improve work-stealing aspects, such as victim selection and task offloading. Additionally, the proposed approach eliminates the need for extra threads to communicate information, thereby reducing overhead when implementing a fully asynchronous approach. Our experimental results demonstrate a performance improvement of approximately 10.1% compared to other conventional and state-of-the-art implementations.
Comment: 32 pages, 5 figures
- Published
- 2024