Back to Search
Start Over
Deploying Heterogeneity-aware Deep Learning Workloads on the Computing Continuum
- Source :
- BDA 2021-37e Conférence sur la Gestion de Données-Principes, Technologies et Applications, BDA 2021-37e Conférence sur la Gestion de Données-Principes, Technologies et Applications, Oct 2021, Paris, France
- Publication Year :
- 2021
- Publisher :
- HAL CCSD, 2021.
-
Abstract
- National audience; The increasing need for real-time analytics motivated the emergence of new incremental methods to learn representations from continuous flows of data, especially in the context of the Internet of Things. This trend led to the evolution of centralized computing infrastructures towards interconnected processing units spanning from edge devices to cloud data centers. This new paradigm is referred to as the Computing or Edge-to-Cloud Continuum. However, the network and compute heterogeneity across and within clusters may negatively impact Deep Learning (DL) training. We introduce a roadmap for understanding the end-to-end performance of DL workloads in such heterogeneous settings. The goal is to identify key parameters leading to stragglers and devise novel intra- and inter-cluster strategies to address them. We will explore various policies aiming to improve makespan, cost and fairness objectives while ensuring system scalability.
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- BDA 2021-37e Conférence sur la Gestion de Données-Principes, Technologies et Applications, BDA 2021-37e Conférence sur la Gestion de Données-Principes, Technologies et Applications, Oct 2021, Paris, France
- Accession number :
- edsair.dedup.wf.001..9b2c01c5e60b886a92e93049e927d47f