PackStealLB: A scalable distributed load balancer based on work stealing and workload discretization - Systèmes Parallèles
Article Dans Une Revue Journal of Parallel and Distributed Computing Année : 2021

PackStealLB: A scalable distributed load balancer based on work stealing and workload discretization

Résumé

The scalability of high-performance, parallel iterative applications is directly affected by how well they use the available computing resources. These applications are subject to load imbalance due to the nature and dynamics of their computations. It is common that high performance systems employ periodic load balancing to tackle this issue. Dynamic load balancing algorithms redistribute the application’s workload using heuristics to circumvent the NP-hard complexity of the problem However, scheduling heuristics must be fast to avoid hindering application performance when distributing the workload on large and distributed environments. In this work, we present a technique for low overhead, high quality scheduling decisions for parallel iterative applications. The technique relies on combined application workload information paired with distributed scheduling algorithms. An initial distributed step among scheduling agents group application tasks in packs of similar load to minimize messages among them. This information is used by our scheduling algorithm, PackStealLB, for its distributed-memory work stealing heuristic. Experimental results showed that PackStealLB is able to improve the performance of a molecular dynamics benchmark by up to 41%, outperforming other scheduling algorithms in most scenarios over almost one thousand cores.
Fichier principal
Vignette du fichier
main.pdf (538.35 Ko) Télécharger le fichier
Origine Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02405735 , version 1 (11-12-2019)
hal-02405735 , version 2 (09-06-2020)
hal-02405735 , version 3 (06-07-2020)

Identifiants

Citer

Vinicius Freitas, Laércio Lima Pilla, Alexandre Santana, Márcio C Castro, Johanne Cohen. PackStealLB: A scalable distributed load balancer based on work stealing and workload discretization. Journal of Parallel and Distributed Computing, 2021, 150, pp.34-45. ⟨10.1016/j.jpdc.2020.12.005⟩. ⟨hal-02405735v3⟩
733 Consultations
330 Téléchargements

Altmetric

Partager

More