READYS: A Reinforcement Learning Based Strategy for Heterogeneous Dynamic Scheduling

Nathan Grinsztajn; Olivier Beaumont; Emmanuel Jeannot; Philippe Preux

Communication Dans Un Congrès Année : 2021

READYS: A Reinforcement Learning Based Strategy for Heterogeneous Dynamic Scheduling

(1) , (2) , (3) , (1)

1
2
3

Nathan Grinsztajn

Fonction : Auteur
PersonId : 1083320

Scool

Olivier Beaumont

Fonction : Auteur
PersonId : 181224
IdHAL : olivier-beaumont
ORCID : 0000-0003-2741-6228
IdRef : 124577083

High-End Parallel Algorithms for Challenging Numerical Simulations

Emmanuel Jeannot

Fonction : Auteur
PersonId : 15678
IdHAL : emmanuel-jeannot
ORCID : 0000-0002-3956-2997
IdRef : 084595108

Topology-Aware System-Scale Data Management for High-Performance Computing

Philippe Preux

Fonction : Auteur
PersonId : 5488
IdHAL : preux-philippe
IdRef : 059896353

Scool

Résumé

In this paper, we propose READYS, a reinforcement learning algorithm for the dynamic scheduling of computations modeled as a Directed Acyclic Graph (DAGs). Our goal is to develop a scheduling algorithm in which allocation and scheduling decisions are made at runtime, based on the state of the system, as performed in runtime systems such as StarPU or ParSEC. Reinforcement Learning is a natural candidate to achieve this task, since its general principle is to build step by step a strategy that, given the state of the system (the state of the resources and a view of the ready tasks and their successors in our case), makes a decision to optimize a global criterion. Moreover, the use of Reinforcement Learning is natural in a context where the duration of tasks (and communications) is stochastic. We propose READYS that combines Graph Convolutional Networks (GCN) with an Actor-Critic Algorithm (A2C): it builds an adaptive representation of the scheduling problem on the fly and learns a scheduling strategy, aiming at minimizing the makespan. A crucial point is that READYS builds a general scheduling strategy which is neither limited to only one specific application or task graph nor one particular problem size, and that can be used to schedule any DAG. We focus on different types of task graphs originating from linear algebra factorization kernels (CHOLESKY, LU, QR) and we consider heterogeneous platforms made of a few CPUs and GPUs. We first propose to analyze the performance of READYS when learning is performed on a given (platform, kernel, problem size) combination. Using simulations, we show that the scheduling agent obtains performances very similar or even superior to algorithms from the literature, and that it is especially powerful when the scheduling environment contains a lot of uncertainty. We additionally demonstrate that our agent exhibits very promising generalization capabilities. To the best of our knowledge, this is the first paper which shows that reinforcement learning can really be used for dynamic DAG scheduling on heterogeneous resources.

Domaines

Calcul parallèle, distribué et partagé [cs.DC]

Fichier principal

cluster.pdf (416.25 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Emmanuel Jeannot : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-03313229

Soumis le : mardi 3 août 2021-15:47:14

Dernière modification le : mercredi 20 mars 2024-17:52:16

Archivage à long terme le : jeudi 4 novembre 2021-19:03:21

Dates et versions

hal-03313229 , version 1 (03-08-2021)

Identifiants

HAL Id : hal-03313229 , version 1

Citer

Nathan Grinsztajn, Olivier Beaumont, Emmanuel Jeannot, Philippe Preux. READYS: A Reinforcement Learning Based Strategy for Heterogeneous Dynamic Scheduling. IEEE Cluster 2021, Sep 2021, Portland / Virtual, United States. ⟨hal-03313229⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA GRID5000 CRISTAL INRIA2 UNIV-LILLE SILECS CRISTAL-SCOOL

910 Consultations

727 Téléchargements

READYS: A Reinforcement Learning Based Strategy for Heterogeneous Dynamic Scheduling

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager