Workload Scheduling on Heterogeneous Devices
Tuesday, May 14, 2024 3:10 PM to 3:35 PM · 25 min. (Europe/Berlin)
Hall F - 2nd floor
Research Paper
Heterogeneous System ArchitecturesResource Management and Scheduling
Information
Hardware accelerators have become the backbone
of many cloud and HPC workloads, but workloads tend to
statically choose accelerators leaving devices unused while others
are oversubscribed. We propose a holistic framework that allows
a computational kernel to span across multiple devices on a node,
as well as multiple applications being scheduled on the same node.
Our work sharing and co-scheduling framework allows kernels
to be migrated between devices, expand to span more devices, or
contract to fewer devices. The scheduler can make these decisions
dynamically based on a pluggable scheduling algorithm in order
to optimize for different objectives, e.g., job throughput, job
priorities or some hybrid. Experiments on a CPU+GPU+FPGA
platform indicate speedups of 2.26X over different applications
and up to 1.25X for co-scheduled workloads over baselines.
Besides performance, a major contribution of our work lies in
ease of programmability with a single code base compiled and
runtime controlled across three vastly different execution devices.
Contributors:
Contributors:
Format
On-siteOn Demand
Documents & Links
Read the Full Paper Open Access at IEEE Xplore!