Emerging HPC Processors and AcceleratorsExascale SystemsHPC WorkflowsManaging Extreme-Scale ParallelismParallel Programming Languages
Information
Format
On-site
Targeted Audience
Scientific software developers, scientists, and students aiming to scale their applications
efficiently across many GPUs.
Attendees interested in identifying, understanding, and resolving performance bottlenecks in multi-GPU applications.
Attendees familiar with multi-GPU applications but wanting to learn new techniques and use the latest software and hardware features.
Prerequisites
We strive to make the tutorial as accessible as possible. As an intermediate-level tutorial, we however expect basic knowledge of distributed computing with MPI, CUDA C++, and programming in C/C++. Additionally, experience in using HPC systems is needed (Linux shell, make, Slurm).
Participants are expected to provide a laptop with which they can access the HPC system. Access will be facilitated via individual accounts using the Jupyter platform.
Beginner Level
Intermediate Level
70%
Advanced Level
25%





