Accelerating Generative AI with PyTorch
Sunday, May 12, 2024 9:00 AM to 1:00 PM · 4 hr. (Europe/Berlin)
Hall Y1 - 2nd floor
Tutorial
AI Applications powered by HPC TechnologiesLarge Language Models and Generative AI in HPC
Information
Generative Artificial Intelligence (Generative AI) has emerged as a transformative force in creating realistic data, images, and content. By leveraging deep learning (DL) techniques, generative AI models learn patterns and structures from large datasets and autonomously produce novel outputs without explicit programming for each possible outcome. Even though these models are usually pre-trained on massive datasets, fine-tuning for specific tasks or performing inference still requires considerable number of computational resources, necessitating strategic optimizations to unlock their full potential.
This tutorial, powered by EuroCC 2, aims to equip participants with a comprehensive understanding of the computational challenges inherent in Generative AI. Moreover, it seeks to present strategies for identifying and mitigating bottlenecks, exclusively leveraging PyTorch’s native features without resorting to additional languages like C++ or CUDA. The tutorial encompasses a diverse array of topics, including hands-on sessions on setting up and optimizing a generic DL pipeline on a supercomputer, an exploration of mainstream Generative AI models, and optimization strategies for inference specifically tailored to Generative AI. Additionally, it investigates advanced subjects such as sparsification and model parallelism for models with a large number of parameters. By the end of this tutorial, participants will be capable at navigating the challenges of accelerating Generative AI using the powerful tools and techniques inherent in PyTorch.
Format
On-site
Targeted Audience
This tutorial targets a broad audience, from basic to advanced users of Generative AI who want to learn optimization techniques applicable to mainstream Generative AI models.
Beginner Level
30%
Intermediate Level
70%
Prerequisites
We strive to make the tutorial as accessible as possible. As an intermediate-level tutorial, we however expect basic knowledge of Deep Learning, and programming in Python. Additionally, some experience in using HPC systems is helpful (Linux shell, Slurm) but not mandatory. Participants are expected to provide a laptop with which they can access the HPC system. Access will be facilitated via individual accounts using the Jupyter platform.
Speakers
Christodoulos Stylianou
Research EngineerThe Cyprus InstituteRobert Jan Schlimbach
Technical Advisor ML/HPCSURF bvivan gentile
Data ScientistIFAB - International Foundation of Artificial Intelligence and Big Data for Human DevelopmentCC
Charalambos Chrysostomou
Associate Research ScientistThe Cyprus InstituteCD
Constantine Dovrolis
ProfessorThe Cyprus Institute & Georgia TechBV
Boris Velichkov
ResearcherSofia University St. Kliment Ohridski