Launching E4S in the Cloud with ODDC

Launching E4S in the Cloud with ODDC

Wednesday, May 24, 2023 2:00 PM to 2:20 PM · 20 min. (Europe/Berlin)
Hall H, Booth K1001 - Ground Floor
HPC Solutions Forum
HPC WorkflowsManaging Extreme-Scale Parallelism

Information

The Extreme-scale Scientific Software Stack (E4S) [https://e4s.io] is a curated, Spack based software distribution of 100+ HPC, EDA, and AI/ML packages. The talk will focus on how E4S is being made available to multiple commercial cloud platforms using Adaptive Computing’s HPC Cloud On-Demand Data Center (ODDC) platform to target AWS, GCP, OCI, and Azure. These cloud images use E4S and Spack as the core components for product integration and deployment of a range of HPC and AI/ML tools. These include performance evaluation tools such as TAU, HPCToolkit, DyninstAPI, PAPI, etc. and support both bare-metal and containerized deployment for CPU and GPU platforms. E4S provides a Spack binary cache and a set of base and full-featured container images with vendor runtimes to support GPU architectures from NVIDIA, Intel, and AMD. E4S is a community effort to provide open-source software packages for developing, deploying, and running scientific applications and tools on HPC platforms. It is a comprehensive, coherent software stack that enables application developers to productively develop highly parallel applications that effectively target diverse exascale architectures. It also includes a container launch tool (e4s-cl) that allows binary distribution of applications by substituting MPI in the containerized application with the system MPI. It features tools to customize base container images provided by E4S by adding packages using Spack and OS package managers. These containers are available for download from the E4S website and DockerHub. E4S is released under the MIT license and is supported by U.S. Department of Energy’s Exascale Computing Project [https://www.exascaleproject.org].
HPC Solutions Forum Topics
How should an organization determine the proper balance between cloud and on-prem, for performance, scalability, and TCO?
Format
On-site
Beginner Level
50%
Intermediate Level
25%
Advanced Level
25%

Log in