Data-Centric Computing for the Next Generation

Data-Centric Computing for the Next Generation

Monday, June 28, 2021 1:15 PM to 1:50 PM · 35 min. (Africa/Abidjan)

Information

Contributors:
Abstract:

The efficient, convenient, and robust execution of data-driven workflows and enhanced data management are key for productivity in scientific computing and computer-aided RD&E. Big data tools integrate compute and storage capabilities into a holistic solution demonstrating the benefit of tight integrating while the HPC community still optimizes the compute and storage components independently from each other, and, moreover, independently from the needs of end-to-end user workflows that ultimately lead to insight. Even within a single data center, utilizing homogeneous storage and compute infrastructure efficiently is complex for experts. The efficient management in a heterogeneous environment, however, is an unresolved question as the execution of individual tasks from workflows may benefit from alternative hardware architectures and infrastructures.

In this BoF, we bring the community together to discuss visions for a data-centric computing environment of the future that gives the fastest time to insight by applying concepts like smart scheduling to workflows which, e.g., minimize data movement for the entire workflow and exploit the capabilities of heterogeneous environments that stretch beyond a single data center and into the cloud. As this has implications on data-center planning, hardware/software infrastructure starting from a higher-level workflow formulation to smarter hardware and software layers, it affects the wider HPC community. We aim to gather stakeholders from industry and academia interested in this approach with the ultimate goal is to establish a new forum that addresses the need for Next Generation Interfaces that defines and realizes the vision that will impact the next generations of scientists.