Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Vincent R. Pascuzzi

AI-coupled HPC Workflows

Aug 24, 2022

Shantenu Jha, Vincent R. Pascuzzi, Matteo Turilli

Abstract:Increasingly, scientific discovery requires sophisticated and scalable workflows. Workflows have become the ``new applications,'' wherein multi-scale computing campaigns comprise multiple and heterogeneous executable tasks. In particular, the introduction of AI/ML models into the traditional HPC workflows has been an enabler of highly accurate modeling, typically reducing computational needs compared to traditional methods. This chapter discusses various modes of integrating AI/ML models to HPC computations, resulting in diverse types of AI-coupled HPC workflows. The increasing need of coupling AI/ML and HPC across scientific domains is motivated, and then exemplified by a number of production-grade use cases for each mode. We additionally discuss the primary challenges of extreme-scale AI-coupled HPC campaigns -- task heterogeneity, adaptivity, performance -- and several framework and middleware solutions which aim to address them. While both HPC workflow and AI/ML computing paradigms are independently effective, we highlight how their integration, and ultimate convergence, is leading to significant improvements in scientific performance across a range of domains, ultimately resulting in scientific explorations otherwise unattainable.

Via

Access Paper or Ask Questions

Asynchronous Execution of Heterogeneous Tasks in AI-coupled HPC Workflows

Aug 23, 2022

Vincent R. Pascuzzi, Matteo Turilli, Shantenu Jha

Figure 1 for Asynchronous Execution of Heterogeneous Tasks in AI-coupled HPC Workflows

Figure 2 for Asynchronous Execution of Heterogeneous Tasks in AI-coupled HPC Workflows

Figure 3 for Asynchronous Execution of Heterogeneous Tasks in AI-coupled HPC Workflows

Figure 4 for Asynchronous Execution of Heterogeneous Tasks in AI-coupled HPC Workflows

Abstract:Heterogeneous scientific workflows consist of numerous types of tasks and dependencies between them. Middleware capable of scheduling and submitting different task types across heterogeneous platforms must permit asynchronous execution of tasks for improved resource utilization, task throughput, and reduced makespan. In this paper we present an analysis of an important class of heterogeneous workflows, viz., AI-driven HPC workflows, to investigate asynchronous task execution requirements and properties. We model the degree of asynchronicity permitted for arbitrary workflows, and propose key metrics that can be used to determine qualitative benefits when employing asynchronous execution. Our experiments represent important scientific drivers, are performed at scale on Summit, and performance enhancements due to asynchronous execution are consistent with our model.

* Submitted to 17th Workshop on Workflows in Support of Large-Scale Science

Via

Access Paper or Ask Questions