Venkatram Vishwanath

LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators

Oct 31, 2024
Via arXiv

Scalable and Consistent Graph Neural Networks for Distributed Mesh-based Data-driven Modeling

Oct 02, 2024
Via arXiv

Mesh-based Super-Resolution of Fluid Flows with Multiscale Graph Neural Networks

Sep 12, 2024
Via arXiv

DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies

Oct 11, 2023
Via arXiv

A Comprehensive Performance Study of Large Language Models on Novel AI Accelerators

Oct 06, 2023
Via arXiv

Parallel Multi-Objective Hyperparameter Optimization with Uniform Normalization and Bounded Objectives

Sep 26, 2023
Via arXiv

A Survey of Techniques for Optimizing Transformer Inference

Jul 16, 2023
Via arXiv

A Multi-Level, Multi-Scale Visual Analytics Approach to Assessment of Multifidelity HPC Systems

Jun 15, 2023
Via arXiv

Operation-Level Performance Benchmarking of Graph Neural Networks for Scientific Applications

Jul 20, 2022
Via arXiv

Asynchronous Distributed Bayesian Optimization at HPC Scale

Jul 04, 2022
Via arXiv