Picture for Charith Mendis

Charith Mendis

Transforming the Hybrid Cloud for Emerging AI Workloads

Add code
Nov 20, 2024
Viaarxiv icon

SPLAT: A framework for optimised GPU code-generation for SParse reguLar ATtention

Add code
Jul 23, 2024
Figure 1 for SPLAT: A framework for optimised GPU code-generation for SParse reguLar ATtention
Figure 2 for SPLAT: A framework for optimised GPU code-generation for SParse reguLar ATtention
Figure 3 for SPLAT: A framework for optimised GPU code-generation for SParse reguLar ATtention
Figure 4 for SPLAT: A framework for optimised GPU code-generation for SParse reguLar ATtention
Viaarxiv icon

TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs

Add code
Aug 25, 2023
Figure 1 for TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs
Figure 2 for TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs
Figure 3 for TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs
Figure 4 for TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs
Viaarxiv icon

Input-sensitive dense-sparse primitive compositions for GNN acceleration

Add code
Jun 27, 2023
Viaarxiv icon

FLuRKA: Fast fused Low-Rank & Kernel Attention

Add code
Jun 27, 2023
Figure 1 for FLuRKA: Fast fused Low-Rank & Kernel Attention
Figure 2 for FLuRKA: Fast fused Low-Rank & Kernel Attention
Figure 3 for FLuRKA: Fast fused Low-Rank & Kernel Attention
Figure 4 for FLuRKA: Fast fused Low-Rank & Kernel Attention
Viaarxiv icon

Learning Large Graph Property Prediction via Graph Segment Training

Add code
May 21, 2023
Figure 1 for Learning Large Graph Property Prediction via Graph Segment Training
Figure 2 for Learning Large Graph Property Prediction via Graph Segment Training
Figure 3 for Learning Large Graph Property Prediction via Graph Segment Training
Figure 4 for Learning Large Graph Property Prediction via Graph Segment Training
Viaarxiv icon

CoMEt: x86 Cost Model Explanation Framework

Add code
Feb 14, 2023
Viaarxiv icon

GRANITE: A Graph Neural Network Model for Basic Block Throughput Estimation

Add code
Oct 11, 2022
Figure 1 for GRANITE: A Graph Neural Network Model for Basic Block Throughput Estimation
Figure 2 for GRANITE: A Graph Neural Network Model for Basic Block Throughput Estimation
Figure 3 for GRANITE: A Graph Neural Network Model for Basic Block Throughput Estimation
Figure 4 for GRANITE: A Graph Neural Network Model for Basic Block Throughput Estimation
Viaarxiv icon

DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable Surrogates

Add code
Oct 08, 2020
Figure 1 for DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable Surrogates
Figure 2 for DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable Surrogates
Figure 3 for DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable Surrogates
Figure 4 for DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable Surrogates
Viaarxiv icon

Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation using Deep Neural Networks

Add code
Aug 21, 2018
Figure 1 for Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation using Deep Neural Networks
Figure 2 for Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation using Deep Neural Networks
Figure 3 for Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation using Deep Neural Networks
Figure 4 for Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation using Deep Neural Networks
Viaarxiv icon