Phitchaya Mangpo Phothilimthana

Accelerating Retrieval-Augmented Language Model Serving with Speculation

Jan 25, 2024
Via arXiv

TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs

Aug 25, 2023
Via arXiv

Learning Large Graph Property Prediction via Graph Segment Training

May 21, 2023
Via arXiv

Optimizing Memory Mapping Using Deep Reinforcement Learning

May 11, 2023
Via arXiv

GRANITE: A Graph Neural Network Model for Basic Block Throughput Estimation

Oct 11, 2022
Via arXiv

$α$NAS: Neural Architecture Search using Property Guided Synthesis

May 08, 2022
Via arXiv

A Transferable Approach for Partitioning Machine Learning Models on Multi-Chip-Modules

Dec 07, 2021
Via arXiv

A Learned Performance Model for the Tensor Processing Unit

Aug 03, 2020
Via arXiv