Arne Symons

Anda: Unlocking Efficient LLM Inference with a Variable-Length Grouped Activation Data Format

Nov 24, 2024
Via arXiv

MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices

Oct 11, 2024
Via arXiv

SALSA: Simulated Annealing based Loop-Ordering Scheduler for DNN Accelerators

Apr 20, 2023
Via arXiv