Picture for Matt Turner

Matt Turner

Characterizing and Optimizing LLM Inference Workloads on CPU-GPU Coupled Architectures

Add code
Apr 16, 2025
Viaarxiv icon