Picture for Yunchu Han

Yunchu Han

Robust DNN Partitioning and Resource Allocation Under Uncertain Inference Time

Add code
Mar 27, 2025
Viaarxiv icon

DVFS-Aware DNN Inference on GPUs: Latency Modeling and Performance Analysis

Add code
Feb 10, 2025
Viaarxiv icon