Haoran Qiu

TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms

Jan 05, 2025

Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction

Apr 12, 2024

Decision Transformer as a Foundation Model for Partially Observable Continuous Control

Apr 03, 2024