Picture for Ricardo Bianchini

Ricardo Bianchini

TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms

Add code
Jan 05, 2025
Figure 1 for TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms
Figure 2 for TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms
Figure 3 for TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms
Figure 4 for TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms
Viaarxiv icon

POLCA: Power Oversubscription in LLM Cloud Providers

Add code
Aug 24, 2023
Figure 1 for POLCA: Power Oversubscription in LLM Cloud Providers
Figure 2 for POLCA: Power Oversubscription in LLM Cloud Providers
Figure 3 for POLCA: Power Oversubscription in LLM Cloud Providers
Figure 4 for POLCA: Power Oversubscription in LLM Cloud Providers
Viaarxiv icon