Yanqi Zhang

Unifying KV Cache Compression for Large Language Models with LeanKV

Dec 04, 2024
Via arXiv

ChipExpert: The Open-Source Integrated-Circuit-Design-Specific Large Language Model

Jul 26, 2024
Via arXiv

Analytically-Driven Resource Management for Cloud-Native Microservices

Jan 05, 2024
Via arXiv

Sinan: Data-Driven, QoS-Aware Cluster Management for Microservices

May 27, 2021
Via arXiv

Leveraging Deep Learning to Improve the Performance Predictability of Cloud Microservices

May 02, 2019
Via arXiv