Picture for Zhi Chen

Zhi Chen

Spring

Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity

Add code
Dec 03, 2024
Viaarxiv icon

Track Any Peppers: Weakly Supervised Sweet Pepper Tracking Using VLMs

Add code
Nov 11, 2024
Viaarxiv icon

What Factors Affect Multi-Modal In-Context Learning? An In-Depth Exploration

Add code
Oct 27, 2024
Figure 1 for What Factors Affect Multi-Modal In-Context Learning? An In-Depth Exploration
Figure 2 for What Factors Affect Multi-Modal In-Context Learning? An In-Depth Exploration
Figure 3 for What Factors Affect Multi-Modal In-Context Learning? An In-Depth Exploration
Figure 4 for What Factors Affect Multi-Modal In-Context Learning? An In-Depth Exploration
Viaarxiv icon

KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing

Add code
Oct 24, 2024
Figure 1 for KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing
Figure 2 for KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing
Figure 3 for KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing
Figure 4 for KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing
Viaarxiv icon

Codebook Design and Performance Analysis for Wideband Beamforming in Terahertz Communications

Add code
Oct 22, 2024
Viaarxiv icon

Evaluating Software Development Agents: Patch Patterns, Code Quality, and Issue Complexity in Real-World GitHub Scenarios

Add code
Oct 16, 2024
Viaarxiv icon

TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text

Add code
Oct 10, 2024
Figure 1 for TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text
Figure 2 for TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text
Figure 3 for TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text
Figure 4 for TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text
Viaarxiv icon

TimeCNN: Refining Cross-Variable Interaction on Time Point for Time Series Forecasting

Add code
Oct 07, 2024
Figure 1 for TimeCNN: Refining Cross-Variable Interaction on Time Point for Time Series Forecasting
Figure 2 for TimeCNN: Refining Cross-Variable Interaction on Time Point for Time Series Forecasting
Figure 3 for TimeCNN: Refining Cross-Variable Interaction on Time Point for Time Series Forecasting
Figure 4 for TimeCNN: Refining Cross-Variable Interaction on Time Point for Time Series Forecasting
Viaarxiv icon

Promise and Peril of Collaborative Code Generation Models: Balancing Effectiveness and Memorization

Add code
Sep 18, 2024
Viaarxiv icon

CF-PRNet: Coarse-to-Fine Prototype Refining Network for Point Cloud Completion and Reconstruction

Add code
Sep 13, 2024
Figure 1 for CF-PRNet: Coarse-to-Fine Prototype Refining Network for Point Cloud Completion and Reconstruction
Figure 2 for CF-PRNet: Coarse-to-Fine Prototype Refining Network for Point Cloud Completion and Reconstruction
Figure 3 for CF-PRNet: Coarse-to-Fine Prototype Refining Network for Point Cloud Completion and Reconstruction
Figure 4 for CF-PRNet: Coarse-to-Fine Prototype Refining Network for Point Cloud Completion and Reconstruction
Viaarxiv icon