Situo Zhang

Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity

Dec 03, 2024
Via arXiv

Fast and High-Quality Auto-Regressive Speech Synthesis via Speculative Decoding

Oct 29, 2024
Via arXiv

MobA: A Two-Level Agent System for Efficient Mobile Task Automation

Oct 17, 2024
Via arXiv

Rejection Improves Reliability: Training LLMs to Refuse Unknown Questions Using RL from Knowledge Feedback

Apr 07, 2024
Via arXiv

Multi: Multimodal Understanding Leaderboard with Text and Images

Feb 05, 2024
Via arXiv

Large Language Model Is Semi-Parametric Reinforcement Learning Agent

Jun 09, 2023
Via arXiv