Picture for Shu Liu

Shu Liu

The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks

Add code
Feb 12, 2025
Viaarxiv icon

LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!

Add code
Feb 11, 2025
Viaarxiv icon

Adaptive Semantic Prompt Caching with VectorQ

Add code
Feb 06, 2025
Viaarxiv icon

Locality-aware Fair Scheduling in LLM Serving

Add code
Jan 24, 2025
Viaarxiv icon

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Add code
Dec 12, 2024
Viaarxiv icon

MoE-Lightning: High-Throughput MoE Inference on Memory-constrained GPUs

Add code
Nov 18, 2024
Viaarxiv icon

Pie: Pooling CPU Memory for LLM Inference

Add code
Nov 14, 2024
Figure 1 for Pie: Pooling CPU Memory for LLM Inference
Figure 2 for Pie: Pooling CPU Memory for LLM Inference
Figure 3 for Pie: Pooling CPU Memory for LLM Inference
Figure 4 for Pie: Pooling CPU Memory for LLM Inference
Viaarxiv icon

A Natural Primal-Dual Hybrid Gradient Method for Adversarial Neural Network Training on Solving Partial Differential Equations

Add code
Nov 09, 2024
Viaarxiv icon

Text2SQL is Not Enough: Unifying AI and Databases with TAG

Add code
Aug 27, 2024
Figure 1 for Text2SQL is Not Enough: Unifying AI and Databases with TAG
Figure 2 for Text2SQL is Not Enough: Unifying AI and Databases with TAG
Figure 3 for Text2SQL is Not Enough: Unifying AI and Databases with TAG
Figure 4 for Text2SQL is Not Enough: Unifying AI and Databases with TAG
Viaarxiv icon

MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders

Add code
Jul 02, 2024
Viaarxiv icon