Picture for Jianguo Li

Jianguo Li

Sherman

CAKE: Cascading and Adaptive KV Cache Eviction with Layer Preferences

Add code
Mar 16, 2025
Viaarxiv icon

High-Resolution Uplink Sensing in Millimeter-Wave ISAC Systems

Add code
Mar 13, 2025
Viaarxiv icon

Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs

Add code
Mar 07, 2025
Figure 1 for Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs
Figure 2 for Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs
Figure 3 for Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs
Figure 4 for Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs
Viaarxiv icon

ACCORD: Alleviating Concept Coupling through Dependence Regularization for Text-to-Image Diffusion Personalization

Add code
Mar 03, 2025
Viaarxiv icon

PASemiQA: Plan-Assisted Agent for Question Answering on Semi-Structured Data with Text and Relational Information

Add code
Feb 28, 2025
Viaarxiv icon

CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models

Add code
Oct 09, 2024
Viaarxiv icon

Rodimus*: Breaking the Accuracy-Efficiency Trade-Off with Efficient Attentions

Add code
Oct 09, 2024
Viaarxiv icon

E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning

Add code
Sep 10, 2024
Viaarxiv icon

GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding

Add code
Sep 06, 2024
Viaarxiv icon

SQLfuse: Enhancing Text-to-SQL Performance through Comprehensive LLM Synergy

Add code
Jul 19, 2024
Viaarxiv icon