Picture for Yudi Zhang

Yudi Zhang

Self-evolving LLM agents with in-distribution Optimization

Add code
Jun 05, 2026
Viaarxiv icon

DOG-DPO:Dynamic Optimization in Geometry for Safety Alignment

Add code
Jun 04, 2026
Viaarxiv icon

GraphRAG-IRL: Personalized Recommendation with Graph-Grounded Inverse Reinforcement Learning and LLM Re-ranking

Add code
Apr 21, 2026
Viaarxiv icon

ScribbleSense: Generative Scribble-Based Texture Editing with Intent Prediction

Add code
Jan 30, 2026
Viaarxiv icon

C-TLSAN: Content-Enhanced Time-Aware Long- and Short-Term Attention Network for Personalized Recommendation

Add code
Jun 16, 2025
Viaarxiv icon

MiniCPM4: Ultra-Efficient LLMs on End Devices

Add code
Jun 09, 2025
Figure 1 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Figure 2 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Figure 3 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Figure 4 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Viaarxiv icon

Speculative Decoding Meets Quantization: Compatibility Evaluation and Hierarchical Framework Design

Add code
May 29, 2025
Viaarxiv icon

Consistent Image Layout Editing with Diffusion Models

Add code
Mar 09, 2025
Viaarxiv icon

Distill Not Only Data but Also Rewards: Can Smaller Language Models Surpass Larger Ones?

Add code
Feb 26, 2025
Figure 1 for Distill Not Only Data but Also Rewards: Can Smaller Language Models Surpass Larger Ones?
Figure 2 for Distill Not Only Data but Also Rewards: Can Smaller Language Models Surpass Larger Ones?
Figure 3 for Distill Not Only Data but Also Rewards: Can Smaller Language Models Surpass Larger Ones?
Figure 4 for Distill Not Only Data but Also Rewards: Can Smaller Language Models Surpass Larger Ones?
Viaarxiv icon

FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling

Add code
Feb 20, 2025
Viaarxiv icon