Yaojie Lu

Coupled Variational Reinforcement Learning for Language Model General Reasoning

Dec 14, 2025

AI-Salesman: Towards Reliable Large Language Model Driven Telemarketing

Nov 15, 2025

RMTBench: Benchmarking LLMs Through Multi-Turn User-Centric Role-Playing

Jul 27, 2025

Memorizing is Not Enough: Deep Knowledge Injection Through Reasoning

Apr 01, 2025

ShortV: Efficient Multimodal Large Language Models by Freezing Visual Tokens in Ineffective Layers

Apr 01, 2025

The Devil Is in the Details: Tackling Unimodal Spurious Correlations for Generalizable Multimodal Reward Models

Mar 05, 2025

DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking

Feb 28, 2025

Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch

Feb 24, 2025

Scalable Oversight for Superhuman AI via Recursive Self-Critiquing

Feb 07, 2025

SAISA: Towards Multimodal Large Language Models with Both Training and Inference Efficiency

Feb 04, 2025