Picture for Jiayu Liu

Jiayu Liu

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

Add code
Jun 04, 2026
Viaarxiv icon

SocraticPO: Policy Optimization via Interactive Guidance

Add code
Jun 03, 2026
Viaarxiv icon

Brick-Composer: Using MLLMs for Assembly with Diverse Bricks

Add code
Jun 03, 2026
Viaarxiv icon

$Ψ$-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues

Add code
Jun 01, 2026
Viaarxiv icon

MemGuard: Preventing Memory Contamination in Long-Term Memory-Augmented Large Language Models

Add code
May 27, 2026
Viaarxiv icon

UserHarness: Harnessing User Minds for Stronger Agent Theory-of-Mind

Add code
May 26, 2026
Viaarxiv icon

Advancing Creative Physical Intelligence in Large Multimodal Models

Add code
May 25, 2026
Viaarxiv icon

Multi-Scenario User Profile Construction via Recommendation Lists

Add code
Mar 16, 2026
Viaarxiv icon

Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration?

Add code
Mar 04, 2026
Viaarxiv icon

Step-Level Sparse Autoencoder for Reasoning Process Interpretation

Add code
Mar 03, 2026
Viaarxiv icon