Mingyang Song

On-Policy Distillation with Curriculum Turn-level Guidance for Multi-turn Agents

Jun 14, 2026

Memory Beyond Recall: A Dual-Process Cognitive Memory System for Self-Evolving LLM Agents

Jun 08, 2026

SubtleMemory: A Benchmark for Fine-Grained Relational Memory Discrimination in Long-Horizon AI Agents

Jun 04, 2026

Agent Planning Benchmark: A Diagnostic Framework for Planning Capabilities in LLM Agents

Jun 03, 2026

HardMTBench: Stress-Testing Chinese-English Translation on Knowledge-Intensive Domains

May 27, 2026

IFMTBench: A Comprehensive Benchmark for Multilingual Translation Instruction Following

May 27, 2026

Hy-MT2: A Family of Fast, Efficient and Powerful Multilingual Translation Models in the Wild

May 21, 2026

PRISM: Probability Reallocation with In-Span Masking for Knowledge-Sensitive Alignment

Apr 02, 2026

Unifying Group-Relative and Self-Distillation Policy Optimization via Sample Routing

Apr 02, 2026

Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis

Apr 01, 2026