Picture for Hui Su

Hui Su

Learning to Self-Verify Makes Language Models Better Reasoners

Add code
Feb 07, 2026
Viaarxiv icon

ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training

Add code
Feb 06, 2026
Viaarxiv icon

$V_0$: A Generalist Value Model for Any Policy at State Zero

Add code
Feb 03, 2026
Viaarxiv icon

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Add code
Feb 03, 2026
Viaarxiv icon

MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning

Add code
Jan 29, 2026
Viaarxiv icon

LongCat-Flash-Thinking-2601 Technical Report

Add code
Jan 23, 2026
Viaarxiv icon

VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications

Add code
Sep 30, 2025
Viaarxiv icon

SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling

Add code
Jun 04, 2025
Viaarxiv icon

LLM as Effective Streaming Processor: Bridging Streaming-Batch Mismatches with Group Position Encoding

Add code
May 22, 2025
Viaarxiv icon

Investigating and Enhancing the Robustness of Large Multimodal Models Against Temporal Inconsistency

Add code
May 20, 2025
Viaarxiv icon