Wenbo Su

Read As Human: Compressing Context via Parallelizable Close Reading and Skimming

Feb 02, 2026
Via arXiv

COMI: Coarse-to-fine Context Compression via Marginal Information Gain

Feb 02, 2026
Via arXiv

Dissecting Outlier Dynamics in LLM NVFP4 Pretraining

Feb 02, 2026
Via arXiv

PretrainRL: Alleviating Factuality Hallucination of Large Language Models at the Beginning

Feb 02, 2026
Via arXiv

CoMeT: Collaborative Memory Transformer for Efficient Long Context Modeling

Feb 02, 2026
Via arXiv

Data Distribution Matters: A Data-Centric Perspective on Context Compression for Large Language Model

Feb 02, 2026
Via arXiv

CE-RM: A Pointwise Generative Reward Model Optimized via Two-Stage Rollout and Unified Criteria

Jan 28, 2026
Via arXiv

ShopSimulator: Evaluating and Exploring RL-Driven LLM Agent for Shopping Assistants

Jan 26, 2026
Via arXiv

Logics-STEM: Empowering LLM Reasoning via Failure-Driven Post-Training and Document Knowledge Enhancement

Jan 08, 2026
Via arXiv

One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling

Jan 06, 2026
Via arXiv