Picture for Qipeng Guo

Qipeng Guo

Eric

What and When to Distill: Selective Hindsight Distillation for Multi-Turn Agents

Add code
May 19, 2026
Viaarxiv icon

Beyond Mode Collapse: Distribution Matching for Diverse Reasoning

Add code
May 19, 2026
Viaarxiv icon

Synthetic Pre-Pre-Training Improves Language Model Robustness to Noisy Pre-Training Data

Add code
May 11, 2026
Viaarxiv icon

Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization

Add code
Mar 30, 2026
Viaarxiv icon

daVinci-LLM:Towards the Science of Pretraining

Add code
Mar 28, 2026
Viaarxiv icon

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Add code
Mar 26, 2026
Viaarxiv icon

AI Can Learn Scientific Taste

Add code
Mar 15, 2026
Viaarxiv icon

Rethinking Multiple-Choice Questions for RLVR: Unlocking Potential via Distractor Design

Add code
Mar 13, 2026
Viaarxiv icon

InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

Add code
Mar 10, 2026
Viaarxiv icon

Mousse: Rectifying the Geometry of Muon with Curvature-Aware Preconditioning

Add code
Mar 10, 2026
Viaarxiv icon