Picture for Qipeng Guo

Qipeng Guo

Eric

Explicit Multi-head Attention for Inter-head Interaction in Large Language Models

Add code
Jan 27, 2026
Viaarxiv icon

TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization

Add code
Jan 23, 2026
Viaarxiv icon

Timely Machine: Awareness of Time Makes Test-Time Scaling Agentic

Add code
Jan 23, 2026
Viaarxiv icon

Mixing Expert Knowledge: Bring Human Thoughts Back To the Game of Go

Add code
Jan 23, 2026
Viaarxiv icon

Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Alignment

Add code
Jan 20, 2026
Viaarxiv icon

How to Set the Learning Rate for Large-Scale Pre-training?

Add code
Jan 08, 2026
Viaarxiv icon

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs

Add code
Dec 08, 2025
Viaarxiv icon

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Add code
Aug 25, 2025
Figure 1 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Figure 2 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Figure 3 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Figure 4 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Viaarxiv icon

InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling

Add code
Aug 12, 2025
Figure 1 for InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling
Figure 2 for InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling
Figure 3 for InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling
Figure 4 for InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling
Viaarxiv icon

IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards

Add code
Aug 06, 2025
Viaarxiv icon