Picture for Peng Ye

Peng Ye

LLMRouterBench: A Massive Benchmark and Unified Framework for LLM Routing

Add code
Jan 12, 2026
Viaarxiv icon

Beyond Gemini-3-Pro: Revisiting LLM Routing and Aggregation at Scale

Add code
Jan 04, 2026
Viaarxiv icon

SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence

Add code
Dec 30, 2025
Viaarxiv icon

EGM: Efficiently Learning General Motion Tracking Policy for High Dynamic Humanoid Whole-Body Control

Add code
Dec 22, 2025
Viaarxiv icon

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

Add code
Dec 18, 2025
Viaarxiv icon

M-GRPO: Stabilizing Self-Supervised Reinforcement Learning for Large Language Models with Momentum-Anchored Policy Optimization

Add code
Dec 15, 2025
Viaarxiv icon

P1: Mastering Physics Olympiads with Reinforcement Learning

Add code
Nov 17, 2025
Viaarxiv icon

Private Online Learning against an Adaptive Adversary: Realizable and Agnostic Settings

Add code
Oct 01, 2025
Viaarxiv icon

Learning Compact Representations of LLM Abilities via Item Response Theory

Add code
Oct 01, 2025
Figure 1 for Learning Compact Representations of LLM Abilities via Item Response Theory
Figure 2 for Learning Compact Representations of LLM Abilities via Item Response Theory
Figure 3 for Learning Compact Representations of LLM Abilities via Item Response Theory
Figure 4 for Learning Compact Representations of LLM Abilities via Item Response Theory
Viaarxiv icon

Private Realizable-to-Agnostic Transformation with Near-Optimal Sample Complexity

Add code
Oct 01, 2025
Viaarxiv icon